Posted in

Software Engineer – Deploy Observability Infra

Software Engineer – Deploy Observability Infra

CompanyNotion
LocationSan Francisco, CA, USA, New York, NY, USA
Salary$150000 – $250000
TypeFull-Time
Degrees
Experience LevelJunior, Mid Level

Requirements

  • Strong infrastructure knowledge: Experience with deploy systems, observability platforms (Opentelemetry infrastructure), and monitoring tools. Understanding of CI/CD pipelines and GitHub Actions.
  • Communication skills: Must be responsive and comfortable helping engineers with their day-to-day development experience, particularly in a busy Slack environment.
  • Multitasker and present: you like interrupts especially if this is blocking engineers. You switch between your project work and day to day observability and deploy challenges.
  • Interest in learning how things work: you love to learn how the tools you use work at a fundamental level. You get excited at the idea of digging into third-party source code to understand how they work and find new ways to use them.
  • Pragmatic and business-oriented: You care about business impact and prioritize projects accordingly. You don’t just go after cool stuff—you understand the balance between craft, speed, and the bottom line.

Responsibilities

  • Build and enhance deploy automation: Create and improve systems for automated, gradual, and safe deployment of changes across Notion’s infrastructure.
  • Develop observability solutions: Design and implement monitoring tools and dashboards that help engineers understand application performance and health.
  • Optimize deployment pipelines: Work on making deployments faster, more reliable, and more predictable while maintaining safety guardrails.
  • Create developer tooling: Build internal tools that help engineers monitor their changes, debug issues, and understand system behavior.
  • Improve incident response: Enhance our observability stack to enable quick identification and resolution of production issues.
  • Collaborate across teams: Work closely with product engineering teams to understand their observability needs and improve deploy experience.

Preferred Qualifications

  • Familiarity with TypeScript, especially in ways to keep it running smoothly in a large codebase.
  • AWS ECS experience. Familiarity with Amazon Elastic Container Service (ECS) for container orchestration and deployment.
  • Having familiarity with gradual rollout systems. Experience with canary deployments and implementing safe, gradual rollout strategies for production changes.