Software Engineer – Deploy Observability Infra
Company | Notion |
---|---|
Location | San Francisco, CA, USA; New York, NY, USA |
Salary | $150,000 – $250,000 |
Type | Full-Time |
Degrees | |
Experience Level | Junior, Mid Level |
Requirements
- Strong infrastructure knowledge: Experience with deploy systems, observability platforms (OpenTelemetry infrastructure), and monitoring tools. Understanding of CI/CD pipelines and GitHub Actions.
- Communication skills: Must be responsive and comfortable helping engineers with their day-to-day development experience, particularly in a busy Slack environment.
- Multitasker and present: You welcome interruptions, especially when an issue is blocking engineers. You switch between your project work and day-to-day observability and deploy challenges.
- Interest in learning how things work: You love to learn how the tools you use work at a fundamental level. You get excited at the idea of digging into third-party source code to understand how it works and find new ways to use it.
- Pragmatic and business-oriented: You care about business impact and prioritize projects accordingly. You don’t just go after cool stuff—you understand the balance between craft, speed, and the bottom line.
Responsibilities
- Build and enhance deploy automation: Create and improve systems for automated, gradual, and safe deployment of changes across Notion’s infrastructure.
- Develop observability solutions: Design and implement monitoring tools and dashboards that help engineers understand application performance and health.
- Optimize deployment pipelines: Work on making deployments faster, more reliable, and more predictable while maintaining safety guardrails.
- Create developer tooling: Build internal tools that help engineers monitor their changes, debug issues, and understand system behavior.
- Improve incident response: Enhance our observability stack to enable quick identification and resolution of production issues.
- Collaborate across teams: Work closely with product engineering teams to understand their observability needs and improve the deploy experience.
Preferred Qualifications
- TypeScript: Familiarity with TypeScript, especially with ways to keep it running smoothly in a large codebase.
- AWS ECS: Familiarity with Amazon Elastic Container Service (ECS) for container orchestration and deployment.
- Gradual rollout systems: Experience with canary deployments and with implementing safe, gradual rollout strategies for production changes.