Senior Site Reliability Engineer
Company | BenchSci |
---|---|
Location | Toronto, ON, Canada |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- 5+ years of experience working as a Senior Site Reliability Engineer preferred
- Expert knowledge of incident response, observability, and reliability tools and techniques in a cloud-native environment (Google Cloud is preferred, but AWS experience is also valuable)
- Experience with cloud design patterns (Google Cloud is considered an asset) and developing specialized application stacks on cloud services (Python backend, TypeScript frontend)
- Experience working in Python and JavaScript/TypeScript codebases
- Eagerness to share your own ideas, and openness to those of others
Responsibilities
- Build, deploy, and maintain observability platforms to enable teams to self-serve their metrics gathering and dash-boarding needs
- Lead software and system design initiatives by leveraging cloud-native design patterns and injecting your cloud expertise into the entire development lifecycle
- Partner with other teams to iterate on and improve BenchSci’s Incident Response processes
- Help other teams to respond, mitigate, and remediate production incidents
- Help other teams write effective post-mortems and improve our reliability culture and processes
- Work with your team, Staff Engineers, and Engineering Managers to help promote SRE best practices
- Help reduce toil and improve developer productivity by automating our team and business processes
- Partner with engineering and product stakeholders and other cross-functional teams to devise and refine requirements
- Communicate cross-cutting decisions to all potentially impacted teams
Preferred Qualifications
- 5+ years of experience working as a Senior Site Reliability Engineer preferred