Senior Site Reliability Engineer

5+ years of experience working as a Senior Site Reliability Engineer preferred
Expert knowledge of incident response, observability, and reliability tools and techniques in a cloud-native environment (Google Cloud is preferred, but AWS experience is also valuable)
Experience with cloud design patterns (Google Cloud is considered an asset) and developing specialized application stacks on cloud services (Python backend, TypeScript frontend)
Experience working in Python and JavaScript/TypeScript codebases
Eagerness to share your own ideas, and openness to those of others

Build, deploy, and maintain observability platforms to enable teams to self-serve their metrics gathering and dash-boarding needs
Lead software and system design initiatives by leveraging cloud-native design patterns and injecting your cloud expertise into the entire development lifecycle
Partner with other teams to iterate on and improve BenchSci’s Incident Response processes
Help other teams to respond, mitigate, and remediate production incidents
Help other teams write effective post-mortems and improve our reliability culture and processes
Work with your team, Staff Engineers, and Engineering Managers to help promote SRE best practices
Help reduce toil and improve developer productivity by automating our team and business processes
Partner with engineering and product stakeholders and other cross-functional teams to devise and refine requirements
Communicate cross-cutting decisions to all potentially impacted teams