Skip to content

Site Reliability Engineer
Company | SentinelOne |
---|
Location | United States |
---|
Salary | $96000 – $132000 |
---|
Type | Full-Time |
---|
Degrees | |
---|
Experience Level | Junior, Mid Level |
---|
Requirements
- 1-2+ years of experience in running operations at a large scale for a SaaS product
- 1-2+ years of production experience with orchestration systems like Kubernetes, Nomad, or Mesos
- Python / Golang / Java / Ruby as main scripting languages (we use Python)
- Familiarity with running Java and JavaScript applications including building and deploying
- AWS experience and familiarity with other platforms like GCE and Azure
- Experience using Infrastructure as Code to set up cloud-native services
- Familiarity with CI and practical delivery using Jenkins, GHA, ArgoCD, etc. or similar; familiarity with deployment strategies like blue-green, rolling deploys, canary deploys, and best practices around deployment automation
- Keeping a pulse on the latest SRE trends
Responsibilities
- Drive continuous deployment
- Command production incidents and drive the post-mortem process
- Partner with product engineering teams to improve product quality and reliability
- Simplify and automate operational tasks
- Eliminate bottlenecks in SentinelOne infrastructure and services
- Build tools to improve operations
Preferred Qualifications
- Ability to work in a diverse and distributed team is highly desired
- Self-starter attitude with a passion and motivation for new technologies and empathy for legacy systems
- Ability to learn quickly and navigate through unfamiliar programming languages, systems, and processes
- Curiosity, desire to learn and improve, and great communication skills
- Prior product-building experience is optional but strongly desired