Site Reliability Engineer

1-2+ years of experience in running operations at a large scale for a SaaS product
1-2+ years of production experience with orchestration systems like Kubernetes, Nomad, or Mesos
Python / Golang / Java / Ruby as main scripting languages (we use Python)
Familiarity with running Java and JavaScript applications including building and deploying
AWS experience and familiarity with other platforms like GCE and Azure
Experience using Infrastructure as Code to set up cloud-native services
Familiarity with CI and practical delivery using Jenkins, GHA, ArgoCD, etc. or similar; familiarity with deployment strategies like blue-green, rolling deploys, canary deploys, and best practices around deployment automation
Keeping a pulse on the latest SRE trends

Drive continuous deployment
Command production incidents and drive the post-mortem process
Partner with product engineering teams to improve product quality and reliability
Simplify and automate operational tasks
Eliminate bottlenecks in SentinelOne infrastructure and services
Build tools to improve operations

Ability to work in a diverse and distributed team is highly desired
Self-starter attitude with a passion and motivation for new technologies and empathy for legacy systems
Ability to learn quickly and navigate through unfamiliar programming languages, systems, and processes
Curiosity, desire to learn and improve, and great communication skills
Prior product-building experience is optional but strongly desired