Site Reliability Engineer

Company: SentinelOne
Location: United States
Salary: $96,000 – $132,000
Type: Full-Time
Experience Level: Junior, Mid Level

Requirements

  • 1-2+ years of experience running large-scale operations for a SaaS product
  • 1-2+ years of production experience with orchestration systems like Kubernetes, Nomad, or Mesos
  • Python / Golang / Java / Ruby as main scripting languages (we use Python)
  • Familiarity with building, deploying, and running Java and JavaScript applications
  • AWS experience and familiarity with other platforms like GCE and Azure
  • Experience using Infrastructure as Code to set up cloud-native services
  • Familiarity with CI and practical delivery using Jenkins, GitHub Actions (GHA), ArgoCD, or similar tools; familiarity with deployment strategies such as blue-green, rolling, and canary deploys, and with best practices around deployment automation
  • Keeping a pulse on the latest SRE trends

Responsibilities

  • Drive continuous deployment
  • Command production incidents and drive the post-mortem process
  • Partner with product engineering teams to improve product quality and reliability
  • Simplify and automate operational tasks
  • Eliminate bottlenecks in SentinelOne infrastructure and services
  • Build tools to improve operations

Preferred Qualifications

  • Ability to work in a diverse and distributed team is highly desired
  • Self-starter attitude with a passion and motivation for new technologies and empathy for legacy systems
  • Ability to learn quickly and navigate through unfamiliar programming languages, systems, and processes
  • Curiosity, desire to learn and improve, and great communication skills
  • Prior product-building experience is optional but strongly desired