Staff Site Reliability Engineer

Company: DAT Freight & Analytics
Location: Seattle, WA, USA; Beaverton, OR, USA; Denver, CO, USA
Salary: $155,000 – $155,000
Type: Full-Time
Degrees:
Experience Level: Expert or higher

Requirements

  • Strong leadership and mentoring abilities, especially with SRE or Platform Engineering/Infrastructure teams.
  • 10+ years of total industry experience.
  • 3+ years of software engineering experience (JavaScript, Python, Go, Java/Kotlin, C++, etc.).
  • Extensive experience with modern observability tools (Datadog preferred).
  • Extensive experience with cloud platforms (preferably AWS).
  • Demonstrated success in leading large technical initiatives, including design, project management and gaining executive buy-in.
  • Proven experience modernizing legacy code and infrastructure.
  • Ability to work closely with peer teams, platform/software architects and management to drive key reliability improvements.
  • Deep understanding of cloud infrastructure, automation, and best practices for reliability.

Responsibilities

  • Collaborate with platform architects and management to ensure reliability targets are met.
  • Advise engineering teams on best practices for measuring reliability and uptime.
  • Assist with and respond to critical engineering incidents.
  • Lead and mentor SREs to improve their engineering skills.
  • Provide technical guidance and best practices for use of cloud infrastructure and tooling. Be a driver for Infrastructure-as-Code within the platform.
  • Spearhead major reliability-focused initiatives and projects.
  • Help optimize our work to be customer-focused. Continually seek feedback from our customers on how we can improve.
  • Migrate legacy systems to modern, scalable cloud environments.
  • Help develop and drive a culture of continuous improvement with the Platform Engineering and Software Engineering groups.
  • Participate in an on-call rotation and occasionally act as Incident Commander.

Preferred Qualifications

  • Experience with our tools (Kubernetes, ArgoCD, Terraform, GitHub Actions) is a plus.