Posted in

Compute Site Reliability Engineer – SRE – Kubernetes

Compute Site Reliability Engineer – SRE – Kubernetes

CompanyApple
LocationSeattle, WA, USA
Salary$135400 – $250600
TypeFull-Time
DegreesBachelor’s
Experience LevelMid Level, Senior

Requirements

  • Bachelor’s Degree in Computer Science, an engineering-related field, or equivalent related experience.
  • 3+ years in a Site Reliability Engineering, DevOps, or Infrastructure focused role
  • Basic understanding of Kubernetes architecture, including Pods, Deployments, Services, and ConfigMaps.
  • Familiarity with Linux systems administration and command-line tools.
  • Experience with scripting languages like Bash, Python, or Go.
  • Knowledge of monitoring tools such as Prometheus, Grafana, or similar.
  • Exposure to CI/CD pipelines and DevOps practices.
  • Awareness of cloud platforms (AWS, GCP, or Azure) and containerization.
  • Strong problem-solving skills and a willingness to learn new technologies.
  • Outstanding organizational and communications skills

Responsibilities

  • Operate, monitor, and triage all aspects of our production and non-production environments.
  • Design, build and implement innovative solutions for previous, present and future issues.
  • Prepare alert handling procedures, runbooks, and collaborate with other SRE teams.
  • Participate in on-call rotations to troubleshoot and resolve production issues, minimizing downtime.
  • Automate deployment and orchestration of services into the cloud environment as well as other routine processes.
  • Actively participate in capacity planning, scale testing, and disaster recovery exercises.

Preferred Qualifications

  • Strong verbal and written communication skills
  • Automation advocate – you truly believe in removing operational load via software.
  • Familiarity with Infrastructure as Code (IaC) tools like Puppet
  • A strong sense of ownership. At the same time, you’re a great teammate who communicates clearly and transparently – Self-motivated, inquisitive, and always looking to learn more.
  • Experience managing, scaling, and troubleshooting Java and Go applications
  • CNCF Kubernetes Administration certification