Posted in

Site Reliability Engineer Lead

Site Reliability Engineer Lead

CompanyPDQ
LocationWashington, USA, Texas, USA, Florida, USA, Waterbury, CT, USA, Georgia, USA, Arizona, USA, Concord, NH, USA, Tennessee, USA, Virginia, USA, Arkansas, USA, Minnesota, USA, Colorado, USA, Utah, USA, Kentucky, USA, Maryland, USA, Wisconsin, USA, North Carolina, USA, Oklahoma, USA, Montebello, CA, USA, Missouri, USA, Michigan, USA, Illinois, USA, United States
Salary$Not Provided – $Not Provided
TypeFull-Time
Degrees
Experience LevelSenior

Requirements

  • 5+ years of experience in SRE, DevOps, or Infrastructure Engineering, with at least 2+ years in a lead or strategic role.
  • Proven experience scaling observability platforms and driving SRE principles org-wide.
  • Deep experience with Prometheus, PromQL, Grafana, and ideally GroundCover.
  • Strong familiarity with Google Cloud Platform (GCP) or similar cloud environments.
  • A track record of creating robust incident response and postmortem practices.
  • The ability to plan for scale, reduce toil, and prioritize reliability as a shared responsibility across engineering.
  • Excellent collaboration and communication skills — you can work across teams and influence without authority.

Responsibilities

  • Design, implement, and maintain observability and monitoring systems that ensure application stability, performance, and scale.
  • Establish and own service level objectives (SLOs), SLIs, and SLAs across key systems.
  • Collaborate with engineering leaders to develop scalable, proactive monitoring and alerting for new and existing features.
  • Drive incident management best practices — tooling, runbooks, on-call processes, incident response coordination, and executive communication.
  • Lead synthetic testing and load testing initiatives to ensure production scale and stability.
  • Advocate for performance, reliability, and operational excellence across the engineering org.
  • Mentor engineers and influence architectural decisions related to system resiliency and uptime.

Preferred Qualifications

    No preferred qualifications provided.