Senior Manager – Devops
Company | 2K |
---|---|
Location | Seattle, WA, USA, Austin, TX, USA, Los Angeles, CA, USA, Novato, CA, USA |
Salary | $155800 – $230560 |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- 5+ years experience in the SRE, Devops, or system engineering fields
- 3+ years coaching and mentoring senior technical talent
- 5+ years experience using Observability tools like Datadog, New Relic, etc.
- 5+ years experience working with CICD tools like GHA, Jenkins, ArgoCD, etc.
- Proven leadership skills with a focus on adapting and changing as the organization or environment requires
- Deeply knowledgeable about modern infrastructure management tools and processes
- Relentless focus on availability, security, and performance with a customer-centric mindset
- Software engineering fundamentals in system design, architecture and tooling
- Fluent with at least one modern programming language and a good understanding of code management principles
- Experience architecting and maintaining large scale distributed infrastructure that spans on premise and cloud datacenters
- Has architected microservices using virtualization or containerization
- Experience managing stakeholder relationships, communicating with and addressing the company’s executive leadership team.
Responsibilities
- Develop, manage and drive the Devops and Observability teams strategic direction to enable engineering teams to ship and release reliably, securely and efficiently.
- Lead, and scale Devops and Observability teams to focus on root cause analysis, pattern identification and continuous improvement in order to optimize application performance, resilience and reliability.
- Develop and implement SRE best practices and techniques including detecting and responding to issues, and restoring applications/services across business domains.
- Build metrics-driven approach to ensure the stability and security of enterprise cloud services including SLIs, SLOs, and SLAs.
- Establish and supervise OKRs to measure overall progress for the teams.
- Architect and operate highly resilient systems in a multi-cloud global environment serving game and consumer services.
- Partner with cross functional teams to support and improve our overall security posture, Patch Management, Disaster Recovery and Business Continuity efforts.
- Establish working processes with software engineering teams to support our innovation efforts.
- Define and implement standards that will affect systems, services and multiple software environments.
Preferred Qualifications
- Experience building a CICD platform and/or Observability Platform from start to finish.
- Experience with developing software for highly scalable/distributed systems
- Experience using IaC for highly elastic workloads
- Experience in gaming or similar industries combining large scale internet facing systems with software development and entertainment services culture
- Familiarity with common source code repositories and infrastructure as code methodologies