Posted in

Lead Cloud Site Reliability Engineer

Lead Cloud Site Reliability Engineer

CompanyLSEG
LocationSt. Louis, MO, USA
Salary$Not Provided – $Not Provided
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior, Expert or higher

Requirements

  • A Bachelor’s degree in computer science, a related technical field involving software/systems engineering, or equivalent practical experience.
  • Experience with Object Oriented programming languages such as: Java, C#, Python, or Go.
  • Experience with Unix/Linux and Windows operating systems.
  • Hands on Experience with one of the following cloud platforms: Azure, AWS, or GCP.

Responsibilities

  • Maintain Service Level Objectives for the systems they own.
  • Constantly measure and improve availability, latency, and overall system health.
  • Write automation to scale systems sustainably, prevent service issues, or quickly recover service when they occur.
  • Partner with development teams to improve system reliability, observability, and release velocity.
  • Participate in on-call rotations, incident response, postmortems, and root cause analysis and resolution.
  • Advocate for strong engineering practices that allow building, deploying, and running scalable, reliable, and performant services.
  • Enable Cloud migration working with foundation and migration teams, performing architectural reviews, operational acceptable testing, and configuring Datadog dashboards and metrics.
  • Be part of a continuous learning and development culture.

Preferred Qualifications

  • Minimum 8-10 years in the industry
  • Experience on DevOps concepts and way of working
  • Experience with algorithms and data structures.
  • Experience in Observability practices with logging, metrics, tracing, and alerting.
  • Experience with Infrastructure as Code.
  • Understanding of identity and access management, and application security.