Posted in

Staff Backend Engineer – Observability

Staff Backend Engineer – Observability

CompanyCoupang
LocationSeattle, WA, USA
Salary$159000 – $324000
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior

Requirements

  • Bachelor’s Degree in Computer Science, Engineering, or a related technical field.
  • Strong experience in implementing and managing observability solutions in large-scale, complex environments.
  • Deep knowledge of monitoring, alerting, and logging systems and tools, such as Prometheus, Grafana, Elastic Stack, Datadog, or New Relic.
  • Familiarity with distributed tracing technologies, such as Jaeger or Zipkin.
  • Experience with cloud-based infrastructure, including AWS, Azure, or Google Cloud Platform.
  • Strong understanding of DevOps and SRE practices, including continuous integration, continuous delivery, and infrastructure as code (IaC).
  • Proficiency in scripting languages, such as Python, Bash, or Ruby.
  • Excellent communication and collaboration skills, with the ability to work with teams across different functions and technical domains.
  • Strong problem-solving and analytical skills, with a focus on data-driven decision-making.
  • A proven track record of leading and delivering successful observability projects and initiatives.

Responsibilities

  • Design, implement, and maintain observability solutions such as monitoring, alerting, logging, and tracing across various platforms, applications, and infrastructure.
  • Collaborate with cross-functional teams, including software engineers, SREs, and infrastructure teams, to identify and define observability requirements.
  • Develop and implement best practices for creating and maintaining effective monitoring, alerting, and telemetry systems.
  • Evaluate and recommend industry-leading observability tools and technologies to improve system visibility and reliability.
  • Define and track key performance indicators (KPIs) and service-level objectives (SLOs) related to system availability, performance, and reliability.
  • Assist in the troubleshooting and resolution of complex incidents and problems by analyzing data from observability tools.
  • Provide guidance and mentorship to other engineers on observability principles, practices, and tools.
  • Conduct ongoing evaluations of observability systems and identify opportunities for improvements and optimizations.
  • Drive the standardization and simplification of observability processes, tools, and frameworks across the organization.
  • Contribute to the development of training materials, documentation, and runbooks for observability systems and practices.

Preferred Qualifications

  • Experience with containerization and orchestration technologies, such as Docker and Kubernetes.
  • Familiarity with application performance management (APM) tools, such as Dynatrace or AppDynamics.
  • Professional certifications in cloud platforms, monitoring tools, or related technologies.