Posted in

Production Engineer

Production Engineer

CompanyNominal
LocationNew York, NY, USA
Salary$120000 – $200000
TypeFull-Time
Degrees
Experience LevelSenior, Expert or higher

Requirements

  • 7+ years of experience in software engineering with a strong focus on production systems and distributed architectures
  • Experience working on distributed systems at scale
  • Hands-on experience with Kafka/Redpanda, PostgreSQL or other SQL databases, MongoDB/NoSQL databases, Clickhouse or other OLAP databases
  • Deep understanding of release automation, CI/CD, and code lifecycle management
  • Familiarity with gRPC and experience building shared infrastructure components like middleware
  • A systems mindset—you understand the ripple effects of a single bug and know how to design to prevent them

Responsibilities

  • Drive reliability and observability improvements across large-scale distributed systems
  • Serve as a force multiplier across all engineering teams by reducing downtime, improving tooling, and freeing up senior engineers from firefighting
  • Own and evolve our incident review process, leading postmortems and embedding learnings into tools, practices, and culture across the company
  • Collaborate with teams to improve release hygiene, including: Automating release gating (e.g., ensuring code bakes in staging for appropriate windows), preventing code from stagnating in staging environments, and implementing pre-prod automated test pipelines to catch issues early
  • Build and maintain Nominal’s gRPC middleware to ensure safe, observable, and performant service communication
  • Improve alerting, debugging, and monitoring to ensure production health and rapid root cause analysis

Preferred Qualifications

  • Experience working on distributed systems at scale
  • Hands-on experience with Kafka/Redpanda, PostgreSQL or other SQL databases, MongoDB/NoSQL databases, Clickhouse or other OLAP databases
  • Deep understanding of release automation, CI/CD, and code lifecycle management
  • Familiarity with gRPC and experience building shared infrastructure components like middleware