Staff Software Engineer – Compute Services

Company: CoreWeave
Location: Livingston, NJ, USA; New York, NY, USA; Bellevue, WA, USA; Sunnyvale, CA, USA
Salary: $230,000 – $275,000
Type: Full-Time
Experience Level: Senior, Expert or higher

Requirements

  • 7+ years of experience in Software Engineering, Site Reliability Engineering, DevOps, or a related field.
  • Strong expertise in Kubernetes, containerization, and microservices architectures.
  • Expertise in monitoring and observability tools such as Prometheus, Grafana, Datadog, or Splunk.
  • Strong scripting and automation skills using Python, Go, Bash, or similar languages.
  • Strong understanding of Linux fundamentals and principles.
  • Deep understanding of networking, security best practices, and compliance frameworks (SOC 2, ISO 27001, etc.).
  • Proven track record of leading incident management and post-mortem analysis.
  • Excellent problem-solving, analytical, and communication skills.

Responsibilities

  • Lead and mentor engineers, fostering a culture of collaboration and continuous improvement.
  • Design, implement, and maintain highly available, scalable, and secure computing environments in Kubernetes.
  • Develop and refine monitoring, alerting, and observability solutions to enhance system reliability and performance.
  • Manage production clusters and ensure development teams follow best practices for application deployment and lifecycle management.
  • Develop applications and Kubernetes operators in Go.
  • Implement and promote proper GitOps management for applications.
  • Support the deployment and operations of CoreWeave’s Compute Infrastructure layer.
  • Develop tooling and systems that bridge the gap between Linux, networking, and Kubernetes.
  • Develop software applications in Go.

Preferred Qualifications

  • Knowledge of distributed systems, databases, and caching strategies.
  • Experience working with large-scale computing clusters.