Staff Software Engineer – Compute Services
Company | CoreWeave |
---|---|
Location | Livingston, NJ, USA; New York, NY, USA; Bellevue, WA, USA; Sunnyvale, CA, USA |
Salary | $230,000 – $275,000 |
Type | Full-Time |
Degrees | |
Experience Level | Senior, Expert or higher |
Requirements
- 7+ years of experience in Software Engineering, Site Reliability Engineering, DevOps, or a related field.
- Strong expertise in Kubernetes, containerization, and microservices architectures.
- Expertise in monitoring and observability tools such as Prometheus, Grafana, Datadog, or Splunk.
- Strong scripting and automation skills using Python, Go, Bash, or similar languages.
- Strong understanding of Linux fundamentals and principles.
- Deep understanding of networking, security best practices, and compliance frameworks (SOC 2, ISO 27001, etc.).
- Proven track record of leading incident management and post-mortem analysis.
- Excellent problem-solving, analytical, and communication skills.
Responsibilities
- Lead and mentor engineers, fostering a culture of collaboration and continuous improvement.
- Design, implement, and maintain highly available, scalable, and secure computing environments in Kubernetes.
- Develop and refine monitoring, alerting, and observability solutions to enhance system reliability and performance.
- Manage production clusters and ensure development teams follow best practices for application deployment and lifecycle management.
- Develop applications and Kubernetes operators in Go.
- Implement and promote proper GitOps management for applications.
- Support the deployment and operations of CoreWeave’s Compute Infrastructure layer.
- Develop tooling and systems that bridge the gap between Linux, networking, and Kubernetes.
- Develop software applications in Go.
Preferred Qualifications
- Knowledge of distributed systems, databases, and caching strategies.
- Experience working with large-scale computing clusters.