Infrastructure Engineer – Supercomputing
Company | xAI |
---|---|
Location | Palo Alto, CA, USA, San Francisco, CA, USA |
Salary | $180000 – $370000 |
Type | Full-Time |
Degrees | |
Experience Level | Junior, Mid Level |
Requirements
- Strong communication skills
- Experience with Kubernetes
- Experience with Pulumi
- Proficiency in Rust and Go
- Knowledge of Flux / ArgoCD
- Experience in operating GPU supercomputing clusters
- Ability to implement IaC best practices
- Experience with security best practices for internal researchers and live external traffic
Responsibilities
- Operate GPU supercomputing clusters for AI training and serving production models
- Implement IaC best practices
- Enhance deployment pipelines
- Ensure robust, secure service delivery across production environments
- Work with both on-premise clusters and cloud providers
- Help with security best practices for internal researchers and live external traffic
Preferred Qualifications
- Writing scalable and highly available containerized applications in Rust
- Managing compute fleets with Pulumi, Terraform, Ansible, or other stateful automation libraries