Posted in

Infrastructure Engineer – Supercomputing

Infrastructure Engineer – Supercomputing

CompanyxAI
LocationPalo Alto, CA, USA, San Francisco, CA, USA
Salary$180000 – $370000
TypeFull-Time
Degrees
Experience LevelJunior, Mid Level

Requirements

  • Strong communication skills
  • Experience with Kubernetes
  • Experience with Pulumi
  • Proficiency in Rust and Go
  • Knowledge of Flux / ArgoCD
  • Experience in operating GPU supercomputing clusters
  • Ability to implement IaC best practices
  • Experience with security best practices for internal researchers and live external traffic

Responsibilities

  • Operate GPU supercomputing clusters for AI training and serving production models
  • Implement IaC best practices
  • Enhance deployment pipelines
  • Ensure robust, secure service delivery across production environments
  • Work with both on-premise clusters and cloud providers
  • Help with security best practices for internal researchers and live external traffic

Preferred Qualifications

  • Writing scalable and highly available containerized applications in Rust
  • Managing compute fleets with Pulumi, Terraform, Ansible, or other stateful automation libraries