Software Engineer – Senior / Staff / Principal – Cloud Infrastructure
Company | Genesis Therapeutics |
---|---|
Location | Burlingame, CA, USA |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- 5+ years of experience building and maintaining cloud infrastructure at scale, e.g. within AWS or GCP
- Proficient with Python, Bash, Terraform, and Kubernetes
- Ideally, experience building and maintaining compute clusters running distributed ML training jobs with 1,000+ GPUs
Responsibilities
- Work on our infrastructure team to maintain and grow our multi-cloud compute infrastructure that supports our ML model training, computational chemistry research, and ongoing drug discovery efforts
- Build out our configuration and procedures for monitoring, resource allocation, and deployment automation, as we continue to grow our autoscaling compute clusters to handle larger workloads
- Work on orchestration scheduling framework to increase our execution throughput, reliability, and compute utilization across heterogeneous pipelines
Preferred Qualifications
- Nice to have: hands-on experience with physical hardware + datacenter management