Skip to content

Senior AI Infrastructure Engineer – DGX Cloud
Company | NVIDIA |
---|
Location | Santa Clara, CA, USA |
---|
Salary | $148000 – $287500 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s |
---|
Experience Level | Senior |
---|
Requirements
- BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience.
- 5+ years of experience.
- A kind team player who is adaptable in a highly dynamic and changing environment.
- A track record showing a good balance between initiating your own projects, convincing others to collaborate with you and collaborating well on projects initiated by others.
- Experience with infrastructure automation and distributed systems design developing tools for running large scale private or public cloud systems in production.
- Experience in one or more of the following: Python, Go, Typescript, C/C++, Java
- In depth knowledge in one or more of Linux, Networking, Storage, and Containers.
Responsibilities
- Design, build, deploy, and run internal tooling built on top of cloud infrastructure.
- Design, implement, ship, and maintain essential data pipelines, data lake, and reporting that will be used by executive leadership to decide on business priorities.
- Integrate tooling with internal and customer workflows along with cloud service providers to streamline incident, change, and problem management processes.
- Reduce the toil of running an incident, maintenance, through software automation and AI/ML solutions.
Preferred Qualifications
- Experience building and integrating with incident tooling such as FireHydrant, Rootly, incident.io, blameless.
- Experience building plugins, templates, and entity schemas in Backstage.
- Background with infrastructure technologies such as Kubernetes, terraform, docker, helm charts and durable execution systems such as temporal.
- Background with basic ML and data science concepts and tooling such as Hive, Apache Beam, Apache Spark, Pytorch, etc.
- Experience with business analytics tooling such as Looker, Tableau, PowerBI.