Posted in

Applied AI Research Engineering Intern

Applied AI Research Engineering Intern

CompanyNVIDIA
LocationSanta Clara, CA, USA
Salary$18 – $71
TypeInternship
DegreesBachelor’s, Master’s
Experience LevelInternship

Requirements

  • Pursuing Bachelors or Masters in Computer Science or a related field
  • Excellent Golang, Rust and/or Python programming and software design skills, including debugging, performance and service health analysis, and test design
  • Good understanding of algorithms and data structures, solid knowledge of RESTful APIs
  • Highly motivated, dedicated, and curious about new technologies
  • Excellent communication, planning, and problem solving skills.

Responsibilities

  • Collaborate on the design and development of the Dynamo Kubernetes stack.
  • Introduce new features to the Dynamo Python SDK and Dynamo Rust Runtime Core Library; design, implement, and optimize distributed inference components in Rust and Python.
  • Contribute to the development of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT-LLM, llama.cpp, mistral.rs).
  • Improve intelligent routing and KV-cache management subsystems.
  • Contribute to open-source repositories, participate in code reviews, assist with issue triage on GitHub, work closely with the community to address issues, capture feedback, and evolve the framework’s APIs and architecture.

Preferred Qualifications

  • Understanding of machine learning or NLP concepts
  • Experience in software shipping cycles (dev, deploy, release, CI) and open-source software development
  • Experience working with inference engines such as vLLM, SGLang TensorRT-LLM and similar
  • Experience building and deploying containers in Kubernetes environments