Applied AI Research Engineering Intern

Pursuing Bachelors or Masters in Computer Science or a related field
Excellent Golang, Rust and/or Python programming and software design skills, including debugging, performance and service health analysis, and test design
Good understanding of algorithms and data structures, solid knowledge of RESTful APIs
Highly motivated, dedicated, and curious about new technologies
Excellent communication, planning, and problem solving skills.

Collaborate on the design and development of the Dynamo Kubernetes stack.
Introduce new features to the Dynamo Python SDK and Dynamo Rust Runtime Core Library; design, implement, and optimize distributed inference components in Rust and Python.
Contribute to the development of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT-LLM, llama.cpp, mistral.rs).
Improve intelligent routing and KV-cache management subsystems.
Contribute to open-source repositories, participate in code reviews, assist with issue triage on GitHub, work closely with the community to address issues, capture feedback, and evolve the framework’s APIs and architecture.

Understanding of machine learning or NLP concepts
Experience in software shipping cycles (dev, deploy, release, CI) and open-source software development
Experience working with inference engines such as vLLM, SGLang TensorRT-LLM and similar
Experience building and deploying containers in Kubernetes environments