Senior AI Engineer – Nemo Retriever – Model Optimization and Mlops
Company | NVIDIA |
---|---|
Location | Redmond, WA, USA, Santa Clara, CA, USA, Milwaukee, WI, USA |
Salary | $184000 – $356500 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s |
Experience Level | Senior, Expert or higher |
Requirements
- Bachelor’s or Master’s Degree program in Computer Science, Computer Engineering, or a related field (or equivalent experience).
- 8+ years of demonstrated experience in a similar or related role
- Python programming expertise with Deep Learning (DL) frameworks such as PyTorch.
- Experience delivering software in a cloud context and is familiar with the patterns and processes of handling cloud infrastructure
- Knowledge of MLOps technologies such as Docker-Compose, Containers, Kubernetes, Helm, data center deployments, etc.
- Familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.
- Excellent in-depth hands-on understanding of NLP, LLM, MLLM, Generative AI, and RAG workflows.
- Self-starter with a passion for growth, enthusiasm for continuous learning, and sharing findings across the team.
- Extremely motivated, highly passionate, and curious about new technologies.
Responsibilities
- Develop and maintain NIMs that containerize optimized models using OpenAPI standards using Python or an equivalent performant language.
- Work closely with partner teams to understand requirements, build & evaluate POCs, and develop roadmaps for production-level tools
- Enable development of integrated systems – AI Blueprints that provide a unified, turnkey experience.
- Help build and maintain our Continuous Delivery pipeline with the goal of moving changes to production faster and safer while ensuring key operational standards.
- Provide peer reviews to other specialists, including feedback on performance, scalability, and correctness.
Preferred Qualifications
-
No preferred qualifications provided.