Posted in

Senior AI Engineer – Nemo Retriever – Model Optimization and Mlops

Senior AI Engineer – Nemo Retriever – Model Optimization and Mlops

CompanyNVIDIA
LocationRedmond, WA, USA, Santa Clara, CA, USA, Milwaukee, WI, USA
Salary$184000 – $356500
TypeFull-Time
DegreesBachelor’s, Master’s
Experience LevelSenior, Expert or higher

Requirements

  • Bachelor’s or Master’s Degree program in Computer Science, Computer Engineering, or a related field (or equivalent experience).
  • 8+ years of demonstrated experience in a similar or related role
  • Python programming expertise with Deep Learning (DL) frameworks such as PyTorch.
  • Experience delivering software in a cloud context and is familiar with the patterns and processes of handling cloud infrastructure
  • Knowledge of MLOps technologies such as Docker-Compose, Containers, Kubernetes, Helm, data center deployments, etc.
  • Familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.
  • Excellent in-depth hands-on understanding of NLP, LLM, MLLM, Generative AI, and RAG workflows.
  • Self-starter with a passion for growth, enthusiasm for continuous learning, and sharing findings across the team.
  • Extremely motivated, highly passionate, and curious about new technologies.

Responsibilities

  • Develop and maintain NIMs that containerize optimized models using OpenAPI standards using Python or an equivalent performant language.
  • Work closely with partner teams to understand requirements, build & evaluate POCs, and develop roadmaps for production-level tools
  • Enable development of integrated systems – AI Blueprints that provide a unified, turnkey experience.
  • Help build and maintain our Continuous Delivery pipeline with the goal of moving changes to production faster and safer while ensuring key operational standards.
  • Provide peer reviews to other specialists, including feedback on performance, scalability, and correctness.

Preferred Qualifications

    No preferred qualifications provided.