Posted in

Senior Deep Learning Software Engineer – Inference

Senior Deep Learning Software Engineer – Inference

CompanyNVIDIA
LocationWashington, USA, Oregon, USA, Santa Clara, CA, USA
Salary$148000 – $287500
TypeFull-Time
DegreesMaster’s, PhD
Experience LevelSenior

Requirements

  • Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI)
  • At least 5 years of relevant software development experience
  • Excellent C/C++ programming and software design skills
  • SW Agile skills are helpful
  • Python experience is a plus

Responsibilities

  • Performance optimization, analysis, and tuning of DL models in various domains like LLM, Recommender, GNN, Generative AI
  • Scale performance of DL models across different architectures and types of NVIDIA accelerators
  • Contribute features and code to NVIDIA’s inference benchmarking frameworks, TensorRT, Triton and LLM software solutions
  • Work with cross-collaborative teams across generative AI, automotive, image understanding, and speech understanding to develop innovative solutions

Preferred Qualifications

  • Prior experience with training, deploying or optimizing the inference of DL models in production is a plus
  • Prior background with performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus
  • GPU programming experience (CUDA or OpenCL) is a plus