Senior Deep Learning Software Engineer – Inference
Company | NVIDIA |
---|---|
Location | Washington, USA, Oregon, USA, Santa Clara, CA, USA |
Salary | $148000 – $287500 |
Type | Full-Time |
Degrees | Master’s, PhD |
Experience Level | Senior |
Requirements
- Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI)
- At least 5 years of relevant software development experience
- Excellent C/C++ programming and software design skills
- SW Agile skills are helpful
- Python experience is a plus
Responsibilities
- Performance optimization, analysis, and tuning of DL models in various domains like LLM, Recommender, GNN, Generative AI
- Scale performance of DL models across different architectures and types of NVIDIA accelerators
- Contribute features and code to NVIDIA’s inference benchmarking frameworks, TensorRT, Triton and LLM software solutions
- Work with cross-collaborative teams across generative AI, automotive, image understanding, and speech understanding to develop innovative solutions
Preferred Qualifications
- Prior experience with training, deploying or optimizing the inference of DL models in production is a plus
- Prior background with performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus
- GPU programming experience (CUDA or OpenCL) is a plus