Senior Software Engineer – TensorRT Inference
Company | NVIDIA |
---|---|
Location | Santa Clara, CA, USA |
Salary | $184,000 – $356,500 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s, PhD |
Experience Level | Senior, Expert or higher |
Requirements
- BS, MS, or PhD in Computer Science, Computer Engineering, or a related field, or equivalent experience.
- 8+ years of software development experience on a large codebase or project.
- Strong proficiency in C++ and Python programming languages.
- Experience developing deep learning frameworks, compilers, or system software.
- Foundational knowledge of machine learning techniques or GPU optimizations.
- Excellent problem-solving skills and the ability to learn and work effectively in a fast-paced, collaborative environment.
- Strong communication skills and the ability to articulate complex technical concepts.
Responsibilities
- Design, develop, and optimize NVIDIA TensorRT to achieve tightly coordinated and responsive inference applications for datacenters, workstations, and PCs.
- Develop software in C++, Python, and CUDA to enable seamless and efficient deployment of state-of-the-art LLM and Generative AI models.
- Collaborate with deep learning experts and GPU architects throughout the company to influence hardware and software strategy for inference.
Preferred Qualifications
- Background in developing inference backends and compilers for GPUs.
- Knowledge of GPU programming and optimizations using CUDA or OpenCL.
- Experience working with LLM inference frameworks such as TRT-LLM, vLLM, or SGLang.
- Experience working with deep learning frameworks such as TensorRT, PyTorch, or JAX.
- Knowledge of CUDA performance analysis, optimization techniques, and tools.