Senior Math Libraries Engineer – AI and HPC
Company | NVIDIA |
---|---|
Location | Santa Clara, CA, USA, Hillsboro, OR, USA |
Salary | $184000 – $356500 |
Type | Full-Time |
Degrees | Master’s, PhD |
Experience Level | Senior |
Requirements
- PhD or Master’s degree in Computer Science, Applied Math, or related science or engineering field of study (or equivalent experience)
- 5+ years of experience in designing, developing, testing, maintenance, and performance optimization of HPC software using C++
- Strong fundamentals in kernel generation and composable library design for linear algebra
- Leadership skills in driving software development projects
- Strong collaboration, communication, and documentation habits
- Kernel generation. JIT focus/experience desired
Responsibilities
- Scoping, designing, and implementing high quality and performance numerical dense linear algebra software on GPUs
- Owning the execution of projects involving multiple engineers and sometimes teams
- Providing technical leadership and feedback to library engineers working with you on projects and sometimes mentor interns
- Working closely with product management and other internal and external customers to understand feature and performance requirements and contribute to the technical roadmaps of libraries
- Finding opportunities to improve library performance and reduce code maintenance overhead through re-architecting
- Finding and explaining complex solutions, exercising leadership, and coordinating with multiple teams to work towards goals
Preferred Qualifications
- Experience with parallel programming, ideally using CUDA, MPI, OpenMP, OpenACC, pthreads
- Good understanding of Machine Learning and Deep Learning technologies
- Good knowledge of GPU (preferred) or CPU hardware architecture
- Experience with low level programming using assembly for performance optimization and operator fusion is a huge plus
- Experience with agile software development practices using project management tools such as JIRA
- A scripting language, preferably Python