Deep Learning Server Software Intern – Dgx
Company | NVIDIA |
---|---|
Location | Santa Clara, CA, USA |
Salary | $18 – $71 |
Type | Internship |
Degrees | Bachelor’s, Master’s, PhD |
Experience Level | Internship |
Requirements
- Pursuing BS, MS, or PhD in Electrical Engineering, Computer Science or related field
- Proficiency in Python programming
- Has built and maintained scalable API solutions
- Previous experience with Natural Language Processing (NLP) and an understanding of Large Language Models (LLM)/GenAI technologies such as OpenAI API, ChatGPT, GPT-4, Bard, Synthesia, Langchain, HuggingFace Transformers, PyTorch or similar
- Familiarity with prompt engineering and vector databases
- Prior experience with MLOps and/or CI/CD pipeline development, containerization, model deployment in test and production environments
Responsibilities
- Design, enhance and implement cluster monitoring infrastructure for NVIDIA’s deep learning enterprise server platforms
- Collaborate with engineering teams across the company to implement python scripts, database and dynamic reports
- Develop and deploy tools that would be used by wider teams at NVIDIA as we design next generation deep learning platforms
- Interact with diverse technical groups, spanning all organizational levels
Preferred Qualifications
- Understanding of system architecture concepts
- Deep knowledge of a specific domain or industry, with a focus on NLP/LLM
- Applied research background leveraging frameworks to build LLM prototypes, knowledge of best practices for production LLM development
- Be a team player with the ability to clearly communicate complex LLM capabilities and limitations to non-technical stakeholders