Senior Software Developer – HPC Cluster Management
| Company | NVIDIA |
|---|---|
| Location | Santa Clara, CA, USA |
| Salary | $184,000 – $356,500 |
| Type | Full-Time |
| Degrees | Bachelor’s |
| Experience Level | Senior, Expert or higher |
Requirements
- Degree in Computer Science or related field (or equivalent experience)
- 7+ years of experience in software development and/or related roles
- Familiarity with the Linux operating system and networking concepts in Linux
- Good practical knowledge of the common software included in a typical Linux installation
- Proficient in Python and familiar with object-oriented software design, design patterns, and concurrent programming techniques
- Emphasis on high quality of work and producing clean code
- Eager to learn and use new technologies
Responsibilities
- Develop the head node and compute node installation and provisioning processes
- Work on functionality in the area of edge site deployment
- Integrate our product with the latest hardware (e.g., GPUs, DPUs, accelerators, and high-speed interconnects such as InfiniBand)
- Work on features related to composable infrastructure management
- Develop new features for our BIOS and firmware upgrade management
- Develop functionality that makes Bright clusters usable for a wider range of workloads and improves scalability so that clusters can grow to a very large number of nodes
- Add support for new Linux distributions
- Improve support for alternative CPU architectures such as ARM
- Work on adding features to our Ansible collections for Cluster Installation and Management
- Assist our support team with customer support requests related to the above-mentioned features, and help customers use our product more efficiently
Preferred Qualifications
- Experience with Ansible
- Experience with high-performance computing and system administration
- Knowledge of Kubernetes, AWS, Azure, GCE, OpenStack, Jenkins and distributed programming
- Proficiency in C++