Posted in

Senior Software Developer – HPC Cluster Management

Senior Software Developer – HPC Cluster Management

CompanyNVIDIA
LocationSanta Clara, CA, USA
Salary$184000 – $356500
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior, Expert or higher

Requirements

  • Degree in Computer Science or related field (or equivalent experience)
  • 7+ years of experience in software development and/or related roles
  • Familiarity with the Linux operating system and networking concepts in Linux
  • Good practical knowledge about common software installed as part of a typical Linux installation
  • Proficient in Python and familiar with object-oriented software design, design patterns, and concurrent programming techniques
  • Emphasis on high quality of work and producing clean code
  • Eager to learn and use new technologies

Responsibilities

  • Development of the head node and compute node installation and provisioning processes
  • Work on functionality in the area of edge site deployment
  • Integrating our product with the latest hardware (e.g GPUs, DPUs, accelerators, high-speed interconnects such as Infiniband)
  • Work on features related to composable infrastructure management
  • Develop new features for our BIOS and firmware upgrade management
  • Develop functionality that makes Bright clusters usable for a wider range of workloads, and increases scalability to allow clusters to scale to huge number of nodes
  • Adding support for new Linux distributions
  • Improving support for alternative CPU architectures such as ARM
  • Work on adding features to our Ansible collections for Cluster Installation and Management
  • Assist our support team with customer support requests in the above mentioned features and help our customers to use our product more efficiently

Preferred Qualifications

  • Experience with Ansible
  • Experience with high-performance computing and system administration
  • Knowledge of Kubernetes, AWS, Azure, GCE, OpenStack, Jenkins and distributed programming
  • Proficiency in C++