Posted in

Senior Firmware Release Lifecycle Infrastructure Architect

Senior Firmware Release Lifecycle Infrastructure Architect

CompanyNVIDIA
LocationSanta Clara, CA, USA
Salary$224000 – $356500
TypeFull-Time
DegreesBachelor’s
Experience LevelExpert or higher

Requirements

  • 12+ years of experience in software architecture, systems programming, automation infrastructure and firmware package creation — preferably within data center or enterprise environments.
  • Bachelors degree or equivalent experience.
  • Strong background in designing scalable and modular architectures, with the ability to identify and mitigate performance bottlenecks.
  • Advanced Python programming skills with deep understanding of object-oriented design principles and scalable code practices.
  • Expertise in Linux system programming, with proficiency in shell scripting, system debugging, and automation toolchains.
  • Experience with firmware workflows and lifecycle management, including familiarity with Redfish APIs, update mechanisms, and industry standards (e.g., DMTF).
  • Hands-on experience integrating third-party tools and building robust automation frameworks using Jenkins, Docker, and Kubernetes.
  • Excellent communication skills; ability to document and present technical designs to stakeholders across hardware, firmware, software, and QA.
  • Familiarity with operating system fundamentals such as process scheduling, memory management, and system security models.

Responsibilities

  • As an Architect for Firmware Lifecycle Management (FLM), you will be responsible for developing infrastructure that ingests firmware binary images and assembles secure, validated firmware bundles to be deployed across NVIDIA’s GPU servers, blade systems, and full-rack solutions.
  • Design scalable system architectures capable of supporting high-concurrency firmware packaging pipelines across multiple platforms and SKUs.
  • Collaborate with cross-functional teams — including firmware, hardware, software, and QA — to gather requirements and deliver robust solutions that scale across tens of products and hundreds of variants.
  • Architect and implement front-end, back-end, APIs, UI, and CLIs to support FLM workflows, while ensuring maintainability and performance.
  • Drive the integration of third-party software components and services (e.g., Jenkins, Artifactory, Vault) into the FLM ecosystem.
  • Own the design and evolution of scalable APIs, focused on long-term maintainability and extensibility.
  • Implement automation frameworks and pipelines using Jenkins, Docker, and Kubernetes; familiarity with Artifactory and Robot Framework is a plus.
  • Continuously seek opportunities for process automation, observability, and resilient system design, including telemetry and high availability improvements.

Preferred Qualifications

  • Good understanding of Out-of-Band/In-Band management, and one or more of the protocols MCTP, PLDM, SPDM, Redfish, and common firmware stacks such as OpenBMC and BIOS/UEFI.
  • Prior experience working in firmware provisioning. Knowledge of low-level hardware interfaces such as PCIe, I2C, SPI, USB.
  • Familiarity with RESTful architectures, JSON-over-HTTPS, OAuth-based authentication, and secure API development. Ability to lead cross-team architectural discussions and provide mentorship to junior developers.