Posted in

Software Engineering Manager – Collective Communications

Software Engineering Manager – Collective Communications

CompanyMeta
LocationMenlo Park, CA, USA
Salary$177000 – $251000
TypeFull-Time
DegreesBachelor’s, Master’s
Experience LevelSenior, Expert or higher

Requirements

  • 8+ years, or PhD + 4 years, of software engineering work experience, including hands on technical management
  • 2+ years of experience managing a networking related Software Engineering Team
  • BS or MS in Computer Science or related technical discipline or equivalent experience
  • Working knowledge of Collective Communications Libraries such as NVIDIA Collective Communications Library (NCCL) and Message Passing Interface (MPI)
  • Experience with software development for Distributed and Embedded systems
  • Demonstrated experience recruiting, building, structuring, and leading technical organizations, including performance management
  • Experience supporting, coaching, mentoring, and developing software engineers

Responsibilities

  • Help define technical roadmap for the team, drive execution of associated tasks and support the team in resolving dependencies
  • Guide and mentor team members to develop appropriate skillsets to grow in their careers, and where necessary address under performance
  • Collaborate effectively with other groups across the wider organisation such as Co-Design, Software, Hardware, Infrastructure and Operations teams
  • Communicate cross-functionally and drive engineering efforts
  • Interact with external partners as needed in resolving dependencies associated with objectives

Preferred Qualifications

  • Experience with distributed GPU reliability/performance improvement on RoCE/Infiniband
  • Knowledge of network transport stack such as RoCE (RDMA)
  • Experience working with Deep Learning frameworks like PyTorch, Caffe2 or TensorFlow
  • Knowledge of Machine Learning, Deep Learning and Large Language Models