Network Engineer

Company: OpenAI
Location: San Francisco, CA, USA
Salary: $460,000 – $555,000
Type: Full-Time
Degrees:
Experience Level: Senior

Requirements

  • 5+ years of experience in networking or related infrastructure roles
  • Strong expertise in networking technologies, protocols, and design principles
  • Hands-on experience with troubleshooting complex networking issues, including both LAN and WAN environments
  • Deep understanding of how to set up TCP/IP networks from scratch (e.g., BGP, ECMP routing)
  • Deep understanding of network protocols and technologies such as TCP/IP, BGP, and VLANs
  • Familiarity with optical connectors and optical circuit switches (OCS)
  • Understanding of advanced concepts in routing, forwarding, and network management systems
  • Experience with telemetry, traffic engineering, and congestion management to optimize network performance
  • Skilled in collaborating across teams, combining technical expertise with excellent problem-solving and communication abilities
  • End-to-end ownership of problems and a commitment to continuous learning to solve challenges effectively
  • Familiarity with InfiniBand, RoCE, or RDMA in HPC (High-Performance Computing) or similar environments

Responsibilities

  • Design, manage, and optimize WAN and LAN infrastructure for OpenAI’s supercomputers
  • Develop and maintain data collection and monitoring systems to ensure network visibility and performance
  • Troubleshoot and resolve network issues across TCP/IP, BGP, and the physical layer
  • Automate network issue detection and resolution to reduce operational overhead
  • Work closely with hardware and systems engineers to meet the performance demands of distributed AI training workloads

Preferred Qualifications

    No preferred qualifications provided.