Skip to content

Network Engineer
Company | OpenAI |
---|
Location | San Francisco, CA, USA |
---|
Salary | $460000 – $555000 |
---|
Type | Full-Time |
---|
Degrees | |
---|
Experience Level | Senior |
---|
Requirements
- 5+ years of experience in networking or related infrastructure roles
- Strong expertise in networking technologies, protocols, and design principles
- Hands-on experience with troubleshooting complex networking issues, including both LAN and WAN environments
- Deep understanding of how to set up TCP/IP networks from scratch (e.g., BGP, ECMP routing, etc.)
- Deep understanding of network protocols such as TCP/IP, BGP, & VLAN
- Familiarity with optical connectors and optical circuit switches (OCS)
- Understanding of advanced concepts in routing, forwarding, and network management systems
- Experience with telemetry, traffic engineering, and congestion management to optimize network performance
- Skilled in collaborating across teams, combining technical expertise with excellent problem-solving and communication abilities
- Ownership of problems end-to-end and maintain a commitment to continuous learning to effectively solve challenges
- Familiar with InfiniBand, RoCE, or RDMA in HPC (High-Performance Computing) or similar environments
Responsibilities
- Design, manage, and optimize WAN and LAN infrastructure for OpenAI’s supercomputers
- Develop and maintain data collection and monitoring systems to ensure network visibility and performance
- Troubleshoot and resolve network issues, such as TCP/IP, BGP, and physical
- Automate network issue detection and resolution to reduce operational overhead
- Work closely with hardware and systems engineers to meet the performance demands of distributed AI training workloads
Preferred Qualifications
No preferred qualifications provided.