Staff Network Operations Engineer
| Company | Crusoe |
|---|---|
| Location | San Francisco, CA, USA |
| Salary | $195,000 – $230,000 |
| Type | Full-Time |
| Degrees | Bachelor’s |
| Experience Level | Expert or higher |
Requirements
- 10+ years of related experience building and operating networks at scale in production environments.
- In-depth knowledge of network protocols and technologies, including TCP/IP, BGP, OSPF/IS-IS, EVPN, VXLAN, QoS, and MPLS-related technologies such as RSVP-TE and LDP.
- Good understanding of network monitoring protocols and tools, such as SNMP, IPFIX, sFlow/NetFlow, and telemetry.
- Experience with tools such as Kentik, Arbor, ThousandEyes, Catchpoint, and Packet Design.
- Familiarity with data center network architecture, such as fat-tree and Clos topologies, BGP-TE, and edge peering.
- Hands-on experience with network devices from major vendors such as Mellanox, Cisco, Arista, and Juniper.
- Familiarity with mainstream commercial switch/router chipsets, such as Broadcom and Barefoot.
- In-depth knowledge of public cloud architecture and connectivity options for AWS, GCP, Azure, Alibaba Cloud, OCI, and others.
- Good understanding of IPv6 and IPv4-IPv6 coexistence technologies.
- Self-motivated, with good communication and writing skills.
- Team player, willing to participate in the Crusoe Energy Cloud network's global on-call rotation.
- Bachelor’s in Computer Science, Information Science, Engineering, Mathematics, or a related field, or experience equivalent to a Bachelor’s degree based on three or more years of work experience.
Responsibilities
- Manage and optimize Crusoe Energy Cloud’s global network, including edge, backbone, data center, and public cloud connectivity.
- Collaborate with Network Engineering and cross-functional teams, including but not limited to Software Infrastructure and Product, to drive the innovation and evolution of the Crusoe Energy Cloud network.
- Lead operational excellence initiatives by developing monitoring, alerting, and self-healing systems to ensure high network availability.
- Perform advanced troubleshooting and root cause analysis for incidents, guiding post-mortem reviews and improvements.
- Mentor network engineers and establish best practices for incident response, documentation, and operational readiness.
- Participate in the 24/7 on-call support rotation for the Crusoe network.
Preferred Qualifications
- Familiarity with technologies such as RDMA, InfiniBand, and RoCE is a plus.
- Programming, scripting, or automation experience with Python, Ansible, Puppet, Chef, or similar languages and tools is a plus.