Skip to content

Sr. Hardware / Infrastructure Site Reliability Engineer – Starlink
Company | SpaceX |
---|
Location | Redmond, WA, USA |
---|
Salary | $160000 – $220000 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s |
---|
Experience Level | Senior, Expert or higher |
---|
Requirements
- Bachelor’s degree in computer science, information systems/IT, or an engineering discipline and 5+ years of professional experience in systems administration, site reliability engineering, or DevOps; OR 7+ years of professional experience in systems administration, site reliability engineering, or DevOps in lieu of a degree
- 2+ years of professional experience with Linux operating systems
- Experience with Terraform, Ansible, or other infrastructure tools
- Experience with containerization technologies (i.e. Docker, Kubernetes)
- Experience in Bash, Python, and/or other scripting languages
- Development experience in Python, Go, C++, or C
Responsibilities
- Develop automation to deploy and manage on-premise compute resources
- Deploy and manage core infrastructure such as databases, monitoring and distributed storage
- Closely collaborate with software engineers to create highly scalable, operable and maintainable products
- Engage in and improve the whole lifecycle of hardware — from inception and design, through deployment, operation, scaling and retirement
- Invent tools and processes that enable fast, accurate, and easy-to-use development and deployment systems
- Hands-on integration and troubleshooting across the entire Starlink stack
- Create and manage core infrastructure such as databases and software libraries
- Identify areas for improvement and create innovative solutions that enable high developer velocity
Preferred Qualifications
- 3+ years of experience with Python and Python-based development frameworks
- Strong understanding of operating systems, virtualization, and hypervisor technologies
- Knowledge of Linux boot process and systems configuration
- Deep understanding of testing, continuous integration, build, deployment & monitoring
- Focus on performance bottlenecks and performance improvement techniques
- Experience with automatically managing dozens, hundreds, or thousands of servers
- Strong networking knowledge of TCP/IP
- Excellent communications skills with the ability to communicate with customers, peers, management etc. in both formal and informal situations