Posted in

Sr. Hardware / Infrastructure Site Reliability Engineer – Starlink

Sr. Hardware / Infrastructure Site Reliability Engineer – Starlink

CompanySpaceX
LocationRedmond, WA, USA
Salary$160000 – $220000
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior, Expert or higher

Requirements

  • Bachelor’s degree in computer science, information systems/IT, or an engineering discipline and 5+ years of professional experience in systems administration, site reliability engineering, or DevOps; OR 7+ years of professional experience in systems administration, site reliability engineering, or DevOps in lieu of a degree
  • 2+ years of professional experience with Linux operating systems
  • Experience with Terraform, Ansible, or other infrastructure tools
  • Experience with containerization technologies (i.e. Docker, Kubernetes)
  • Experience in Bash, Python, and/or other scripting languages
  • Development experience in Python, Go, C++, or C

Responsibilities

  • Develop automation to deploy and manage on-premise compute resources
  • Deploy and manage core infrastructure such as databases, monitoring and distributed storage
  • Closely collaborate with software engineers to create highly scalable, operable and maintainable products
  • Engage in and improve the whole lifecycle of hardware — from inception and design, through deployment, operation, scaling and retirement
  • Invent tools and processes that enable fast, accurate, and easy-to-use development and deployment systems
  • Hands-on integration and troubleshooting across the entire Starlink stack
  • Create and manage core infrastructure such as databases and software libraries
  • Identify areas for improvement and create innovative solutions that enable high developer velocity

Preferred Qualifications

  • 3+ years of experience with Python and Python-based development frameworks
  • Strong understanding of operating systems, virtualization, and hypervisor technologies
  • Knowledge of Linux boot process and systems configuration
  • Deep understanding of testing, continuous integration, build, deployment & monitoring
  • Focus on performance bottlenecks and performance improvement techniques
  • Experience with automatically managing dozens, hundreds, or thousands of servers
  • Strong networking knowledge of TCP/IP
  • Excellent communications skills with the ability to communicate with customers, peers, management etc. in both formal and informal situations