Senior Site Reliability Engineer

Company: Motional
Location: Boston, MA, USA
Salary: $155,000 – $207,000
Type: Full-Time
Degrees: Bachelor’s
Experience Level: Senior

Requirements

  • BS in Computer Science or Engineering, or equivalent AWS certifications and work experience.
  • 5+ years in SRE, DevOps, or related roles.
  • Strong experience with AWS, including DevOps, automation, networking, connectivity, and cost optimization.
  • Experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation).
  • Knowledge of CI/CD tools (e.g., GitLab CI, Jenkins).
  • Strong expertise in containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Solid understanding of networking topologies and concepts.
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, CloudWatch, Datadog).
  • Strong communication and interpersonal skills.
  • Exceptional problem-solving skills.
  • Ability to thrive in a fast-paced, dynamic environment and manage multiple priorities.

Responsibilities

  • Develop and implement strategies to enhance system reliability, performance, and scalability. Monitor system performance and health, proactively identifying and resolving issues before they impact users.
  • Lead the response to high-severity incidents, coordinating cross-functional teams to resolve issues and minimize downtime. Develop or implement systems to facilitate incident management and troubleshooting.
  • Partner with DevOps and other engineering teams to analyze and optimize AWS spend, implementing cost-effective strategies and identifying cost savings and efficiency improvements in cloud infrastructure.
  • Mentor and guide junior team members in developing technical problem-solving skills and adopting industry best practices.
  • Collaborate closely with development and research teams around the world (Singapore, US) to drive the automation of operational tasks and processes to improve efficiency and reduce manual intervention.
  • Stay abreast of the latest industry developments to ensure that internal SRE practices align with Motional’s overall business objectives and industry trends.

Preferred Qualifications

  • Experience in the AV industry or robotics.
  • Proficiency in other cloud platforms such as GCP.
  • Experience designing tooling to simplify the operational management of SaaS/PaaS systems.
  • Experience with various programming languages (e.g., Go, Python, Java, C++, or Bash).
  • Experience with Linux environments and software.
  • Experience with build tools (e.g., Bazel, CMake).
  • Knowledge of Argo CD or Flux.