Senior Software Engineer – Site Reliability Engineering

Company: Mastercard
Location: O’Fallon, MO, USA
Salary: $115,000 – $184,000
Type: Full-Time
Degrees: Bachelor’s
Experience Level: Senior

Requirements

  • Bachelor’s Degree in Computer Science, Computer Systems, Information Technology or related. Equivalent experience is acceptable.
  • Solid knowledge and experience with web applications and distributed systems infrastructure.
  • Solid knowledge and understanding of software engineering concepts and methodologies.
  • Excellent verbal and written communication adjustable to a diverse audience with various levels of technical and business acumen.
  • Experience with monitoring and alerting tools like Dynatrace, Splunk, Prometheus.
  • Interest and ability to learn new coding languages like Java and Python, frameworks like Spring, and paradigms as needed.
  • IT experience including demonstrating thought-leadership and relationship building across large-scale organizations.
  • Knowledge and understanding of Service Level Objectives, Observability, Golden Signals and Availability calculations.
  • Experience with static analysis tools to improve software quality.
  • Knowledge of CI/CD platforms (Jenkins, Bamboo, Concourse, XLR, etc.).

Responsibilities

  • Be part of a team of site reliability engineers supporting services before they go live through activities such as system design consulting, performance engineering, tuning, chaos testing, capacity planning and launch reviews.
  • Increase, maintain and communicate service metrics once live by measuring and monitoring availability, latency, performance and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve them by pushing for changes that improve reliability and velocity and by recommending performance tuning enhancements.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Review production incidents to identify and drive solutions that prevent recurrence and minimize customer impact.
  • Manage individual project priorities, deadlines, and deliverables.
  • Create and maintain technology roadmaps.
  • Look at all tasks with an eye for automation; then work to automate them.
  • Build, manage and maintain robust dashboards reflecting system health.
  • Apply expert technical capabilities across discipline(s) to troubleshoot and solve problems.

Preferred Qualifications

  • Experience with Java, Python, Scala, or other object-oriented programming languages.
  • Experience with Git, Bitbucket, Stash or other version control systems.
  • Experience with Maven and/or Gradle.
  • Experience building pipelines in Jenkins, Bamboo, Concourse or XLR.
  • Experience building test suites in JMeter, LoadRunner, Gatling and/or BlazeMeter.
  • Experience with performance tuning of cloud-native applications.
  • Experience working across teams to troubleshoot complex issues and provide guidance.