Posted in

Site Reliability Engineer

Site Reliability Engineer

CompanyS&P Global
LocationCalgary, AB, Canada, London, ON, Canada, Remote in Canada
Salary$85000 – $100000
TypeFull-Time
DegreesBachelor’s, Master’s
Experience LevelJunior, Mid Level

Requirements

  • Bachelor’s/Master’s Degree in Computer Science, Information Systems or equivalent.
  • 2 to 3 years of Site Reliability Engineering (SRE) experience.
  • Scripting skills on any of these following Shell scripts, Python, Perl, PowerShell etc.
  • Building cloud infrastructure as code (IoC) using Terraform, Cloud Formation Templates (CFTs) etc.
  • Ability to perform Apache/Tomcat + J2EE installations on Linux based systems.
  • Working knowledge of AWS cloud technologies: VPC, EC2, EKS, ELB, RDS, Lambda, SES, SNS, Containers, API Gateway, Docker, Kubernetes etc.
  • Knowledge with observability tools such as ELK, DataDog, Grafana, Splunk
  • CI/CD delivery using configuration management tools such as GitHub, VSTS, Ansible, Puppet, Chef, Salt, Jenkins, Maven etc.
  • Knowledge in load-balancing and high-availability planning with BigIP, Application Load balancers, NLB.

Responsibilities

  • Utilizes technical knowledge and analytical skills to architect and optimize cloud infrastructure.
  • Standardizes the technology stack for cloud and data centers.
  • Implements cloud service catalogs and service governance solutions.
  • Drives the implementation of a cloud-first strategy in partnership with development and business stakeholders.
  • Supports and operates large-scale desktop and web application systems.
  • Addresses problems of critical and high severity, requiring thorough review of dependencies across systems, infrastructure, networks, applications, databases, data, and protocols.
  • Participates in critical incident management calls with hands-on ability to troubleshoot IIS/.NET/Apache/application and infrastructure and network performance issues.
  • Provides 3rd level support for escalations and communicates progress updates to stakeholders on incident resolutions.
  • Coordinates with Development and Tech teams on product rollouts, releases, and critical fixes while coordinating with Performance and QA teams on performance metrics commitment review.
  • Coordinates with Tech teams on OS upgrades, vulnerability remediation to ensure audit and compliance.

Preferred Qualifications

    No preferred qualifications provided.