Posted in

Devops Engineer

Devops Engineer

CompanyAristocrat Leisure
LocationMontreal, QC, Canada
Salary$Not Provided – $Not Provided
TypeFull-Time
Degrees
Experience LevelMid Level, Senior

Requirements

  • Expertise in Terraform, Go-Lang, Java, or experience in another programming language
  • Experience and knowledge of CI/CD design and practice
  • Public Cloud, preferably GCP but AWS and Azure are good too!
  • Experience with Cloud Architecture Design principles and Cloud Architect certification
  • Experience creating infra-as code solutions using tools such as Terraform, Azure ARM templates, Cloudformation – a must
  • Experience with CI/CD tools and methodologies such as Jenkins, ArgoCD, CircleCI, etc – a must
  • Hands-on implementation of Continuous Integration and Continuous Delivery in complex environments.
  • Proven experience working in a production environment – a must
  • Solid experience implementing production-grade Kubernetes Clusters with containerized environments and microservices (Docker, Kubernetes)
  • Experience working with Configuration Management tools (Chef, Puppet, Ansible is an advantage)
  • Solid understanding of networking technologies mainly focusing on networking for clouds

Responsibilities

  • Design, deploy, and maintain GCP cloud environments across development (Dev), User Acceptance Testing (UAT), staging (STG), and production (PRD)
  • Implement and optimize Infrastructure as Code (IaC) using Terraform, ensuring consistent infrastructure across all environments
  • Prepare for and support future multi-cloud adoption (AWS and Azure)
  • Build and maintain CI/CD pipelines using GitHub Actions to automate software delivery for mobile game platforms
  • Ensure seamless deployments across all environments by managing automated testing, artifact storage, and environment promotion workflows
  • Troubleshoot and resolve CI/CD failures quickly to ensure continuous delivery
  • Implement and maintain monitoring, alerting, and logging systems using Grafana Cloud
  • Analyze and respond to system performance metrics, ensuring optimal resource utilization and application stability
  • Conduct root cause analysis (RCA) for incidents and implement preventative measures
  • Enforce access control policies using Privileged Access Management (PAM) and Role-Based Access Control (RBAC)
  • Regular patch management across cloud resources should be applied to align with the NISA security framework
  • Ensure compliance with internal security policies and external regulatory standards
  • Participate in a 7-day on-call rotation once a month, responding to critical cloud infrastructure incidents
  • Document and escalate major incidents as needed, ensuring minimal downtime for revenue-critical applications
  • Perform after-hours troubleshooting and apply emergency fixes during production outages
  • Partner with mobile game developers to understand infrastructure needs and provide tailored DevOps solutions
  • Follow the Kanban framework to manage and prioritize tasks, ensuring clear workflows and delivery timelines
  • Identify and implement process improvements to enhance operational efficiency and system reliability

Preferred Qualifications

  • Experience with Service Mesh solutions such as Anthos, Istio, Consul–Advantage
  • Experience with monitoring and log analysis tools such as ELK, Prometheus, Grafana, New Relic, etc. – Advantage