Devops Engineer
Company | Aristocrat Leisure |
---|---|
Location | Montreal, QC, Canada |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | |
Experience Level | Mid Level, Senior |
Requirements
- Expertise in Terraform, Go-Lang, Java, or experience in another programming language
- Experience and knowledge of CI/CD design and practice
- Public Cloud, preferably GCP but AWS and Azure are good too!
- Experience with Cloud Architecture Design principles and Cloud Architect certification
- Experience creating infra-as code solutions using tools such as Terraform, Azure ARM templates, Cloudformation – a must
- Experience with CI/CD tools and methodologies such as Jenkins, ArgoCD, CircleCI, etc – a must
- Hands-on implementation of Continuous Integration and Continuous Delivery in complex environments.
- Proven experience working in a production environment – a must
- Solid experience implementing production-grade Kubernetes Clusters with containerized environments and microservices (Docker, Kubernetes)
- Experience working with Configuration Management tools (Chef, Puppet, Ansible is an advantage)
- Solid understanding of networking technologies mainly focusing on networking for clouds
Responsibilities
- Design, deploy, and maintain GCP cloud environments across development (Dev), User Acceptance Testing (UAT), staging (STG), and production (PRD)
- Implement and optimize Infrastructure as Code (IaC) using Terraform, ensuring consistent infrastructure across all environments
- Prepare for and support future multi-cloud adoption (AWS and Azure)
- Build and maintain CI/CD pipelines using GitHub Actions to automate software delivery for mobile game platforms
- Ensure seamless deployments across all environments by managing automated testing, artifact storage, and environment promotion workflows
- Troubleshoot and resolve CI/CD failures quickly to ensure continuous delivery
- Implement and maintain monitoring, alerting, and logging systems using Grafana Cloud
- Analyze and respond to system performance metrics, ensuring optimal resource utilization and application stability
- Conduct root cause analysis (RCA) for incidents and implement preventative measures
- Enforce access control policies using Privileged Access Management (PAM) and Role-Based Access Control (RBAC)
- Regular patch management across cloud resources should be applied to align with the NISA security framework
- Ensure compliance with internal security policies and external regulatory standards
- Participate in a 7-day on-call rotation once a month, responding to critical cloud infrastructure incidents
- Document and escalate major incidents as needed, ensuring minimal downtime for revenue-critical applications
- Perform after-hours troubleshooting and apply emergency fixes during production outages
- Partner with mobile game developers to understand infrastructure needs and provide tailored DevOps solutions
- Follow the Kanban framework to manage and prioritize tasks, ensuring clear workflows and delivery timelines
- Identify and implement process improvements to enhance operational efficiency and system reliability
Preferred Qualifications
- Experience with Service Mesh solutions such as Anthos, Istio, Consul–Advantage
- Experience with monitoring and log analysis tools such as ELK, Prometheus, Grafana, New Relic, etc. – Advantage