Associate D&a Site Reliability Engineer – SRE
Company | Mondelez International |
---|---|
Location | New Mexico, USA, Mumbai, Maharashtra, India |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | |
Experience Level | Senior, Expert or higher |
Requirements
- 6+ years in SRE, DevOps, or Cloud Platform Engineering with end-to-end platform ownership experience.
- Expert in Terraform (secure modules, policy-as-code, Terraform Cloud/Enterprise).
- Strong GCP knowledge (GKE, Compute Engine, IAM, VPC, Cloud Storage, Cloud Armor, Identity Aware Proxy).
- Deep Kubernetes (GKE) experience (autoscaling, network policies, RBAC, PSPs, Kubernetes security).
- Proven skills in managing SLIs/SLOs and automated incident response.
- Strong background in cloud security and vulnerability management. Use of tools like Sonarqube, Wiz, Tenable, GitHub actions and Dependabot.
- Experience with observability stacks (Prometheus, Grafana, Stackdriver, Datadog) and root cause analysis.
- Hands-on CI/CD experience (Github CI/CD, ArgoCD, Jenkins) integrated with Terraform.
- Proficient in Python, Bash, or Go for automation.
- Familiar with FinOps best practices and compliance frameworks (ISO, SOC2, etc.).
Responsibilities
- Execute the business analytics agenda in conjunction with analytics team leaders.
- Work with best-in-class external partners who leverage analytics tools and processes.
- Use models/algorithms to uncover signals/patterns and trends to drive long-term business performance.
- Execute the business analytics agenda using a methodical approach that conveys to stakeholders what business analytics will deliver.
- Drive continuous improvement in platform performance, reliability, security, and usability.
- Proactively identify, remediate, and prevent security vulnerabilities. Automate compliance checks and vulnerability scans.
- Own SLIs/SLOs, build self-healing systems with clear incident response.
- Architect and govern reusable, secure Terraform-based GCP infrastructure.
- Integrate FinOps principles to optimize D&A workload and resource consumption cost, performance, and utilization.
- Implement comprehensive monitoring to identify trends, prevent issues, and improve reliability.
- Enforce security policies as code (shift left security) and support security audits.
- Partner with Dev, FinOps, CloudOps, and Security teams to ensure alignment.
Preferred Qualifications
- GCP certifications (Cloud Architect, DevOps Engineer).
- Multi-cloud (AWS/GCP) Terraform experience.
- Terraform Cloud/Enterprise and policy-as-code (Sentinel, OPA) experience.
- AI-driven monitoring or SRE tooling development background.
- Workload identity federation and GKE security hardening experience.