Associate D&A Site Reliability Engineer - SRE

Company	Mondelez International
Location	New Mexico, USA; Mumbai, Maharashtra, India
Salary	Not Provided
Type	Full-Time
Degrees
Experience Level	Senior, Expert or higher

6+ years in SRE, DevOps, or Cloud Platform Engineering with end-to-end platform ownership experience.
Expert in Terraform (secure modules, policy-as-code, Terraform Cloud/Enterprise).
Strong GCP knowledge (GKE, Compute Engine, IAM, VPC, Cloud Storage, Cloud Armor, Identity Aware Proxy).
Deep Kubernetes (GKE) experience (autoscaling, network policies, RBAC, PSPs, Kubernetes security).
Proven skills in managing SLIs/SLOs and automated incident response.
Strong background in cloud security and vulnerability management, using tools such as SonarQube, Wiz, Tenable, GitHub Actions, and Dependabot.
Experience with observability stacks (Prometheus, Grafana, Stackdriver, Datadog) and root cause analysis.
Hands-on CI/CD experience (GitHub CI/CD, Argo CD, Jenkins) integrated with Terraform.
Proficient in Python, Bash, or Go for automation.
Familiar with FinOps best practices and compliance frameworks (ISO, SOC 2, etc.).

Execute the business analytics agenda in conjunction with analytics team leaders.
Work with best-in-class external partners who leverage analytics tools and processes.
Use models/algorithms to uncover signals/patterns and trends to drive long-term business performance.
Execute the business analytics agenda using a methodical approach that conveys to stakeholders what business analytics will deliver.
Drive continuous improvement in platform performance, reliability, security, and usability.
Proactively identify, remediate, and prevent security vulnerabilities. Automate compliance checks and vulnerability scans.
Own SLIs/SLOs and build self-healing systems with clear incident response processes.
Architect and govern reusable, secure Terraform-based GCP infrastructure.
Integrate FinOps principles to optimize the cost, performance, and utilization of D&A workloads and resources.
Implement comprehensive monitoring to identify trends, prevent issues, and improve reliability.
Enforce security policies as code (shift-left security) and support security audits.
Partner with Dev, FinOps, CloudOps, and Security teams to ensure alignment.