Sr Site Reliability Engineer – Prisma Access
Company | Palo Alto Networks |
---|---|
Location | Plano, TX, USA |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- Strong experience in both Software Engineering and customer operations
- Hands on experience in building fault-tolerant and scalable systems
- Strong development/automation skills
- Experience in designing and implementing monitoring, alerting, logging and remediation systems
- Be comfortable with reading and writing Python code. Java/Go is a plus
- Moderate Unix/Linux experience
- Experience with databases
Responsibilities
- Design and enhance software architecture to improve scalability, service reliability, capacity, and performance
- Write automation code for provisioning and operating infrastructure at massive scale
- Work with development teams to make sure the applications fit within the infrastructure and scalability/reliability is designed and implemented from the grounds up
- Work with QA on building pipelines and automation for delivering and deploying applications to production
- Participate in the occasional on-call rotation supporting the infrastructure
- Roll up the sleeves to solve incidents, formulate theories and test your hypothesis, and narrow down possibilities to find the root cause
- You write postmortem reviews and remediation recommendation
Preferred Qualifications
- Experience working with micro services, deploying applications on kubernetes using helm preferred
- Experience with AWS, Azure, GCP or other cloud providers is a plus
- Experience with Configuration Management and IaC: Jenkins, Git Lab preferred, CI/CD
- Preferred experience: Python, Go, Terraform, Helm, Ansible, Jenkins, GitLab, Elastic search