Skip to content

Sr Site Reliability Engineer
Company | Medline |
---|
Location | Northbrook, IL, USA |
---|
Salary | $110240 – $165360 |
---|
Type | Full-Time |
---|
Degrees | Bachelor’s |
---|
Experience Level | Senior |
---|
Requirements
- Bachelor’s degree in Computer Science, Information Technology, or related field
- 5+ years of experience in platform support, systems administration, or infrastructure engineering
- Proficiency in both Windows and Linux system administration
- Scripting experience using PowerShell, Bash, or similar tools
- Experience with monitoring tools such as LogicMonitor and Splunk
- Familiarity with DevOps principles and automation practices
- Experience supporting enterprise applications and deployment processes
- Willingness to participate in rotating on-call support
Responsibilities
- Install, configure, and maintain platform components (Windows/Linux servers, file systems, middleware, etc.) across development, test, and production environments.
- Prepare environments for application deployments and platform-level changes.
- Monitor system health using tools like LogicMonitor and Splunk; respond to alerts and incidents with a root cause and resolution mindset.
- Improve system performance and reliability through configuration tuning and monitoring enhancements.
- Develop and maintain scripts (e.g., PowerShell, Bash, Python) to automate health checks, administrative tasks, and environment validation.
- Collaborate with application teams and infrastructure engineers to validate system readiness for deployments and major changes.
- Ensure platform-level changes follow Medline’s change control, documentation, and testing procedures.
- Apply system security best practices; ensure patching, access management, and configuration policies are in place and audit-ready.
- Participate in ITGC, SOX, and security reviews to maintain operational compliance.
- Maintain accurate runbooks, technical documentation, and troubleshooting guides.
- Share knowledge across the team to support 24×7 platform operations and reduce key-person risk.
- Identify opportunities to improve observability, reduce noise, and increase system resilience.
- Collaborate with SREs and automation engineers to advocate for platform improvements, capacity management, and performance optimization.
Preferred Qualifications
- Experience with Azure, containerization (e.g., Docker), and infrastructure-as-code (e.g., Terraform, Ansible)
- Understanding of microservices, cloud fundamentals, and CI/CD pipelines (e.g., Jenkins, GitHub Actions)
- Exposure to Supply Chain or WMS/SAP environments
- Certification in Azure Fundamentals, SRE, or DevOps practices