Senior Site Reliability Engineer – Cloud Networking
Company | Carta |
---|---|
Location | Kitchener, ON, Canada |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- Experience with Docker and Kubernetes or other container orchestration services.
- Experience in designing, deploying, and maintaining API services, with a strong understanding of gRPC/Protobuf, Thrift, Avro or GraphQL.
- Extensive experience with cloud services such as AWS, Google Cloud Platform, or Azure, including services like EC2, S3, RDS, and Lambda.
- Proficient in using tools such as Terraform, Ansible, or CloudFormation for managing and provisioning cloud infrastructure.
- Experience with networking concepts and tools, including Container Network Interface (CNI), Network policy implementations.
- Proficiency in Python, with the ability to write efficient, maintainable, and scalable code.
Responsibilities
- Build and guide internal usage of Kubernetes/EKS including maintaining and monitoring EKS clusters, writing helm charts and configuring ingress and gateways.
- Build and scale our internal platform offerings (compute, storage and networking services) to ensure the reliability, and performance of our applications.
- Collaborate with application software engineers (as needed) to guide their design and ensure it scales for what Carta needs in the long run.
- Act as an agent of change and push boundaries to incrementally improve our systems as we expand globally.
Preferred Qualifications
- Experience operating CI/CD and its associated best practices is also appreciated though not essential.