Senior Site Reliability Engineer - Cloud Networking

Experience with Docker and Kubernetes or other container orchestration services.
Experience in designing, deploying, and maintaining API services, with a strong understanding of gRPC/Protobuf, Thrift, Avro or GraphQL.
Extensive experience with cloud platforms such as AWS, Google Cloud Platform, or Azure, including services like EC2, S3, RDS, and Lambda.
Proficiency with tools such as Terraform, Ansible, or CloudFormation for managing and provisioning cloud infrastructure.
Experience with networking concepts and tools, including Container Network Interface (CNI) and network policy implementations.
Proficiency in Python, with the ability to write efficient, maintainable, and scalable code.

Build and guide internal usage of Kubernetes/EKS, including maintaining and monitoring EKS clusters, writing Helm charts, and configuring ingress and gateways.
Build and scale our internal platform offerings (compute, storage, and networking services) to ensure the reliability and performance of our applications.
Collaborate with application software engineers (as needed) to guide their designs and ensure they scale to meet Carta's long-term needs.
Act as an agent of change and push boundaries to incrementally improve our systems as we expand globally.

Experience operating CI/CD pipelines and applying the associated best practices is appreciated but not essential.