Site Reliability Engineer – Cloud Native Platform – Traffic Infrastructure
Company | ByteDance |
---|---|
Location | San Jose, CA, USA |
Salary | Not Provided |
Type | Full-Time |
Degrees | Bachelor’s, Master’s |
Experience Level | Senior, Expert or higher |
Requirements
- At least a Bachelor’s degree in any of these faculties: Computer Science, Information Technology, Programming & Systems Analysis, Science (Computer Studies).
- Experience in Kubernetes administration.
- Experience in Unix/Linux systems from kernel to shell and beyond.
- Experience with Kubernetes CNI deployment and troubleshooting, including (but not limited to) the following CNIs: Cilium, Kube-Router, Calico, Flannel.
- Experience in designing, analyzing, and building automation tools for large-scale, complex systems.
Responsibilities
- Deploy and administer Kubernetes clusters both on-prem and in the cloud (AWS, GCP, etc.).
- Collaborate with software engineers to build an enterprise-level platform (PaaS) with cutting-edge Cloud Native Computing Foundation (CNCF) technologies.
- Design, develop, automate, and continuously improve platform services and pipelines, such as monitoring, alerting, logging, tracing, CI/CD, etc.
- Improve Kubernetes system efficiency and debug issues related to networking, storage, scheduling, etc.
- Collaborate with open-source communities to advance Kubernetes and Cloud Native technologies.
- Research, design, and develop computer and network software or specialised utility programs.
- Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
- Update software, enhance existing software capabilities, and develop and direct software testing and validation procedures.
- Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.
Preferred Qualifications
- Master’s degree (or Bachelor’s degree with 5+ years of experience) in Computer Engineering, Computer Science, or related fields.
- CKA (Certified Kubernetes Administrator) certification.
- Experience in using and contributing to open-source projects in the Kubernetes ecosystem, e.g. Kubespray, CNI, Helm, KubeEdge, Istio/Linkerd, Prometheus, ArgoCD, OPA, Harbor, Envoy, etc.
- Experience in networking technologies such as TCP/IP, BGP, DNS, load balancers, etc.
- Experience in CI/CD pipeline design and development.
- Experience in Kubernetes API, Operator, and Custom Resource Definition (CRD) development.