Posted in

Site Reliability Engineer – Cloud Native Platform – Traffic Infrastructure

Site Reliability Engineer – Cloud Native Platform – Traffic Infrastructure

CompanyByteDance
LocationSan Jose, CA, USA
Salary$Not Provided – $Not Provided
TypeFull-Time
DegreesBachelor’s, Master’s
Experience LevelSenior, Expert or higher

Requirements

  • At least a Bachelor’s degree in any of these faculties: Computer Science, Information Technology, Programming & Systems Analysis, Science (Computer Studies)
  • Experience in Kubernetes administration.
  • Experience in Unix/Linux systems from kernel to shell and beyond.
  • Experience with Kubernetes CNI deployment and troubleshooting, including (but not limited to) the following CNIs: Cilium, Kube-Router, Calico, Flannel.
  • Experience in designing, analyzing, and building automation tools for large scale and complex systems.

Responsibilities

  • Deploy and administrate Kubernetes clusters both on-prem and in cloud (AWS, GCP, etc.).
  • Collaborate with software engineers to build enterprise-level platform (PaaS) with cutting-edge Cloud Native Computing Foundation (CNCF) technologies.
  • Design, develop, automate, and continuously improve platform services and pipelines, such as monitoring, alerting, logging, tracing, CI/CD, etc.
  • Improve Kubernetes system efficiency and debug issues related to networking, storage, scheduling, etc.
  • Collaborate with open-source communities to advance Kubernetes and Cloud Native technologies.
  • Research, design, and develop computer and network software or specialised utility programs.
  • Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
  • Update software, enhances existing software capabilities, and develops and direct software testing and validation procedures.
  • Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.

Preferred Qualifications

  • Master’s degree (or Bachelor’s degree with 5+ years of experience) in Computer Engineering, Computer Science, or related fields.
  • CKA (Certified Kubernetes Administrator) certification.
  • Experience in using and contributing to open-source projects in Kubernetes ecosystem, e.g. Kubespray, CNI, Helm, KubeEdge, Istio/Linkerd, Prometheus, ArgoCD, OPA, Harbor, Envoy, etc.
  • Experience in networking technologies such TCP/IP, BGP, DNS, load balancers, etc.
  • Experience in CI/CD pipeline design and development.
  • Experience in Kubernetes API, Operator, and Custom Resource Definition (CRD) development.