Posted in

Staff Software Engineer – ML Serving Platform

Staff Software Engineer – ML Serving Platform

CompanyPinterest
LocationSan Francisco, CA, USA, Remote in USA
Salary$170371 – $350763
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior, Expert or higher

Requirements

  • Comprehensive understanding of production-scale ML use cases and systems, with a focus on scalability and efficiency
  • Hands-on experience in building large-scale ML systems in production environments, preferably with expertise in state-of-the-art ML inference technologies and optimizations
  • In-depth knowledge of common ML frameworks and systems, including PyTorch, TensorRT, and vLLM, along with their best practices and internal mechanisms
  • Familiarity in GPU programming and the common optimization techniques such as ML compilation and quantization
  • Strong programming skills in Python and C++, coupled with a solid grasp of distributed systems principles
  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience

Responsibilities

  • Architect and develop large-scale, robust, and efficient ML inference engines and serving systems leveraging GPUs and other hardware accelerators
  • Formulate and implement strategic roadmaps for ML inference technologies at team and company level
  • Collaborate with cross-functional teams to drive innovative ML projects, applying advanced inference optimization techniques
  • Engage extensively with ML engineers across Pinterest to understand their technical requirements, address pain points, and create generalized solutions
  • Provide technical mentorship and guidance to junior engineers within the team

Preferred Qualifications

    No preferred qualifications provided.