Staff Software Engineer - ML Serving Platform

Staff Software Engineer – ML Serving Platform

Company	Pinterest
Location	San Francisco, CA, USA, Remote in USA
Salary	$170371 – $350763
Type	Full-Time
Degrees	Bachelor’s
Experience Level	Senior, Expert or higher

Comprehensive understanding of production-scale ML use cases and systems, with a focus on scalability and efficiency
Hands-on experience in building large-scale ML systems in production environments, preferably with expertise in state-of-the-art ML inference technologies and optimizations
In-depth knowledge of common ML frameworks and systems, including PyTorch, TensorRT, and vLLM, along with their best practices and internal mechanisms
Familiarity in GPU programming and the common optimization techniques such as ML compilation and quantization
Strong programming skills in Python and C++, coupled with a solid grasp of distributed systems principles
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience

Architect and develop large-scale, robust, and efficient ML inference engines and serving systems leveraging GPUs and other hardware accelerators
Formulate and implement strategic roadmaps for ML inference technologies at team and company level
Collaborate with cross-functional teams to drive innovative ML projects, applying advanced inference optimization techniques
Engage extensively with ML engineers across Pinterest to understand their technical requirements, address pain points, and create generalized solutions
Provide technical mentorship and guidance to junior engineers within the team

No preferred qualifications provided.