Staff Software Engineer – ML Serving Platform
Company | |
---|---|
Location | San Francisco, CA, USA, Remote in USA |
Salary | $170371 – $350763 |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Senior, Expert or higher |
Requirements
- Comprehensive understanding of production-scale ML use cases and systems, with a focus on scalability and efficiency
- Hands-on experience in building large-scale ML systems in production environments, preferably with expertise in state-of-the-art ML inference technologies and optimizations
- In-depth knowledge of common ML frameworks and systems, including PyTorch, TensorRT, and vLLM, along with their best practices and internal mechanisms
- Familiarity in GPU programming and the common optimization techniques such as ML compilation and quantization
- Strong programming skills in Python and C++, coupled with a solid grasp of distributed systems principles
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience
Responsibilities
- Architect and develop large-scale, robust, and efficient ML inference engines and serving systems leveraging GPUs and other hardware accelerators
- Formulate and implement strategic roadmaps for ML inference technologies at team and company level
- Collaborate with cross-functional teams to drive innovative ML projects, applying advanced inference optimization techniques
- Engage extensively with ML engineers across Pinterest to understand their technical requirements, address pain points, and create generalized solutions
- Provide technical mentorship and guidance to junior engineers within the team
Preferred Qualifications
-
No preferred qualifications provided.