Senior AI Modeling Engineer
Company | Modular |
---|---|
Location | Mountain View, CA, USA, Remote in Canada |
Salary | $198000 – $270000 |
Type | Full-Time |
Degrees | |
Experience Level | Senior |
Requirements
- Solid understanding of deep learning architectures and concepts such as transformers, diffusion models, attention, KV cache, and more
- Strong experience with Python, core numerical libraries (like NumPy) and deep learning frameworks (e.g., Pytorch, JAX, Tensorflow)
- Solid Python coding skills
- Good grasp of related mathematical concepts, especially linear algebra
Responsibilities
- Develop AI models, with a focus on GenAI and LLMs, using the MAX platform’s Python APIs
- Research and stay up-to-date on the latest model architectures, such as Llama and DeepSeek
- Implement novel models and architectures based on research papers
- Test and evaluate the inference accuracy of models
- Mentor less experienced engineers on model architectures, implementation details, and effective use of modern APIs like PyTorch
- Collaborate with the MAX Platform team by providing feedback on API design and developer experience
- Work with leads and product managers to estimate and plan the development of new models
Preferred Qualifications
- Kernel development experience (CUDA, TritonLang, etc.)
- Deep learning research experience
- ML API development experience
- Experience training or deploying deep learning models
- Understanding of performance tradeoffs in modern accelerators