Senior AI Runtime Engineer
Company | Modular |
---|---|
Location | Mountain View, CA, USA, Remote in Canada |
Salary | $198000 – $286000 |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Senior |
Requirements
- 5+ years of experience working on high-performance computing systems.
- Experience in C++ programming and complex software systems.
- Experience with CPU or GPU runtime optimizations and performance analysis on CPUs, GPUs, or AI accelerators.
- Proficiency with one or more profiling tools (CPU or GPU).
- Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture.
Responsibilities
- Design and develop runtime and cross-stack optimizations to improve CPU and GPU efficiency, addressing issues such as CPU overhead, caching, and data locality across multiple devices.
- Port the Modular runtime stack to new hardware platforms and develop an API to streamline this process.
- Collaborate with the compiler, kernels, serving, and models teams to design core technologies that achieve state-of-the-art end-to-end performance on various CPU and GPU hardware.
- Collaborate with the customer success team and engage with customers to understand their performance requirements and use cases.
- Collaborate with tooling and infrastructure teams to design systems for automated performance analysis and benchmarking.
Preferred Qualifications
- Experience with ML graph optimizations, parallel / distributed programming, heterogeneous ML computation, and/or code generation.
- Exposure to MLIR, LLVM, and/or the Mojo programming language.
- Advanced degree in Computer Science or a related area is a plus.