Posted in

Next-Gen AI Hardware and Systems

Next-Gen AI Hardware and Systems

CompanyxAI
LocationPalo Alto, CA, USA, San Francisco, CA, USA
Salary$180000 – $500000
TypeFull-Time
Degrees
Experience LevelSenior, Expert or higher

Requirements

  • Proficiency in coding for accelerators, including CUDA, cutlass, HIP, ROCm, Triton, and other assembly languages or DSLs.
  • Experience simulating training workloads on novel AI hardware architectures.
  • Experience with compiler toolchain development and a deep understanding of the advantages and disadvantages of modular tools such as OpenXLA, MLIR, and LLVM.
  • Hands-on experience with distributed training and/or high-QPS production inference.
  • Deep understanding of the microarchitecture of AI accelerators and GPUs.
  • R&D experience in using AI for software compiler and/or hardware design.

Responsibilities

  • Optimizing front-end compilers to transform training and inference workloads into compute graphs for new AI hardware, estimating their performance.
  • Writing PTX-level or lower-level codes for critical kernels to maximize performance on cutting-edge microarchitectures.
  • Designing and refining new hardware architectures to push the boundaries of computational efficiency.
  • Leveraging AI-driven approaches to revolutionize software compiler development and hardware design processes.

Preferred Qualifications

    No preferred qualifications provided.