Next-Gen AI Hardware and Systems

Company	xAI
Location	Palo Alto, CA, USA, San Francisco, CA, USA
Salary	$180000 – $500000
Type	Full-Time
Degrees
Experience Level	Senior, Expert or higher

Proficiency in coding for accelerators, including CUDA, cutlass, HIP, ROCm, Triton, and other assembly languages or DSLs.
Experience simulating training workloads on novel AI hardware architectures.
Experience with compiler toolchain development and a deep understanding of the advantages and disadvantages of modular tools such as OpenXLA, MLIR, and LLVM.
Hands-on experience with distributed training and/or high-QPS production inference.
Deep understanding of the microarchitecture of AI accelerators and GPUs.
R&D experience in using AI for software compiler and/or hardware design.

Optimizing front-end compilers to transform training and inference workloads into compute graphs for new AI hardware, estimating their performance.
Writing PTX-level or lower-level codes for critical kernels to maximize performance on cutting-edge microarchitectures.
Designing and refining new hardware architectures to push the boundaries of computational efficiency.
Leveraging AI-driven approaches to revolutionize software compiler development and hardware design processes.

No preferred qualifications provided.