Machine Learning Research Scientist/Engineer - Audio

Machine Learning Research Scientist/Engineer – Audio

Company	Scale AI
Location	Seattle, WA, USA, San Francisco, CA, USA, New York, NY, USA
Salary	$200000 – $325000
Type	Full-Time
Degrees	Master’s, PhD
Experience Level	Senior, Expert or higher

Ph.D. or Master’s degree in Computer Science, Machine Learning, Electrical Engineering, or a related field with a focus on speech or audio processing
Deep understanding of neural generative models for audio, including recent advances in transformer-based architectures, diffusion models, and large-scale pretraining for speech
Experience with fine-tuning, reinforcement learning, or reward modeling for audio quality optimization or speech preference modeling
A strong research portfolio with publications at top venues such as Interspeech, NeurIPS, ICLR, ICML, ACL, or similar
Exceptional written and verbal communication skills, with the ability to clearly convey complex technical ideas to both internal and external partners

Develop novel training and post-training techniques for audio models, advancing core areas such as speaker adaptation, prosody control, noise robustness, etc.
Design new reward models and preference optimization techniques specifically for speech quality, emotion, and intelligibility in TTS, STS, and ASR
Investigate model failure modes in realistic deployment settings and propose scalable solutions for bias mitigation, robustness to accents, and long-tail speaker variation
Create industry-standard evaluations for the world’s leading speech models
Publish breakthrough research at leading conferences and shape best practices across the speech research community.

Prior experience working directly with model training, data curation, or evaluation at scale is a significant plus.