Machine Learning Research Scientist/Engineer – Audio
Company | Scale AI |
---|---|
Location | Seattle, WA, USA, San Francisco, CA, USA, New York, NY, USA |
Salary | $200000 – $325000 |
Type | Full-Time |
Degrees | Master’s, PhD |
Experience Level | Senior, Expert or higher |
Requirements
- Ph.D. or Master’s degree in Computer Science, Machine Learning, Electrical Engineering, or a related field with a focus on speech or audio processing
- Deep understanding of neural generative models for audio, including recent advances in transformer-based architectures, diffusion models, and large-scale pretraining for speech
- Experience with fine-tuning, reinforcement learning, or reward modeling for audio quality optimization or speech preference modeling
- A strong research portfolio with publications at top venues such as Interspeech, NeurIPS, ICLR, ICML, ACL, or similar
- Exceptional written and verbal communication skills, with the ability to clearly convey complex technical ideas to both internal and external partners
Responsibilities
- Develop novel training and post-training techniques for audio models, advancing core areas such as speaker adaptation, prosody control, noise robustness, etc.
- Design new reward models and preference optimization techniques specifically for speech quality, emotion, and intelligibility in TTS, STS, and ASR
- Investigate model failure modes in realistic deployment settings and propose scalable solutions for bias mitigation, robustness to accents, and long-tail speaker variation
- Create industry-standard evaluations for the world’s leading speech models
- Publish breakthrough research at leading conferences and shape best practices across the speech research community.
Preferred Qualifications
- Prior experience working directly with model training, data curation, or evaluation at scale is a significant plus.