Posted in

Machine Learning Research Scientist/Engineer – Audio

Machine Learning Research Scientist/Engineer – Audio

CompanyScale AI
LocationSeattle, WA, USA, San Francisco, CA, USA, New York, NY, USA
Salary$200000 – $325000
TypeFull-Time
DegreesMaster’s, PhD
Experience LevelSenior, Expert or higher

Requirements

  • Ph.D. or Master’s degree in Computer Science, Machine Learning, Electrical Engineering, or a related field with a focus on speech or audio processing
  • Deep understanding of neural generative models for audio, including recent advances in transformer-based architectures, diffusion models, and large-scale pretraining for speech
  • Experience with fine-tuning, reinforcement learning, or reward modeling for audio quality optimization or speech preference modeling
  • A strong research portfolio with publications at top venues such as Interspeech, NeurIPS, ICLR, ICML, ACL, or similar
  • Exceptional written and verbal communication skills, with the ability to clearly convey complex technical ideas to both internal and external partners

Responsibilities

  • Develop novel training and post-training techniques for audio models, advancing core areas such as speaker adaptation, prosody control, noise robustness, etc.
  • Design new reward models and preference optimization techniques specifically for speech quality, emotion, and intelligibility in TTS, STS, and ASR
  • Investigate model failure modes in realistic deployment settings and propose scalable solutions for bias mitigation, robustness to accents, and long-tail speaker variation
  • Create industry-standard evaluations for the world’s leading speech models
  • Publish breakthrough research at leading conferences and shape best practices across the speech research community.

Preferred Qualifications

  • Prior experience working directly with model training, data curation, or evaluation at scale is a significant plus.