Posted in

Senior Data Scientist – Information Retrieval & NLP

Senior Data Scientist – Information Retrieval & NLP

CompanyZoomInfo
LocationSan Francisco, CA, USA, Waltham, MA, USA, Vancouver, WA, USA, Bethesda, MD, USA
Salary$167760 – $230670
TypeFull-Time
DegreesMaster’s, PhD
Experience LevelSenior, Expert or higher

Requirements

  • 7+ yrs hands-on ML/NLP experience (or 4+ yrs post-PhD/Master’s) with at least two delivered, revenue-impacting products.
  • Expertise in transformer stacks (BERT/GPT/T5), RAG, vector-based IR, and latency/throughput optimization.
  • Proven track record building NER or entity-resolution systems at 100M+ record scale; knowledge-graph experience is a plus.
  • Strong applied research chops (PyTorch or TensorFlow) paired with software-engineering rigor (Python, Go/Java a plus).
  • Desire to work within MLOps tools and frameworks: Docker, K8s, GitOps, Terraform, feature stores, model registries, automated retraining.
  • Ability to persuade exec and non-tech audiences with data-driven storytelling; comfortable owning strategy & budget.

Responsibilities

  • Invent and productionize Transformer/RAG architectures that surface the right contact, company, or insight.
  • Drive quantization, distillation, and SLM fine-tuning so models stay fast and affordable at petabyte scale.
  • Prototype and launch hybrid dense/sparse retrieval pipelines on vector DBs (Pinecone, Weaviate, FAISS, OpenSearch).
  • Own high-recall NER models that tag people, orgs, locations, and industry-specific entities across multi-language text.
  • Build cross-dataset entity-resolution frameworks that dedupe hundreds of millions of records with sub-second latency; enrich with knowledge-graph signals where valuable.
  • Design large-scale A/B and back-testing plans; close the loop from experiment to KPI uplift.
  • Translate product goals into measurable ML KPIs; influence roadmap, capacity, and investment decisions.
  • Mentor junior scientists/engineers; publish internal requirements documents, external blogs, and present at conferences.

Preferred Qualifications

  • Knowledge-graph experience is a plus.