Posted in

Lead Data Engineer

Lead Data Engineer

CompanyCollective Health
LocationSan Francisco, CA, USA, Plano, TX, USA, Lehi, UT, USA
Salary$134500 – $220500
TypeFull-Time
Degrees
Experience LevelSenior, Expert or higher

Requirements

  • 8+ years of data engineering experience in fast-paced, data-driven environments.
  • Expertise in building scalable ETL pipelines with Spark (PySpark or Scala) and SQL.
  • Deep understanding of data architecture, schema design, and dimensional modeling for analytics and machine learning.
  • Proficiency in distributed systems such as Spark, Databricks, or Snowflake.
  • Experience with event-driven architectures and streaming platforms like Kafka or Kinesis.
  • Excellent communication skills – ability to collaborate cross-functionally and translate complex technical concepts into business impact.
  • Mentorship experience – experience guiding engineers and fostering a collaborative, inclusive team culture.
  • Security-first mindset – familiarity with data privacy, encryption, and compliance in healthcare or other regulated industries is a plus.

Responsibilities

  • Architect Scalable Data Solutions – Design, develop, and optimize large-scale data pipelines using Spark (PySpark, Scala), Databricks, and distributed data processing frameworks.
  • Advance Data Modeling & Architecture – Lead the design and evolution of data models to support analytical, operational, and machine-learning requirements.
  • Enhance Data Performance & Reliability – Improve data processing performance, scalability, and reliability, while ensuring data quality and governance.
  • Drive Cross-Functional Collaboration – Partner with Product, Engineering, Data Science, and Analytics teams to deliver high-impact data solutions that generate actionable business and clinical insights.
  • Mentor & Provide Technical Leadership – Guide junior and mid-level engineers, conduct code reviews, and establish best practices in data engineering.
  • Ensure Data Governance & Security – Implement robust security, privacy, and compliance measures for sensitive healthcare data, ensuring adherence to industry regulations.
  • Influence Data Strategy – Provide input on data infrastructure decisions, emerging technologies, and process improvements.

Preferred Qualifications

  • Familiarity with data privacy, encryption, and compliance in healthcare or other regulated industries is a plus.