Data Engineer

Company: Discord
Location: San Francisco, CA, USA
Salary: $160,000 – $180,000
Type: Full-Time
Experience Level: Junior, Mid Level

Requirements

  • 2+ years of experience building data pipelines in production with deep knowledge of performant scalable patterns
  • 2+ years of experience in designing, developing, and maintaining robust data models from structured and unstructured sources
  • 2+ years of experience writing accurate and effective code in SQL and Python
  • Experience implementing and monitoring audits for data quality with massive data sets (e.g. billions of rows)
  • Experience proactively identifying opportunities to improve the performance and reduce the cost of ETL jobs and dashboards
  • Excellent communication skills and experience thriving in ambiguous environments where problems are not well-defined and evolve quickly
  • A desire to work with amazing, passionate people who care deeply about solving challenging problems to improve Discord
  • A collaborative attitude and a healthy dose of natural curiosity

Responsibilities

  • Create and maintain data pipelines and foundational datasets to support analytics, modeling, experimentation, and product/business needs
  • Design and build database architectures with massive and complex data, balancing ergonomic benefits with computational load and cost
  • Collaborate closely with data science and engineering teams to improve the coverage, accuracy, and reliability of instrumentation
  • Develop audits for data quality at scale, implementing alerting and anomaly detection as necessary
  • Create scalable dashboards and reports to support business objectives and enable data-driven decision making
  • Partner with data scientists, engineers, and product teams to accomplish all of the above!

Preferred Qualifications

  • Passion for Discord or online communities
  • Experience owning and proactively improving the data models for a functional area
  • Experience collaborating directly with data science and product engineering teams
  • Experience with modern data storage and processing technologies (e.g. BigQuery SQL, Looker, Airflow, and dbt or similar)
  • Experience with designing data architecture to power a variety of use cases, including experimentation
  • Experience with advertising products and third-party data ingestion is a strong plus