Posted in

Data Engineer

Data Engineer

CompanyCVS Health
LocationIrving, TX, USA
Salary$122949 – $140000
TypeFull-Time
DegreesMaster’s
Experience LevelJunior, Mid Level

Requirements

  • Master’s degree (or foreign equivalent) in Computer Science, Information Systems, Data Science, Statistics, Mathematics, Analytics, Civil Engineering or a related field
  • Two (2) years of experience in the job offered or related occupation
  • Six (6) months of experience in Hadoop and Hive
  • Six (6) months of experience in cloud technologies: Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP)
  • Six (6) months of experience in building data intensive applications, data pipelines, tackling challenging architectural and scalability problems
  • Six (6) months of experience in infrastructure planning and management of production deployment, post-production support, production bug fixes and production monitoring and on-call support, including collaborating with cross-functional teams, business analysts and customers
  • Six (6) months of experience in analyzing large data sets from multiple data sources
  • Six (6) months of experience in SAS or SQL programming languages
  • Six (6) months of experience in Spark, PySpark, or Scala
  • Six (6) months of experience in Hadoop architecture or HDFS commands

Responsibilities

  • Develop large scale data structures and pipelines to organize, collect and standardize data to generate insights and address reporting needs
  • Write ETL (Extract/Transform/Load) processes
  • Design database systems and develop tools for real-time and offline analytic processing that improve existing systems and expand capabilities
  • Collaborate with Data Science team to transform data and integrate algorithms and models into automated processes
  • Test and maintain systems and troubleshoot malfunctions
  • Leverage knowledge of Hadoop architecture, HDFS commands, and designing and optimizing queries to build data pipelines
  • Utilize programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems
  • Build data marts and data models to support Data Science and other internal customers
  • Integrate data from a variety of sources and ensure adherence to data quality and accessibility standards
  • Analyze current information technology environments to identify and assess critical capabilities and recommend solutions to complex business problems
  • Experiment with available tools and advise on new tools to provide optimal solutions that meet the requirements dictated by the model/use case

Preferred Qualifications

    No preferred qualifications provided.