Data Engineer

Master’s degree (or foreign equivalent) in Computer Science, Information Systems, Data Science, Statistics, Mathematics, Analytics, Civil Engineering or a related field
Two (2) years of experience in the job offered or related occupation
Six (6) months of experience in Hadoop and Hive
Six (6) months of experience in cloud technologies: Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP)
Six (6) months of experience in building data intensive applications, data pipelines, tackling challenging architectural and scalability problems
Six (6) months of experience in infrastructure planning and management of production deployment, post-production support, production bug fixes and production monitoring and on-call support, including collaborating with cross-functional teams, business analysts and customers
Six (6) months of experience in analyzing large data sets from multiple data sources
Six (6) months of experience in SAS or SQL programming languages
Six (6) months of experience in Spark, PySpark, or Scala
Six (6) months of experience in Hadoop architecture or HDFS commands

Develop large scale data structures and pipelines to organize, collect and standardize data to generate insights and address reporting needs
Write ETL (Extract/Transform/Load) processes
Design database systems and develop tools for real-time and offline analytic processing that improve existing systems and expand capabilities
Collaborate with Data Science team to transform data and integrate algorithms and models into automated processes
Test and maintain systems and troubleshoot malfunctions
Leverage knowledge of Hadoop architecture, HDFS commands, and designing and optimizing queries to build data pipelines
Utilize programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems
Build data marts and data models to support Data Science and other internal customers
Integrate data from a variety of sources and ensure adherence to data quality and accessibility standards
Analyze current information technology environments to identify and assess critical capabilities and recommend solutions to complex business problems
Experiment with available tools and advise on new tools to provide optimal solutions that meet the requirements dictated by the model/use case

No preferred qualifications provided.