Big Data Engineer

Company: Synchrony Financial
Location: Newport Beach, CA, USA; Bentonville, AR, USA; Stamford, CT, USA; Dallas, TX, USA; Chicago, IL, USA; Altamonte Springs, FL, USA; Charlotte, NC, USA; Alpharetta, GA, USA; Rapid City, SD, USA; New York, NY, USA; Phoenix, AZ, USA; Kansas City, KS, USA; Canton, OH, USA; St Paul, MN, USA; Cincinnati, OH, USA
Salary: $85,000 – $140,000
Type: Full-Time
Degrees: Bachelor’s
Experience Level: Junior, Mid Level

Requirements

  • Bachelor’s degree in a quantitative field (such as Engineering, Computer Science, Statistics, Econometrics); in lieu of a degree, a High School Diploma/GED and a minimum of 2 years of Information Technology experience
  • Hands-on experience writing shell scripts, complex SQL queries, Hive scripts, Hadoop commands and Git
  • Ability to write abstracted, reusable code components
  • Programming experience in at least one of the following languages: Scala, Java or Python
  • Analytical mindset
  • Willingness and aptitude to learn new technologies quickly
  • Superior oral and written communication skills
  • Ability to collaborate across teams of internal and external technical staff, business analysts, software support and operations staff

Responsibilities

  • Develop big data applications for Synchrony in the Hadoop ecosystem
  • Participate in the agile development process including backlog grooming, coding, code reviews, testing and deployment
  • Work with team members to achieve business results in a fast-paced and quickly changing environment
  • Work independently to develop analytic applications leveraging technologies such as: Hadoop, NoSQL, In-memory Data Grids, Kafka, Spark, Ab Initio
  • Provide data analysis for Synchrony’s data ingestion, standardization and curation efforts, ensuring all data is understood from a business context
  • Identify enablers and the level of effort required to properly ingest and transform data for the data lake
  • Profile data to assist with defining the data elements, propose business term mappings, and define data quality rules
  • Work with the Data Office to ensure that data dictionaries for all ingested and created data sets are properly documented in the data dictionary repository
  • Ensure the lineage of all data assets is properly documented in the appropriate enterprise metadata repositories
  • Assist with the creation and implementation of data quality rules
  • Ensure the proper identification of sensitive data elements and critical data elements
  • Create source-to-target data mapping documents
  • Test current processes and identify deficiencies
  • Investigate program quality to make improvements to achieve better data accuracy
  • Understand functional and non-functional requirements and prepare test data accordingly
  • Plan, create and manage test cases and test scripts
  • Identify process bottlenecks and suggest actions for improvement
  • Execute test scripts and collect test results
  • Present test cases, test results, reports and metrics as required by the Office of Agile
  • Perform other duties as needed to ensure the success of the team and application and ensure the team’s compliance with the applicable Data Sourcing, Data Quality, and Data Governance standards

Preferred Qualifications

  • Strong business acumen including a broad understanding of Synchrony business processes and practices
  • Demonstrated ability to work effectively in an agile team environment
  • Financial Industry or Credit processing experience
  • Experience working on a geographically distributed team, managing onshore/offshore resources with shifting priorities
  • Previous experience working in a client-facing environment
  • Proficient in the maintenance of data dictionaries and other information in Collibra
  • Excellent analytical, organizational and influencing skills with a proven track record of successfully executing on assignments and initiatives
  • Performance tuning experience
  • Exposure to the following Ab Initio tools: GDE (Graphical Development Environment); Co>Operating System; Control Center; Metadata Hub; Enterprise Meta>Environment; Enterprise Meta>Environment Portal; Acquire>It; Express>It; Conduct>It; Data Quality Environment; Query>It
  • Familiar with Ab Initio, Hortonworks/Cloudera, Zookeeper, Oozie and Kafka
  • Familiar with Public Cloud (e.g. AWS, GCP, Azure) data engineering services
  • Familiar with data management tools (e.g. Collibra)
  • Background in ETL, data warehousing or data lakes