Big Data Engineer

Company	Synchrony Financial
Location	Newport Beach, CA, USA; Bentonville, AR, USA; Stamford, CT, USA; Dallas, TX, USA; Chicago, IL, USA; Altamonte Springs, FL, USA; Charlotte, NC, USA; Alpharetta, GA, USA; Rapid City, SD, USA; New York, NY, USA; Phoenix, AZ, USA; Kansas City, KS, USA; Canton, OH, USA; St Paul, MN, USA; Cincinnati, OH, USA
Salary	$85,000 – $140,000
Type	Full-Time
Degrees	Bachelor’s
Experience Level	Junior, Mid Level

Requirements

Bachelor’s degree in a quantitative field (such as Engineering, Computer Science, Statistics, Econometrics); in lieu of a degree, a High School Diploma/GED and a minimum of 2 years of Information Technology experience
Hands-on experience with Git and with writing shell scripts, complex SQL queries, Hive scripts and Hadoop commands
Ability to write abstracted, reusable code components
Programming experience in at least one of the following languages: Scala, Java or Python
Analytical mindset
Willingness and aptitude to learn new technologies quickly
Superior oral and written communication skills
Ability to collaborate across teams of internal and external technical staff, business analysts, software support and operations staff

Responsibilities

Develop big data applications for Synchrony in the Hadoop ecosystem
Participate in the agile development process including backlog grooming, coding, code reviews, testing and deployment
Work with team members to achieve business results in a fast-paced and quickly changing environment
Work independently to develop analytic applications leveraging technologies such as: Hadoop, NoSQL, In-memory Data Grids, Kafka, Spark, Ab Initio
Provide data analysis for Synchrony’s data ingestion, standardization and curation efforts, ensuring all data is understood from a business context
Identify enablers and level of effort required to properly ingest and transform data for the data lake
Profile data to assist with defining the data elements, propose business term mappings, and define data quality rules
Work with the Data Office to ensure that data dictionaries for all ingested and created data sets are properly documented in the data dictionary repository
Ensure the lineage of all data assets is properly documented in the appropriate enterprise metadata repositories
Assist with the creation and implementation of data quality rules
Ensure the proper identification of sensitive data elements and critical data elements
Create source-to-target data mapping documents
Test current processes and identify deficiencies
Investigate program quality and make improvements that increase data accuracy
Understand functional and non-functional requirements and prepare test data accordingly
Plan, create and manage test cases and test scripts
Identify process bottlenecks and suggest actions for improvement
Execute test scripts and collect test results
Present test cases, test results, reports and metrics as required by the Office of Agile
Perform other duties as needed to ensure the success of the team and application, and ensure the team’s compliance with the applicable Data Sourcing, Data Quality, and Data Governance standards

Preferred Qualifications

Strong business acumen including a broad understanding of Synchrony business processes and practices
Demonstrated ability to work effectively in an agile team environment
Financial industry or credit processing experience
Experience working on a geographically distributed team, managing onshore/offshore resources with shifting priorities
Previous experience working in a client-facing environment
Proficient in the maintenance of data dictionaries and other information in Collibra
Excellent analytical, organizational and influencing skills with a proven track record of successfully executing on assignments and initiatives
Performance tuning experience
Exposure to the following Ab Initio tools: GDE – Graphical Development Environment; Co>Operating System; Control Center; Metadata Hub; Enterprise Meta>Environment; Enterprise Meta>Environment Portal; Acquire>It; Express>It; Conduct>It; Data Quality Environment; Query>It
Familiar with Ab Initio, Hortonworks/Cloudera, Zookeeper, Oozie and Kafka
Familiar with public cloud (e.g., AWS, GCP, Azure) data engineering services
Familiar with data management tools (e.g., Collibra)
Background in ETL, data warehousing or data lakes