Senior Associate – Data Engineer – AI and Automation
Company | Pfizer |
---|---|
Location | Phoenixville, PA, USA, Cork, Ireland, New York, NY, USA, Dublin, Ireland |
Salary | $80300 – $133900 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s |
Experience Level | Mid Level, Senior |
Requirements
- A bachelor’s or master’s degree in computer science, Artificial Intelligence, Machine Learning, or a related discipline.
- Over 3 years of experience as a Data Engineer, Data Architect, or in Data Warehousing, Data Modeling, and Data Transformations.
- Over 1 years of experience in AI, machine learning, and large language models (LLMs) development and deployment.
- Proven track record of successfully implementing AI solutions in a healthcare or pharmaceutical setting is preferred.
- Strong understanding of data structures, algorithms, and software design principles
- Experience in Python, SQL, and familiarity with Java or Scala.
- Familiarity with Hadoop, Spark, and Kafka for big data processing.
Responsibilities
- Responsible for data modeling and engineering within the advanced data platforms teams to achieve digital outcomes. Create test plans, test scripts, and perform data validation.
- Conceive, design, and implement Cloud Data Lake, Data Warehouse, Data Marts, and Data APIs.
- Develop complex data products that are beneficial for Pfizer Global Supply and allow for reusability across enterprise.
- Ability to collaborate with contractors to deliver technical enhancements.
- Develop automated systems for building, testing, monitoring, and deploying ETL data pipelines within a continuous integration environment.
- Develop internal APIs and data solutions to enhance application functionality and facilitate connectivity.
- Collaborate with backend engineering teams to analyze data, enhancing its quality and consistency.
- Conduct root cause analysis and address production data issues.
- Design, develop, and implement AI models and algorithms to solve sophisticated data analytics and supply chain initiatives.
- Stay abreast of the latest advancements in AI and machine learning technologies and apply them to Pfizer’s projects.
- Provide technical expertise and guidance to team members and stakeholders on AI-related initiatives.
- Document and present findings, methodologies, and project outcomes to various stakeholders.
- Integrate and collaborate with different technical teams across Digital to drive overall implementation and delivery.
- Ability to work with large and complex datasets, including data cleaning, preprocessing, and feature selection.
Preferred Qualifications
- Experience with data warehousing solutions such as Amazon Redshift, Google BigQuery, or Snowflake.
- Knowledge of ETL tools like Apache NiFi, Talend, or Informatica.
- Hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP).
- Understanding of Docker and Kubernetes for containerization and orchestration.
- Knowledge of AI-driven tools for data pipeline automation, such as Apache Airflow or Prefect. Ability to use GenAI or Agents to augment data engineering practices
- Skills in integrating data from various sources, including APIs, databases, and external files.
- Understanding of data modeling and database design principles, including graph technologies like Neo4j or Amazon Neptune.
- Proficiency in handling structured data from relational databases, data warehouses, and spreadsheets.
- Experience with unstructured data sources such as text, images, and log files, and tools like Apache Solr or Elasticsearch.
- Familiarity with data excellence concepts, including data governance, data quality management, and data stewardship.