Data Engineer III

Company	Robert Half
Location	San Ramon, CA, USA
Salary	$102000 – $150000
Type	Full-Time
Degrees	Bachelor’s, Master’s
Experience Level	Senior

Requirements

Bachelor’s degree in Computer Science, Information Systems, Engineering, a related technical discipline, or equivalent work experience required
Master’s or equivalent experience strongly preferred
5+ years working with Python and SQL in a data engineering context for end-to-end big data ML / analytics workloads
3+ years leading design, development, and troubleshooting of cloud data services (e.g., AWS), container orchestration (e.g., Docker, Kubernetes/EKS), and IaC tools (e.g., Terraform, CloudFormation) for data ingestion pipelines
3+ years working with cross-functional Data Science software engineering teams, collaborating with Data Science, MLOps, and DevOps
3+ years implementing data quality tests, and optimizing SQL/Python code and data models for cost/performance in the cloud
3+ years helping to architect and optimize global ML workflows and data platforms including automated testing, CI/CD, and monitoring in distributed processing (e.g., Spark) and low‑latency architectures
Experience mentoring less-senior team members
Experience and understanding of the full Software Development Lifecycle (SDLC) for Data Science software development
5+ years maintaining and optimizing ETL/ELT job scheduling and monitoring in production environments
5+ years leading design, development & troubleshooting of scalable ETL/ELT pipelines for business-critical applications

Responsibilities

Lead the architecture and design of complex data solutions and pipelines for large-scale or enterprise projects
Define project scope and technical approach for data engineering initiatives, evaluating and incorporating new technologies as needed
Develop and optimize critical data pipelines, ensuring they are maintainable, scalable, and secure
Provide technical leadership and mentorship to less senior data engineers, setting standards for code quality and best practices
Work closely with leadership and other teams to align data engineering solutions with business strategy and requirements
Evaluate and adopt advanced tools or frameworks (e.g., fabric architecture or virtualization technologies) to enhance the data engineering process
Lead the code reviews with the team to debug, and to ensure the quality of the code
Oversee the overall health, security, and scalability of the data infrastructure and pipeline ecosystem
Identify and execute on opportunities to streamline data processing and improve system performance (e.g., refactoring pipelines, updating architectures)
Establish best practices for monitoring, logging, and alerting to ensure data pipeline reliability (setting SLAs, recovery procedures, etc.)
Lead initiatives to scale and modernize the data infrastructure, such as migrating to new platforms or optimizing cloud resource usage for cost-effectiveness
Work with cross-functional teams (including IT and DevOps) to ensure seamless deployment and integration of data engineering solutions into the broader technology environment
Evaluate proposed initiatives, clarify objectives and provide initial feedback
Propose relevant project activities and provide clear, compelling updates to management and peers on a regular basis
Interact frequently with team, business owners and users to maintain a good working relationship with the user community and other partners
Provide consultation to Senior Leadership on complex projects
Provide effective change leadership in driving continuous improvement and innovations to realize measurable productivity results from identified improvement initiatives
Establish and champion data governance frameworks and data quality best practices for the organization
Implement comprehensive data quality strategies (e.g., master data management, advanced validation rules) to ensure trust in enterprise data
Collaborate with senior leadership and other departments to align data engineering efforts with broader business goals and compliance requirements
Lead initiatives to improve data literacy and data sharing, including developing documentation standards and promoting a culture of data stewardship
Serve as a liaison between the data engineering team and other technical/business teams, ensuring effective communication and collaboration on data-related matters
Stay abreast of emerging data technologies and advocate for the adoption of tools that improve data quality, governance, or collaboration (e.g., evaluating new data management platforms)

Preferred Qualifications

Master’s or equivalent experience strongly preferred
Familiarity with data virtualization tools (e.g., Dremio) preferred
Familiarity with Salesforce and Heroku (or similar) data infrastructures preferred
2+ years working with data virtualization tools (e.g., Dremio) and helping to convert Data Science prototypes into scalable services, integrating feature stores and metadata tracking