Big Data Tools Engineer – Software Dev Engineer I
Company | Yahoo |
---|---|
Location | United States |
Salary | $88500 – $184375 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s |
Experience Level | Entry Level/New Grad, Junior |
Requirements
- A Bachelor’s or Master’s degree in Computer Science or equivalent work experience
- Proficiency in Python is essential for scripting and workflow management; experience with Java and C++ is preferred for backend data operations
- Knowledge of data structures, algorithms, and database management systems like SQL, HBase, and BigQuery
- Experience with cloud services, especially AWS (EMR, Glue, S3) and GCP (Dataproc, BigQuery)
- Comfortable working in an Agile environment with regular sprints, planning, and retrospectives
- Ability to design large-scale, distributed systems that are highly available and resilient
- Some experience working with Linux/Unix operating systems
Responsibilities
- Overseeing the execution of various data jobs, ensuring they adhere to SLAs and do not encounter issues
- Utilizing tools like Airflow to manage the scheduling, execution, and monitoring of data workflows across cloud platforms such as AWS and GCP
- Designing and optimizing queries to run efficiently on platforms such as BigQuery, Hive, Pig, and Spark, ensuring high performance and scalability
- Collaborating with different teams to integrate data flows, provide support for query executions, and handle credentials for secure data operations
- Implementing new features to support advanced query capabilities, including federated queries and lineage tracking
Preferred Qualifications
- Experience with development and deployment on public cloud platforms such as AWS, GCP, Azure, or others
- Experiencing developing containerized applications and working with container orchestration services
- Experience with Apache Hadoop, Presto, Hive, Oozie, Pig, Storm, Spark, Jupyter
- Understanding of data structures & algorithms
- Knowledge of JVM internals and its performance tuning
- Excellent debugging/testing skills, and excellent analytical and problem solving skills
- Experience with continuous integration tools such as Jenkins and Hudson
- Strong verbal and written communication skills to collaborate effectively with cross-functional teams