Principal GenAI Data Infrastructure Engineer – 3D Foundation Model
| Company | Roblox |
|---|---|
| Location | San Mateo, CA, USA |
| Salary | $289,460 – $338,270 |
| Type | Full-Time |
| Degrees | Bachelor’s |
| Experience Level | Senior, Expert or higher |
Requirements
- Minimum of 8 years of professional experience
- Significant experience working with and processing very large datasets (petabytes or more)
- Proficient in writing clean, efficient, well-tested code in languages like Python, C++, or Go
- Experience with cloud data platforms and distributed processing technologies (e.g., Spark, Ray, Kubeflow, S3)
- Bachelor’s degree or higher in Computer Science, Computer Engineering, Data Science, or a similar technical field
Responsibilities
- Design, build, and own critical components of the data infrastructure supporting generative AI efforts
- Develop sophisticated systems for crawling and extracting diverse data from the Roblox platform
- Implement robust pipelines and tooling for cleaning, transforming, and curating petabyte-scale datasets
- Design and optimize data storage solutions for distributed model training workloads
- Collaborate closely with ML Engineers and Data Scientists to understand their data requirements
- Drive improvements in data infrastructure architecture, engineering best practices, and operational excellence
- Ensure data quality and governance are implemented at the system level through automated checks and monitoring
Preferred Qualifications
- Passionate about the potential of generative AI, particularly in creative domains like 3D/4D content
- Thrive on building complex, high-scale distributed systems from the ground up
- Value collaboration and enjoy working closely with cross-functional teams
- Comfortable operating in a dynamic, fast-paced research environment where challenges and requirements can evolve