Posted in

Principal GenAI Data Infrastructure Engineer – 3D Foundation Model

Principal GenAI Data Infrastructure Engineer – 3D Foundation Model

CompanyRoblox
LocationSan Mateo, CA, USA
Salary$289460 – $338270
TypeFull-Time
DegreesBachelor’s
Experience LevelSenior, Expert or higher

Requirements

  • Minimum 8+ years of professional experience
  • Significant experience working with and processing very large datasets (Petabytes or more)
  • Proficient in writing clean, efficient, well-tested code in languages like Python, C++, or Go
  • Experience with cloud data platforms and distributed processing technologies (e.g., Spark, Ray, Kubeflow, S3, etc.)
  • Bachelor’s degree or higher in Computer Science, Computer Engineering, Data Science, or a similar technical field

Responsibilities

  • Design, build, and own critical components of the data infrastructure supporting generative AI efforts
  • Develop sophisticated systems for crawling and extracting diverse data from the Roblox platform
  • Implement robust pipelines and tooling for cleaning, transforming, and curating petabyte-scale datasets
  • Design and optimize data storage solutions for distributed model training workloads
  • Collaborate closely with ML Engineers and Data Scientists to understand their data requirements
  • Drive improvements in data infrastructure architecture, engineering best practices, and operational excellence
  • Ensure data quality and governance are implemented at the system level through automated checks and monitoring

Preferred Qualifications

  • Passionate about the potential of generative AI, particularly in creative domains like 3D/4D content
  • Thrive in building complex, high-scale distributed systems from the ground up
  • Value collaboration and enjoy working closely with cross-functional teams
  • Comfortable operating in a dynamic, fast-paced research environment where challenges and requirements can evolve