Member of Technical Staff – Data, World Models

Posted 1ds ago

Employment Information

Education
Salary
Experience
Job Type

Report this job

Job expired or something wrong with this job?

Job Description

Technical Staff member building and managing data pipelines for AI model training at Moonvalley. Seeking candidates with data engineering expertise and ML knowledge.

Responsibilities:

  • Design, automate, maintain, and optimize Python ETL pipelines (Spark/Ray) for large-scale multimodal data.
  • Build and maintain data cataloging, lineage, quality tooling, integrity verification, access controls, and lifecycle management systems.
  • Provide guidance, internal tools, and documentation to colleagues on data best practices.
  • Serve as a custodian of the company’s datasets, ensuring overall data health, quality, and discoverability.

Requirements:

  • Knowledge of Python ETL pipelines and supporting infrastructure, data formats, and storage systems at scale.
  • Experience managing datasets, annotations, and data versioning for model training.
  • Solid grasp of ML fundamentals is essential to collaborate effectively with researchers.
  • Skilled at writing high-quality specifications for AI agents.

Benefits:

  • Competitive salary and equity
  • Private health coverage
  • Pension contribution (UK, Canada, US)
  • Unlimited paid vacation
  • Fully-distributed, async-first culture
  • Hardware setup of your choice
  • Stipends for phone, internet, and meals