Machine Learning Data Engineer, Replica Pipelines

Karlsruhe · KarlsruheFull-timemid
Data ScienceData EngineeringData EngineerMachine LearningData & AI
0 views0 saves0 applied

Quick Summary

Requirements Summary

The chance to contribute to the advancement of autonomous systems and AI. Collaborative culture: A dynamic and supportive work environment where your ideas are valued.

Technical Tools
Data ScienceData EngineeringData EngineerMachine LearningData & AI
Parallel Domain is building the world’s most advanced simulation and digital twin platform for autonomy, robotics, and computer vision. Our Replica product creates large-scale, photorealistic digital twins of real-world environments used for testing, validation, and development of autonomous systems.
  • We are hiring a Machine Learning Data Engineer responsible for building and scaling the data pipelines that support Replica and ML model development. You will ensure that data flows efficiently from raw customer inputs through validated, structured formats suitable for training, evaluation, and production systems.
  • Own data ingestion: Build reliable pipelines to normalize and validate customer and synthetic data.
  • Define data standards: Create schemas, validation checks, and quality metrics for Replica datasets.
  • Build curation tooling: Implement tools for dataset filtering, versioning, and annotation support.
  • Enable ML workflows: Generate high-quality data feeds for training and evaluation across ML models.
  • Data engineering experience: Proven experience building scalable data pipelines and tooling.
  • ML-aware engineering: Understanding of how data is used in model training and evaluation. 
  • 3D Foundations: Practical experience with 3D concepts, geometry, and the linear algebra principles underpinning computer vision (e.g., projections, transformations)
  • Technical skills: Strong Python proficiency and comfort with large datasets.
  • Collaborative mindset: Experience working closely with ML engineers on data needs.
  • Advanced degree: MS or PhD in ML, computer vision, robotics, or related field.
  • Cloud/infra experience: Familiarity with cloud storage and distributed processing frameworks.
  • Robotics data knowledge: Experience handling camera, lidar, or radar data 
  • Visualization tools experience: Familiarity with data visualization systems like Foxglove, Rerun, or Voxel51
  • MLOps tooling exposure: Experience with dataset versioning, preprocessing automation, or training pipeline orchestration.
  • Competitive compensation: Salary dependent on your skills, qualifications, experience, and location.
  • Impactful work: The chance to contribute to the advancement of autonomous systems and AI.
  • Collaborative culture: A dynamic and supportive work environment where your ideas are valued.
  • Professional growth: Opportunities to learn and develop your skills in a cutting-edge field.
  • Listing Details

    Posted
    January 12, 2026
    First seen
    March 26, 2026
    Last seen
    April 24, 2026

    Posting Health

    Days active
    28
    Repost count
    0
    Trust Level
    25%
    Scored at
    April 24, 2026

    Signal breakdown

    freshnesssource trustcontent trustemployer trust
    Newsletter

    Stay ahead of the market

    Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

    A
    B
    C
    D
    Join 12,000+ marketers

    No spam. Unsubscribe at any time.

    P
    Machine Learning Data Engineer, Replica Pipelines