Robotics Data Engineer

Home Working at MBZUAI Vacancies Robotics Data Engineer

Vacancy Overview

Application Open:

Full-Time

 

Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) is seeking a highly skilled Robotics Data Engineer with strong technical hands-on experience to join the Robot Learning Laboratory, a state-of-the-art core facility. This role is ideal for candidates who have a strong background in storage and computer engineering, with a passion for building specialized data infrastructure for robotics research. With an emphasis on practical implementation complemented by research-driven innovation, this position focuses on designing and implementing systems for collecting, processing, and managing multi-modal sensor data from robots and simulation environments.

 

Key Responsibilities

 

Robotics Data Collection and Ingestion

  • Design and implement data collection systems for physical robots (mobile robots, manipulators, drones)
  • Build ROS/ROS2 data recording pipelines with multi-sensor synchronization
  • Develop rosbag processing tools for extraction, conversion, and validation
  • Implement real-time data streaming from robots to data lake (edge-to-cloud pipelines)
  • Build sensor calibration and synchronization frameworks (camera-LiDAR, IMU fusion)
  • Design data collection protocols and quality standards for robotics experiments
  • Develop on-robot data preprocessing and compression for bandwidth optimization

Sensor Data Processing

  • Build processing pipelines for diverse sensor modalities:
  • Vision: Camera images, stereo pairs, RGB-D, event cameras
  • LiDAR: 3D point clouds, 2D laser scans, multi-LiDAR fusion
  • IMU: Accelerometer, gyroscope, magnetometer data
  • Odometry: Wheel encoders, visual odometry, GPS
  • Force/Torque: Contact sensors, tactile sensors
  • Implement sensor data transformations (coordinate frames, time synchronization)
  • Develop data quality checks (sensor health, data completeness, outlier detection)
  • Build data compression and format conversion tools (ROS → Parquet/HDF5)
  • Implement sensor data visualization tools (point clouds, trajectories, video playback)

Robotics Data Infrastructure

  • Design storage schemas for robotics datasets (trajectories, episodes, experiments)
  • Build metadata management for robotics data (robot configuration, environment, task)
  • Implement data versioning and provenance tracking for reproducibility
  • Develop data indexing and search systems for efficient retrieval
  • Build data annotation tools for robotics tasks (object labeling, trajectory segmentation)
  • Design data sharing and collaboration workflows for research teams
  • Implement data access APIs optimized for robotics workloads

SLAM and Mapping Data Management

  • Process and store SLAM outputs (maps, trajectories, loop closures)
  • Build map database systems (occupancy grids, point cloud maps, semantic maps)
  • Implement map versioning and comparison tools
  • Develop map visualization and analysis tools
  • Build map merging and alignment pipelines for multi-robot systems
  • Design efficient storage formats for large-scale maps (compression, tiling)

Integration with Robotics Stack

  • Integrate data pipelines with ROS/ROS2 ecosystems
  • Build ROS nodes for data streaming and processing
  • Develop custom ROS message types and services for data platform
  • Implement ROS bag replay and simulation integration
  • Build bridges between ROS and data lake (Kafka, S3)
  • Create ROS-based monitoring and debugging tools
  • Develop ROS packages for data collection and quality checks

Real-Time Data Processing

  • Build low-latency data processing pipelines for online learning
  • Implement streaming analytics for robot telemetry and health monitoring
  • Develop real-time anomaly detection for sensor failures
  • Build event-driven architectures for robot state changes
  • Implement data buffering and replay systems for debugging
  • Design edge computing solutions for on-robot processing
  • Optimize data pipelines for minimal latency (<100ms)

Field Support and Troubleshooting

  • Provide on-site support for robotics experiments and data collection
  • Troubleshoot data collection issues in real-time during experiments
  • Conduct data quality validation immediately after experiments
  • Train researchers on data collection best practices
  • Maintain and upgrade data collection hardware and software
  • Participate in on-call rotation for critical robotics experiments
  • Document issues and solutions in knowledge base

Academic Qualifications Required

  • Master degree in Robotics, Electrical Engineering, Computer Science, or a related field.
  • A PhD degree will be preferred.

Professional Experience Required
Essential:

  • 3+ years in robotics software development or robotics data engineering.
  • 2+ years hands-on experience with ROS/ROS2 in production or research environments.
  • Proven track record building robotics data pipelines or sensor processing systems.
  • Excel in multi-tasking, organization, communication, collaboration, conflict resolution, work ethic, and time-management to thrive under pressure with a results-driven service mindset.

Robotics Expertise:

  • ROS/ROS2: Deep knowledge of ROS architecture, topics, services, actions, tf, rosbag
  • Sensors: Hands-on experience with cameras, LiDAR, IMU, GPS, and sensor drivers
  • Sensor Fusion: Understanding of multi-sensor calibration and synchronization
  • SLAM: Familiarity with SLAM algorithms and outputs (ORB-SLAM, Cartographer, RTAB-Map)
  • Coordinate Frames: Strong understanding of tf, coordinate transformations, and kinematics
  • Robotics Middleware: Experience with ROS message types, custom messages, and serialization

Software Engineering:

  • Programming: Expert-level Python and C++; experience with ROS client libraries (rospy, rclpy, roscpp)
  • Data Processing: Experience with NumPy, Pandas, OpenCV, PCL (Point Cloud Library)
  • Data Formats: Knowledge of rosbag, HDF5, Parquet, Protocol Buffers
  • Version Control: Proficiency with Git, GitLab/GitHub workflows
  • Linux: Strong Linux skills (Ubuntu, command line, system administration)

Data Engineering:

  • Pipelines: Experience building ETL/ELT pipelines with Airflow, Spark, or similar
  • Storage: Familiarity with object storage (S3, MinIO), databases (PostgreSQL, MongoDB)
  • Streaming: Understanding of real-time data streaming (Kafka, RabbitMQ)
  • Cloud: Basic knowledge of cloud platforms (AWS, Azure, GCP)

Preferred Skills

  • Advanced Robotics: Experience with mobile robots, manipulators, or drones in research or production
  • Computer Vision: Deep knowledge of vision algorithms (object detection, tracking, segmentation)
  • 3D Processing: Experience with point cloud processing, 3D reconstruction, or mesh generation
  • Simulation: Familiarity with Gazebo, Isaac Sim, CARLA, or other robotics simulators
  • Real-Time Systems: Understanding of real-time constraints and latency optimization
  • Hardware: Experience with sensor hardware setup, calibration, and troubleshooting
  • Machine Learning: Familiarity with ML for robotics (imitation learning, reinforcement learning)
  • Open Source: Contributions to ROS packages or robotics open-source projects
  • Research: Publications in robotics conferences (ICRA, IROS, RSS) or experience in research labs
  • Multi-Robot: Experience with multi-robot systems and distributed robotics

 

 

Apply Now:

Drag & Drop Files, Choose Files to Upload
Drag & Drop Files, Choose Files to Upload