EdgeFirst Dataset Format
The EdgeFirst Dataset Format is designed to handle multi-sensor datasets with rich annotations. It separates sensor data (immutable) from annotation data (dynamic), allowing both to be managed independently and efficiently.
When Do You Need This?
Most users can skip these details. EdgeFirst Studio handles dataset formats automatically when you:
- Record MCAP files on a Raivin platform and publish them as snapshots
- Import existing datasets through the Studio UI
- Export datasets for offline use
- Train models using ModelPack or Ultralytics
This chapter is for users who need to:
- Build custom ML pipelines outside of Studio
- Integrate EdgeFirst datasets with third-party tools
- Programmatically generate or modify annotations
- Understand the data structures for advanced debugging
Overview
```mermaid
graph TB
    subgraph Dataset["EdgeFirst Dataset"]
        direction TB
        Storage["Storage Container (ZIP or Directory)"]
        Annotations["Annotations (Arrow)"]
    end

    Storage -->|"Images, PCD, etc."| Sensor["Sensor Data (Immutable)"]
    Annotations -->|"Labels, Boxes, Masks"| Labels["Annotation Data (Editable)"]

    Sensor --> Camera["Camera"]
    Sensor --> Radar["Radar"]
    Sensor --> LiDAR["LiDAR"]

    Labels --> Box2D["2D Boxes"]
    Labels --> Box3D["3D Boxes"]
    Labels --> Masks["Masks"]

    style Dataset fill:#e1f5ff,stroke:#0277bd,stroke-width:3px
    style Storage fill:#fff3e0,stroke:#ef6c00,stroke-width:2px
    style Annotations fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
    style Sensor fill:#e8f5e9,stroke:#2e7d32,stroke-width:2px
    style Labels fill:#fce4ec,stroke:#c2185b,stroke-width:2px
```
Key Principles
- Normalized coordinates: All spatial data uses the 0–1 range (resolution-independent)
- Sensor container: ZIP or directory holding images and point clouds
- Annotation container: Apache Arrow file with columnar annotation data
- Multiple sensor types: Camera, Radar (point cloud + data cube), LiDAR
- Flexible structure: Supports sequence-based, image-based, or mixed datasets
Dataset Organization
Datasets can be organized in three patterns depending on your use case:
```mermaid
graph TB
    subgraph Sequence["Sequence-Based"]
        S1["Sequences<br/>with temporal order"]
    end
    subgraph Image["Image-Based"]
        I1["Independent images<br/>no order"]
    end
    subgraph Mixed["Mixed"]
        M1["Sequences +<br/>Images"]
    end

    style Sequence fill:#c8e6c9,stroke:#388e3c,stroke-width:2px
    style Image fill:#ffccbc,stroke:#d84315,stroke-width:2px
    style Mixed fill:#e1bee7,stroke:#6a1b9a,stroke-width:2px
```
Each organization has different directory structures and file naming conventions. See Dataset Organization for detailed examples.
Annotation Data
EdgeFirst stores annotations in Apache Arrow format, a columnar, high-performance data structure. Think of it as a database table where each row represents an annotation and each column represents an annotation property.
```python
import polars as pl

# Load and view your annotations
df = pl.read_ipc("dataset.arrow")
print(df.head())
```
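Because the Arrow file loads as an ordinary DataFrame, you can explore it with standard polars operations. Here is a minimal sketch, assuming the schema includes a `label` column (an assumption; verify with `df.columns` against the Annotation Schema chapter):

```python
import polars as pl

df = pl.read_ipc("dataset.arrow")
# Count annotations per class label; "label" is an assumed column
# name -- check df.columns against the Annotation Schema chapter.
counts = df.group_by("label").len().sort("len", descending=True)
print(counts)
```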
Learn more about the annotation schema, field definitions, and sample metadata in Annotation Schema.
Sensor Data
EdgeFirst supports multiple sensor types:
- Camera: JPEG or PNG images with embedded EXIF metadata (GPS, timestamp)
- Radar: Point clouds (PCD format) and data cubes (16-bit PNG)
- LiDAR: Point clouds (PCD format) with optional depth and reflectivity maps
See Sensor Data for detailed specifications on each format.
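Both camera and radar files open with common Python imaging tools. A sketch with illustrative file names (the actual naming conventions are in the Sensor Data chapter):

```python
import numpy as np
from PIL import Image
from PIL.ExifTags import TAGS

# Dump the EXIF tags (GPS, timestamp, ...) embedded in a camera frame.
img = Image.open("sequence_01_001.camera.jpeg")  # illustrative name
for tag_id, value in img.getexif().items():
    print(TAGS.get(tag_id, tag_id), value)

# Load a 16-bit PNG radar data cube into a NumPy array; the reported
# dtype depends on the PNG bit depth and Pillow's decoding mode.
cube = np.asarray(Image.open("sequence_01_001.radar.cube.png"))
print(cube.dtype, cube.shape)
```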
What's Inside This Chapter
Dataset Organization
How to structure your dataset files depending on whether you have video sequences, standalone images, or both. Includes file naming conventions and directory hierarchies.
Bounding Box Formats
Understanding the EdgeFirst bounding box format: center-based coordinates in Arrow files vs. top-left coordinates in the legacy JSON API. Includes 3D boxes and polygon masks.
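As a quick taste of the difference (a sketch only; the actual field names and layouts are covered in that chapter):

```python
# Sketch: converting a normalized, center-based box (cx, cy, w, h), as
# stored in Arrow files, to the top-left (x, y, w, h) convention of the
# legacy JSON API. Parameter names here are illustrative.
def center_to_topleft(cx, cy, w, h):
    return cx - w / 2, cy - h / 2, w, h

print(center_to_topleft(0.5, 0.5, 0.2, 0.1))  # -> (0.4, 0.45, 0.2, 0.1)
```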
Annotation Schema & Fields
Detailed reference for every field in your annotations: sample identifiers, labels, geometry types (masks, boxes), and optional metadata (GPS, IMU, degradation).
Sensor Data
Complete specification for camera, radar, and LiDAR data formats, file extensions, and encoding details.
Format Conversion
How to convert between Arrow (for analysis) and JSON (for editing) formats with Python code examples. Understand the key differences between formats.
Important Concepts
Samples vs. Annotations
- A sample is a single image or frame
- An annotation is one labeled object within a sample
- One sample can have zero or many annotations
Samples without annotations are still important! They represent images with no objects of interest and should be included in your training dataset.
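To see this in your own data, you can count annotations per sample. A sketch assuming one row per annotation and a `name` column as the sample identifier (both assumptions; consult the Annotation Schema chapter for the real fields):

```python
import polars as pl

df = pl.read_ipc("dataset.arrow")
# Group annotation rows by their sample identifier; "name" is an
# assumed column name -- see the Annotation Schema chapter.
per_sample = df.group_by("name").len().rename({"len": "annotations"})
print(per_sample.sort("annotations", descending=True).head())
```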
Normalized Coordinates
All spatial coordinates in EdgeFirst are normalized to the 0–1 range:

```
normalized_x = pixel_x / image_width
normalized_y = pixel_y / image_height
```
This makes annotations resolution-independent and portable across different image sizes.
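A minimal round trip shows why: the same normalized annotation maps cleanly onto any target resolution.

```python
def normalize(px: float, py: float, width: int, height: int):
    """Pixel coordinates -> normalized 0-1 coordinates."""
    return px / width, py / height

def denormalize(nx: float, ny: float, width: int, height: int):
    """Normalized 0-1 coordinates -> pixel coordinates."""
    return nx * width, ny * height

nx, ny = normalize(640, 360, 1280, 720)  # -> (0.5, 0.5)
print(denormalize(nx, ny, 1920, 1080))   # -> (960.0, 540.0)
```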
Frame Numbers
- For sequences: Frame numbers indicate temporal order (frame 0, 1, 2, ...)
- For images: Frame is `null` (no temporal ordering)
- Frame numbers are unique within a sequence but don't need to be consecutive (frames can be dropped or downsampled)
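This makes it easy to split a mixed dataset into its sequence and standalone-image parts. A sketch assuming the frame number lives in a `frame` column (an assumption; verify against your schema):

```python
import polars as pl

df = pl.read_ipc("dataset.arrow")
# Standalone images carry a null frame; sequence frames carry integers.
# "frame" is an assumed column name -- check against your schema.
images = df.filter(pl.col("frame").is_null())
sequences = df.filter(pl.col("frame").is_not_null())
print(f"{len(images)} image annotations, {len(sequences)} sequence annotations")
```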
Quick Example
Here's a typical dataset structure:
```
my_dataset/
├── my_dataset.arrow                  # Annotations
└── my_dataset/                       # Sensor data
    ├── sequence_01/                  # First video sequence
    │   ├── sequence_01_001.camera.jpeg
    │   ├── sequence_01_002.camera.jpeg
    │   └── sequence_01_001.radar.pcd
    ├── image_background.jpg          # Standalone image
    └── sequence_02/                  # Second video sequence
        └── sequence_02_001.camera.jpeg
```
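Given that layout, pairing a camera frame with its sibling radar point cloud is a matter of matching file stems. A sketch based on the naming shown above:

```python
from pathlib import Path

root = Path("my_dataset/my_dataset")  # the sensor-data directory
for jpeg in sorted(root.rglob("*.camera.jpeg")):
    # Files for the same frame share the "<sequence>_<frame>" stem.
    stem = jpeg.name.removesuffix(".camera.jpeg")
    pcd = jpeg.with_name(f"{stem}.radar.pcd")
    print(stem, "->", pcd.name if pcd.exists() else "no radar frame")
```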
Next Steps
- Learn how to organize your dataset files
- Understand bounding box coordinate systems
- Explore the annotation schema
- Check the official specification for authoritative technical details