EdgeFirst Datasets
This section describes how datasets are being handled in EdgeFirst. The dataset creation starts from data capture using devices such as a mobile phone or various EdgeFirst Platforms. The annotation process is handled in EdgeFirst Studio once the captured data has been uploaded. Otherwise existing datasets in various formats such as Darknet can be imported into EdgeFirst Studio along with the annotations. Datasets from scratch can either be auto-annotated or manually-annotated. Typically, datasets are auto-annotated and small corrections can be made through the manual annotation process.
The EdgeFirst Dataset Format is purposely designed to efficiently store multiple annotation types and sensor types. There are 2D and 3D types of annotations that correlate with each other. For example, a 2D annotation is a set of pixel-based bounding boxes and segmentation masks whereas a 3D annotation is a set of 3D bounding boxes in meters. However, a single object can be described by all three linked annotation types: a 2D bounding box, a 2D mask, and a 3D bounding box. There could be multiple sensors involved in creating the dataset such as the camera, Radar and/or LiDAR, etc. The readings from these sensors is rarely modified and needs to be disinguished in the dataset. Thus, the EdgeFirst Dataset is known for its file pairs (typically Zip and Arrow) for storing the sensor data and annotations separately. The EdgeFirst Dataset File Structure comes in various forms depending if the data is a sequence or not.
We provide public EdgeFirst Datasets and commercially clean datasets which can be licensed from Au-Zone Technologies. Some example third-party datasets such as COCO are also provided for evaluation and research purposes only. Refer to their respective licensing restrictions.