aiMotive Dataset: A Multimodal Dataset for Robust Autonomous Driving with Long-Range Perception (2022-11-17T00:00:00.000000Z)

TL;DR

This work introduces a multimodal dataset for robust autonomous driving with long-range perception and trained unimodal and multi-modal baseline models for 3D object detection.

Abstract

Autonomous driving is a popular research area within the computer vision research community. Since autonomous vehicles are highly safety-critical, ensuring robustness is essential for real-world deployment. While several public multimodal datasets are accessible, they mainly comprise two sensor modalities (camera, LiDAR) which are not well suited for adverse weather. In addition, they lack far-range annotations, making it harder to train neural networks that are the base of a highway assistant function of an autonomous vehicle. Therefore, we introduce a multimodal dataset for robust autonomous driving with long-range perception. The dataset consists of 176 scenes with synchronized and calibrated LiDAR, camera, and radar sensors covering a 360-degree field of view. The collected data was captured in highway, urban, and suburban areas during daytime, night, and rain and is annotated with 3D bounding boxes with consistent identifiers across frames. Furthermore, we trained unimodal and multimodal baseline models for 3D object detection. Data are available at \url{https://github.com/aimotive/aimotive_dataset}.

Authors

D. Ribli

3 papers

Tamás Matuszka

1 papers

Iván Barton

1 papers

TL;DR

Abstract

Authors

References33 items

Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting

Far3Det: Towards Far-Field 3D Detection

A Novel Neural Network Training Method for Autonomous Driving Using Semi-Pseudo-Labels and 3D Data Augmentations

BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Raw High-Definition Radar for Multi-Task Learning

3D Object Detection for Autonomous Driving: A Survey

One Million Scenes for Autonomous Driving: ONCE Dataset

RADIATE: A Radar Dataset for Automotive Perception in Bad Weather

Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D

Center-based 3D Object Detection and Tracking

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

A*3D Dataset: Towards Autonomous Driving in Challenging Environments

Argoverse: 3D Tracking and Forecasting With Rich Maps

FCOS: Fully Convolutional One-Stage Object Detection

nuScenes: A Multimodal Dataset for Autonomous Driving

The H3D Dataset for Full-Surround 3D Multi-Object Detection and Tracking in Crowded Urban Scenes

Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in Unseen Adverse Weather

SECOND: Sparsely Embedded Convolutional Detection

The ApolloScape Dataset for Autonomous Driving

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

Squeeze-and-Excitation Networks

Learning OpenCV 3: Computer Vision in C++ with the OpenCV Library

Feature Pyramid Networks for Object Detection

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Are we ready for autonomous driving? The KITTI vision benchmark suite

The Hungarian method for the assignment problem

Level 5 perception dataset 2020

International Journal of Computer Vision manuscript No. (will be inserted by the editor) The PASCAL Visual Object Classes (VOC) Challenge

Modern Terrestrial Reference Systems ( Part 1 )

Bevfusion : Multitask multi - sensor fusion with unified bird ’ seye view representation

10: Qualitative results: detections of the LiDAR+radar baseline model.

more visible in distant regions (Table 11, Table 20-23). Finally, we would like to solicit contributions from the research community in order to develop more performant unimodal and multimodal models

Field of Study

Journal Information

Name

Volume

Venue Information

Name

Type

URL

Alternate Names