This work introduces a multimodal dataset for robust autonomous driving with long-range perception and trained unimodal and multi-modal baseline models for 3D object detection.
Autonomous driving is a popular research area within the computer vision research community. Since autonomous vehicles are highly safety-critical, ensuring robustness is essential for real-world deployment. While several public multimodal datasets are accessible, they mainly comprise two sensor modalities (camera, LiDAR) which are not well suited for adverse weather. In addition, they lack far-range annotations, making it harder to train neural networks that are the base of a highway assistant function of an autonomous vehicle. Therefore, we introduce a multimodal dataset for robust autonomous driving with long-range perception. The dataset consists of 176 scenes with synchronized and calibrated LiDAR, camera, and radar sensors covering a 360-degree field of view. The collected data was captured in highway, urban, and suburban areas during daytime, night, and rain and is annotated with 3D bounding boxes with consistent identifiers across frames. Furthermore, we trained unimodal and multimodal baseline models for 3D object detection. Data are available at \url{https://github.com/aimotive/aimotive_dataset}.
Ádám Butykai
1 papers
Peter Hajas
1 papers
Dávid Kiss
1 papers
Domonkos Kov'acs
1 papers
Sándor Kunsági-Máté
1 papers
P'eter Lengyel
1 papers
Gábor Németh
1 papers
Levente Peto
1 papers
D. Szeghy
1 papers
Szabolcs Vajna
1 papers
Bálint Varga
1 papers