3260 papers • 126 benchmarks • 313 datasets
Uses sparse LiDAR semantic labels for training and testing
(Image credit: Papersgraph)
These leaderboards are used to track progress in 3d-semantic-occupancy-prediction-9
No benchmarks available.
Use these libraries to find 3d-semantic-occupancy-prediction-9 models and implementations
No datasets available.
No subtasks available.
OccFormer, a dual-path transformer network to effectively process the 3D volume for semantic occupancy prediction, achieves a long-range, dynamic, and efficient encoding of the camera-generated 3D voxel features.
A cylindrical tri-perspective view to represent point clouds effectively and comprehensively and a PointOcc model to process them efficiently are proposed and demonstrated that the proposed PointOcc achieves state-of-the-art performance with much faster speed.
This paper introduces InverseMatrixVT3D, an efficient method for transforming multi-view image features into 3D feature volumes for 3D semantic occupancy prediction, which achieves the top performance in detecting vulnerable road users (VRU), crucial for autonomous driving and road safety.
This work introduces HyDRa, a novel camera-radar fusion architecture for diverse 3D perception tasks that achieves a new state-of-the-art for cameraradar fusion of 64.2 NDS (+1.8) and 58.4 AMOTA (+1.5) on the public nuScenes dataset.
Adding a benchmark result helps the community track progress.