Point cloud data represents 3D shapes as unordered sets of discrete points in 3D space. Such data primarily comes from 3D scanners, LiDAR systems, and similar sensing technologies. Point cloud processing has a wide range of applications, including robotics, autonomous vehicles, and augmented/virtual reality. Pre-training on point cloud data is similar in spirit to pre-training on images or text: by pre-training a model on a large, diverse dataset, it learns general features of the modality, which can then be fine-tuned on a smaller, task-specific dataset. This two-step process (pre-training followed by fine-tuning) often yields better performance, especially when the task-specific dataset is limited in size.
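To make the setup concrete, here is a minimal sketch of the two-step process in PyTorch. Everything in it is hypothetical: the PointNet-style `PointEncoder`, the random input clouds, and the 40-class head (sized to match ModelNet40) are illustrative stand-ins, not a reference implementation of any particular method.

```python
import torch
import torch.nn as nn

class PointEncoder(nn.Module):
    """Toy PointNet-style encoder: a shared per-point MLP followed by
    max-pooling into a single global feature per cloud."""
    def __init__(self, dim=256):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, dim))

    def forward(self, pts):                      # pts: (B, N, 3) coordinates
        return self.mlp(pts).max(dim=1).values   # (B, dim) global feature

encoder = PointEncoder()

# Step 1 (pre-training): fit the encoder on a large unlabeled corpus with a
# self-supervised objective (masked reconstruction, contrastive learning, ...).
# Step 2 (fine-tuning): attach a small task head and train on labeled data.
head = nn.Linear(256, 40)                        # e.g. 40 classes (ModelNet40)
logits = head(encoder(torch.randn(8, 2048, 3)))  # (8, 40) class logits
```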
Point-M2AE is proposed, a strong multi-scale MAE pre-training framework for hierarchical self-supervised learning of 3D point clouds that modifies the encoder and decoder into pyramid architectures to progressively model spatial geometries and capture both fine-grained and high-level semantics of 3D shapes.
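The core objective behind such masked-autoencoder (MAE) frameworks can be sketched in a few lines. The mask ratio, the assumed upstream patch grouping, and the Chamfer loss below are generic masked-point-modeling ingredients, not Point-M2AE's actual multi-scale pyramid design.

```python
import torch

def mask_patches(patches, mask_ratio=0.6):
    """Randomly mask whole point patches, MAE-style.
    patches: (B, G, K, 3) -- B clouds, G groups of K points each
    (grouping by farthest-point sampling + kNN is assumed done upstream).
    Returns the visible patches and a boolean mask over groups."""
    B, G = patches.shape[:2]
    n_mask = int(G * mask_ratio)
    ids = torch.rand(B, G).argsort(dim=1)          # random group order
    mask = torch.zeros(B, G, dtype=torch.bool)
    mask.scatter_(1, ids[:, :n_mask], True)        # True = masked group
    visible = patches[~mask].reshape(B, G - n_mask, *patches.shape[2:])
    return visible, mask

def chamfer(a, b):
    """Symmetric Chamfer distance between point sets a: (B, N, 3) and
    b: (B, M, 3), the usual reconstruction loss for masked point patches."""
    d = torch.cdist(a, b)                          # (B, N, M) pairwise distances
    return d.min(dim=2).values.mean() + d.min(dim=1).values.mean()
```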
A novel universal 3D pre-training framework is introduced, designed to facilitate the acquisition of efficient 3D representations through differentiable neural rendering, thereby establishing a pathway to 3D foundation models.
This work aims to facilitate research on 3D representation learning by selecting a suite of diverse datasets and tasks to measure the effect of unsupervised pre-training on a large source set of 3D scenes, achieving improvements over recent best results in segmentation and detection across six different benchmarks.
This paper shows that the method outperforms previous pre-training methods on object classification and on both part-based and semantic segmentation tasks, and that even when pre-trained on a single dataset (ModelNet40), it improves accuracy across different datasets and encoders.
A new self-supervised learning method, called Mixing and Disentangling (MD), is proposed for 3D point cloud representation learning; it improves empirical performance on both the ModelNet-40 and ShapeNet-Part datasets for point cloud classification and segmentation tasks.
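A rough sketch of the mixing step: pool a random half of the points from each of two clouds into one mixture, which a network is then trained to disentangle back into its two sources. The half-and-half pooling below is an illustrative stand-in, not the paper's implementation.

```python
import torch

def mix_clouds(a, b, keep=2048):
    """Mix two point clouds a, b: (N, 3) by keeping a random half of each.
    The self-supervised task is to recover the two originals from the mix."""
    ia = torch.randperm(a.shape[0])[: keep // 2]
    ib = torch.randperm(b.shape[0])[: keep // 2]
    return torch.cat([a[ia], b[ib]], dim=0)        # (keep, 3) mixed cloud
```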
The construction of 3D point cloud datasets requires a great deal of human effort, so constructing a large-scale 3D point cloud dataset is difficult. To remedy this issue, we propose a newly developed point cloud fractal database (PC-FractalDB), a novel family of formula-driven supervised learning inspired by the fractal geometry encountered in natural 3D structures. Our research is based on the hypothesis that, by learning fractal geometry, we can learn representations from more real-world 3D patterns than conventional 3D datasets provide. We show how PC-FractalDB facilitates solving several recent dataset-related problems in 3D scene understanding, such as 3D model collection and labor-intensive annotation. The experimental section shows performance rates of up to 61.9% and 59.0% on the ScanNetV2 and SUN RGB-D datasets, respectively, improving over the current highest scores obtained with PointContrast, Contrastive Scene Contexts (CSC), and RandomRooms. Moreover, the PC-FractalDB pre-trained model is especially effective when training with limited data. For example, with 10% of the training data on ScanNetV2, the PC-FractalDB pre-trained VoteNet performs at 38.3%, which is +14.8% higher accuracy than CSC. Of particular note, we found that the proposed method achieves the highest results for 3D object detection pre-training on limited point cloud data. Dataset release: https://ryosuke-yamada.github.io/PointCloud-FractalDataBase/
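Formula-driven point cloud data of this kind can be generated with an iterated function system (IFS) via the chaos game. The generator below is a toy version in the spirit of PC-FractalDB; the map count, contraction factor, and burn-in length are arbitrary illustrative choices, not the paper's recipe.

```python
import numpy as np

def ifs_point_cloud(n_points=2048, n_maps=4, seed=0):
    """Sample a 3D fractal point cloud from a random IFS (chaos game)."""
    rng = np.random.default_rng(seed)
    A = rng.normal(size=(n_maps, 3, 3))            # random affine maps x -> Ax + t
    norms = np.linalg.norm(A, ord=2, axis=(1, 2), keepdims=True)
    A *= 0.8 / norms                               # rescale to spectral norm 0.8,
                                                   # so every map is contractive
    t = rng.uniform(-1.0, 1.0, size=(n_maps, 3))
    x = rng.uniform(-1.0, 1.0, size=3)
    pts = np.empty((n_points, 3))
    for i in range(n_points + 100):                # 100 burn-in iterations
        k = rng.integers(n_maps)
        x = A[k] @ x + t[k]
        if i >= 100:
            pts[i - 100] = x
    return pts                                     # (n_points, 3) on the attractor
```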
POS-BERT is proposed, a one-stage BERT pre-training method for point clouds that achieves state-of-the-art classification accuracy and significantly improves many downstream tasks, such as fine-tuned classification, few-shot classification, and part segmentation.
ProposalContrast, a new unsupervised point cloud pre-training framework, is proposed; it learns robust 3D representations by contrasting region proposals and optimizes both inter-cluster and inter-proposal separation.
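The contrastive core of such methods is an InfoNCE loss over matched region-proposal embeddings from two augmented views of the same scene. The sketch below shows only that generic core and omits ProposalContrast's additional inter-cluster and inter-proposal terms.

```python
import torch
import torch.nn.functional as F

def proposal_info_nce(z1, z2, tau=0.07):
    """InfoNCE over P matched proposal embeddings from two augmented views.
    z1, z2: (P, D); z1[i] and z2[i] come from the same region proposal."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / tau                     # (P, P) cosine similarities
    labels = torch.arange(z1.shape[0], device=z1.device)
    return F.cross_entropy(logits, labels)         # positives on the diagonal
```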
McP-BERT, a pre-training framework with multi-choice tokens, is proposed; it eases the previous single-choice constraint on patch token IDs in Point-BERT and provides multi-choice token IDs for each patch as supervision, improving the performance of Point-BERT on all downstream tasks without extra computational overhead.
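The multi-choice idea can be sketched as follows: instead of a single hard token ID per patch, keep the tokenizer's top-k probabilities as a soft target distribution. The function name and the top-k/renormalize scheme are assumptions for illustration, not McP-BERT's actual code.

```python
import torch

def multi_choice_targets(token_logits, k=5):
    """Turn a tokenizer's per-patch logits (B, G, V), over a vocabulary of V
    patch tokens, into soft multi-choice targets: keep the top-k token
    probabilities, zero the rest, and renormalize."""
    probs = token_logits.softmax(dim=-1)
    topv, topi = probs.topk(k, dim=-1)
    targets = torch.zeros_like(probs).scatter_(-1, topi, topv)
    return targets / targets.sum(dim=-1, keepdim=True)

# Pre-training then minimizes a soft cross-entropy between the student's
# predicted token distribution and these multi-choice targets.
```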
A bird's-eye-view (BEV) guided masking strategy is proposed to guide the 3D encoder toward learning feature representations in a BEV perspective and to avoid complex decoder designs during pre-training; the resulting BEV-MAE achieves new state-of-the-art results in LiDAR-based 3D object detection.
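A toy version of BEV-guided masking: bucket points into a bird's-eye-view grid over (x, y) and drop whole cells, so the mask is uniform in the BEV plane rather than in 3D. The cell size, range, and mask ratio below are illustrative choices, not BEV-MAE's exact strategy.

```python
import torch

def bev_mask(points, cell=0.5, bound=50.0, mask_ratio=0.7):
    """Mask LiDAR points (N, 3), coordinates in metres, by BEV grid cell."""
    n_cells = int(2 * bound / cell)
    ij = ((points[:, :2] + bound) / cell).long().clamp(0, n_cells - 1)
    cell_id = ij[:, 0] * n_cells + ij[:, 1]        # (N,) flat BEV cell index
    occupied = cell_id.unique()
    n_mask = int(len(occupied) * mask_ratio)
    masked = occupied[torch.randperm(len(occupied))[:n_mask]]
    keep = ~torch.isin(cell_id, masked)            # drop points in masked cells
    return points[keep], keep
```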