SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud (2017-10-19)

TL;DR

SqueezeSeg is an end-to-end pipeline based on convolutional neural networks (CNNs). It takes a transformed LiDAR point cloud as input and directly outputs a point-wise label map, which is then refined by a conditional random field (CRF) implemented as a recurrent layer.

Abstract

We address semantic segmentation of road objects from 3D LiDAR point clouds. In particular, we wish to detect and categorize instances of interest, such as cars, pedestrians and cyclists. We formulate this as a point-wise classification problem and propose an end-to-end pipeline called SqueezeSeg based on convolutional neural networks (CNNs): the CNN takes a transformed LiDAR point cloud as input and directly outputs a point-wise label map, which is then refined by a conditional random field (CRF) implemented as a recurrent layer. Instance-level labels are then obtained by conventional clustering algorithms. Our CNN model is trained on LiDAR point clouds from the KITTI [1] dataset, and our point-wise segmentation labels are derived from KITTI's 3D bounding boxes. To obtain extra training data, we built a LiDAR simulator into Grand Theft Auto V (GTA-V), a popular video game, to synthesize large amounts of realistic training data. Our experiments show that SqueezeSeg achieves high accuracy with astonishingly fast and stable runtime ($8.7 \pm 0.5$ ms per frame), which is highly desirable for autonomous driving. Furthermore, additional training on synthesized data boosts validation accuracy on real-world data. Our source code is released as open source at https://github.com/BichenWuUCB/SqueezeSeg. The paper is accompanied by a video (https://youtu.be/Xyn5Zd31m6s) containing a high-level introduction and demonstrations of this work.
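
The abstract does not say how the point cloud is "transformed" before it reaches the CNN. One common choice, assumed here purely for illustration, is a spherical projection that maps each point to a cell of a dense 2D grid so that ordinary 2D convolutions apply. The sketch below is a minimal NumPy version; the function name, the 64 x 512 grid, the 90-degree front view, the vertical field of view, and the five channels (x, y, z, intensity, range) are all assumptions of this example rather than details taken from the text above.

```python
import numpy as np

def spherical_projection(points, height=64, width=512,
                         fov_up_deg=2.0, fov_down_deg=-24.8,
                         azimuth_span_deg=90.0):
    """Project an (N, 4) array of [x, y, z, intensity] onto an H x W x 5 grid.

    Hypothetical helper: grid size and field-of-view defaults are assumptions.
    Returns (grid, mask), where mask marks cells that received a point.
    """
    points = np.asarray(points, dtype=np.float32)
    rng = np.linalg.norm(points[:, :3], axis=1)
    points, rng = points[rng > 0], rng[rng > 0]      # drop points at the origin

    azimuth = np.arctan2(points[:, 1], points[:, 0])  # 0 = straight ahead
    elevation = np.arcsin(points[:, 2] / rng)

    half_span = np.radians(azimuth_span_deg) / 2.0
    fov_up, fov_down = np.radians(fov_up_deg), np.radians(fov_down_deg)

    # Keep only points inside the assumed front-facing field of view.
    keep = ((np.abs(azimuth) < half_span)
            & (elevation <= fov_up) & (elevation >= fov_down))
    points, rng = points[keep], rng[keep]
    azimuth, elevation = azimuth[keep], elevation[keep]

    # Map angles to integer grid coordinates (row 0 = top, column 0 = left).
    col = ((half_span - azimuth) / (2.0 * half_span) * width).astype(int)
    row = ((fov_up - elevation) / (fov_up - fov_down) * height).astype(int)
    col = np.clip(col, 0, width - 1)
    row = np.clip(row, 0, height - 1)

    # When several points fall into one cell, keep the nearest one.
    order = np.argsort(rng)                           # near to far
    flat = (row * width + col)[order]
    _, first = np.unique(flat, return_index=True)     # first = nearest per cell
    sel = order[first]

    grid = np.zeros((height, width, 5), dtype=np.float32)
    mask = np.zeros((height, width), dtype=bool)
    grid[row[sel], col[sel], :4] = points[sel]
    grid[row[sel], col[sel], 4] = rng[sel]
    mask[row[sel], col[sel]] = True
    return grid, mask
```

A network operating on such a grid produces one label per cell, and the mask lets those labels be mapped back to the original points. Keeping the nearest point per cell is one design choice among several; averaging the points in a cell would be another.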

Authors

K. Keutzer

Bichen Wu

Alvin Wan

References (23 items)

Fast segmentation of 3D point clouds: A paradigm on LiDAR data for autonomous vehicle applications

Real-Time and Accurate Segmentation of 3-D Point Clouds Based on Gaussian Process Regression

Fast LIDAR-based road detection using fully convolutional neural networks

SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving

Multi-view 3D Object Detection Network for Autonomous Driving

Driving in the Matrix: Can virtual worlds replace human-generated annotations for real world tasks?

Playing for Data: Ground Truth from Computer Games

Vehicle Detection from 3D Lidar Using Fully Convolutional Network

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Fusing LIDAR and images for pedestrian detection using convolutional neural networks

SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size

3D Convolutional Neural Networks for landing zone detection from LiDAR

Conditional Random Fields as Recurrent Neural Networks

Fully convolutional networks for semantic segmentation

ImageNet classification with deep convolutional neural networks

Are we ready for autonomous driving? The KITTI vision benchmark suite

What could move? Finding cars, pedestrians and bicyclists in 3D laser data

Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials

On the segmentation of 3D LIDAR point clouds

Segmentation of 3D lidar data in non-flat urban environments using a local convexity criterion

Stanley: The robot that won the DARPA Grand Challenge

TensorFlow: Large-scale machine learning on heterogeneous systems

LIDAR-based 3D Object Perception
