Point cloud labeling using 3D Convolutional Neural Network

doi:10.1109/ICPR.2016.7900038

Proceedings Article10.1109/ICPR.2016.7900038

Point cloud labeling using 3D Convolutional Neural Network

Jing Huang, +1 more

- 01 Dec 2016

- pp 2670-2675

428

TL;DR: This paper introduces a 3D point cloud labeling scheme based on 3D Convolutional Neural Network that minimizes the prior knowledge of the labeling problem and does not require a segmentation step or hand-crafted features as most previous approaches did.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1109/TPAMI.2020.3005434

Deep Learning for 3D Point Clouds: A Survey

Yulan Guo, +5 more

- 01 Dec 2021

- IEEE Transactions on Pattern Analysis an...

TL;DR: This paper presents a comprehensive review of recent progress in deep learning methods for point clouds, covering three major tasks, including 3D shape classification, 3D object detection and tracking, and 3D point cloud segmentation.

...read moreread less

2K

•Proceedings Article•10.1109/CVPR.2017.701

OctNet: Learning Deep 3D Representations at High Resolutions

Gernot Riegler, +2 more

- 21 Jul 2017

TL;DR: The utility of the OctNet representation is demonstrated by analyzing the impact of resolution on several 3D tasks including 3D object classification, orientation estimation and point cloud labeling.

...read moreread less

1.7K

•Proceedings Article•10.1109/CVPR.2017.11

Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs

Martin Simonovsky, +1 more

- 21 Jul 2017

TL;DR: This work generalizes the convolution operator from regular grids to arbitrary graphs while avoiding the spectral domain, which allows us to handle graphs of varying size and connectivity.

...read moreread less

1.6K

•Proceedings Article•10.1109/ICCV.2019.00939

SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences

Jens Behley, +6 more

- 01 Oct 2019

TL;DR: In this paper, the KITTI Vision Odometry Benchmark was used to provide dense point-wise annotations for the complete 360-degree field-of-view of the employed automotive LiDAR.

...read moreread less

1.4K

•Posted Content

A Review on Deep Learning Techniques Applied to Semantic Segmentation.

Alberto Garcia-Garcia, +4 more

- 22 Apr 2017

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A review on deep learning methods for semantic segmentation applied to various application areas as well as mandatory background concepts to help researchers decide which are the ones that best suit their needs and their targets.

...read moreread less

1.4K

...

Expand

References

Journal Article•10.1109/5.726791

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

- 01 Jan 1998

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

53.5K

•Proceedings Article•10.1109/CVPR.2015.7298801

3D ShapeNets: A deep representation for volumetric shapes

Zhirong Wu, +6 more

- 07 Jun 2015

TL;DR: This work proposes to represent a geometric 3D shape as a probability distribution of binary variables on a 3D voxel grid, using a Convolutional Deep Belief Network, and shows that this 3D deep representation enables significant performance improvement over the-state-of-the-arts in a variety of tasks.

...read moreread less

6.6K

•Journal Article•10.1109/TPAMI.2012.59

3D Convolutional Neural Networks for Human Action Recognition

Shuiwang Ji, +3 more

- 01 Jan 2013

- IEEE Transactions on Pattern Analysis an...

TL;DR: Wang et al. as mentioned in this paper developed a novel 3D CNN model for action recognition, which extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames.

...read moreread less

6K

•Proceedings Article

3D Convolutional Neural Networks for Human Action Recognition

Shuiwang Ji, +3 more

- 21 Jun 2010

TL;DR: A novel 3D CNN model for action recognition that extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames.

...read moreread less

4.3K

Proceedings Article•10.1109/IROS.2015.7353481

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

Daniel Maturana, +1 more

- 01 Sep 2015

TL;DR: VoxNet is proposed, an architecture to tackle the problem of robust object recognition by integrating a volumetric Occupancy Grid representation with a supervised 3D Convolutional Neural Network (3D CNN).

...read moreread less

4.2K

Point cloud labeling using 3D Convolutional Neural Network

Chat with Paper

AI Agents for this Paper

Citations

Deep Learning for 3D Point Clouds: A Survey

OctNet: Learning Deep 3D Representations at High Resolutions

Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs

SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences

A Review on Deep Learning Techniques Applied to Semantic Segmentation.

References

Gradient-based learning applied to document recognition

3D ShapeNets: A deep representation for volumetric shapes

3D Convolutional Neural Networks for Human Action Recognition

3D Convolutional Neural Networks for Human Action Recognition

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

Related Papers (5)

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

3D ShapeNets: A deep representation for volumetric shapes

Fully convolutional networks for semantic segmentation