Object Insertion Based Data Augmentation for Semantic Segmentation

doi:10.1109/icra46639.2022.9811816

Proceedings Article

Yuan Zeng Ren, +2 more

- 23 May 2022

pp 359-365

14

TL;DR: The paper proposes an object insertion based data augmentation method that remarkably increases the performance of semantic segmentation networks, using an object library created from labeled LiDAR point clouds.

Abstract: Neural networks for LiDAR semantic segmentation need point-wise labeled point clouds for training, and these labels are more expensive to produce than bounding box annotations. Enhancing the diversity of training data through object insertion is an effective way to reduce labeling costs. Existing object insertion methods fall into two main categories. The first copies clusters from a LiDAR frame and pastes them into other frames or positions. The second inserts CAD models into the background and then uses a LiDAR simulator to generate the laser points of the inserted models. The copy-paste approach cannot generate realistic scanning lines and shadows, and CAD models, especially those of flexible objects, are hard to obtain. We propose an object insertion based data augmentation method that remarkably increases the performance of semantic segmentation networks. First, an object library is created from labeled LiDAR point clouds. Then, these objects are inserted into the LiDAR point clouds dynamically during training. Finally, realistic scanning lines and shadows are simulated according to the real LiDAR parameters. The experimental results show that the proposed augmentation method remarkably increases the performance of different semantic segmentation frameworks.
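The three steps in the abstract (object library, insertion, scan-line and shadow simulation) can be sketched roughly as follows. This is a minimal illustration under simplifying assumptions, not the authors' implementation: the sensor is assumed to sit at the origin, the scan pattern is approximated by a regular azimuth/elevation grid, and the function names, the resolution parameters `h_res_deg` and `v_res_deg`, and the toy data are all hypothetical. Only the Python standard library is used so the sketch stays self-contained.

```python
import math
from collections import defaultdict


def build_object_library(points, labels, instance_ids, object_classes):
    """Step 1: group labeled points into per-instance clusters, keyed by class."""
    clusters = defaultdict(list)
    for p, lab, inst in zip(points, labels, instance_ids):
        if lab in object_classes:
            clusters[(lab, inst)].append(p)
    library = defaultdict(list)
    for (lab, _inst), pts in clusters.items():
        library[lab].append(pts)
    return library


def scan_cell(p, h_res_deg, v_res_deg):
    """Return the (azimuth, elevation) grid cell of a point and its range.

    Assumes the sensor is at the origin and no point lies exactly on it.
    """
    x, y, z = p
    rng = math.sqrt(x * x + y * y + z * z)
    az = math.degrees(math.atan2(y, x))
    el = math.degrees(math.asin(z / rng))
    return (math.floor(az / h_res_deg), math.floor(el / v_res_deg)), rng


def insert_object(scene, scene_labels, obj, obj_label, offset,
                  h_res_deg=0.2, v_res_deg=0.4):
    """Steps 2 and 3: place an object, then approximate scan lines and shadows."""
    dx, dy, dz = offset
    moved = [(x + dx, y + dy, z + dz) for x, y, z in obj]

    # Scan-line approximation: one return per grid cell, the nearest one.
    obj_cells = {}
    for p in moved:
        cell, rng = scan_cell(p, h_res_deg, v_res_deg)
        if cell not in obj_cells or rng < obj_cells[cell][0]:
            obj_cells[cell] = (rng, p)

    out_pts, out_labels = [], []
    blocked = set()  # cells where the scene lies in front of the object
    for p, lab in zip(scene, scene_labels):
        cell, rng = scan_cell(p, h_res_deg, v_res_deg)
        if cell in obj_cells:
            if rng > obj_cells[cell][0]:
                continue  # shadow: background hidden behind the object
            blocked.add(cell)
        out_pts.append(p)
        out_labels.append(lab)

    # Keep only the object returns that are not occluded by the scene.
    for cell, (_rng, p) in obj_cells.items():
        if cell not in blocked:
            out_pts.append(p)
            out_labels.append(obj_label)
    return out_pts, out_labels


# Hypothetical toy data: one labeled source frame and one target scene.
src_pts = [(10.0, 3.03125, 0.0625), (10.5, 3.03125, 0.0625), (1.0, 1.0, -1.5)]
library = build_object_library(src_pts, ["car", "car", "road"], [7, 7, 0], {"car"})

scene = [(20.0, 0.05, 0.1), (20.0, 5.0, 0.1)]
new_pts, new_labels = insert_object(
    scene, ["wall", "wall"], library["car"][0], "car", offset=(0.0, -3.0, 0.0))
```

In the toy example, the wall point directly behind the inserted car is dropped as shadow, and only the nearest car point in the shared grid cell survives. A practical pipeline would vectorize these loops and use the real beam layout of the sensor instead of a uniform grid.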

Citations

Proceedings Article•10.1109/icra48891.2023.10160429

Few-Shot Point Cloud Semantic Segmentation via Contrastive Self-Supervision and Multi-Resolution Attention

29 May 2023

TL;DR: This paper proposes a contrastive self-supervision framework for few-shot learning pretraining, which aims to eliminate feature extraction bias through class-agnostic contrastive supervision.

4

Journal Article•10.1109/aim46323.2023.10196168

Copy and Paste Augmentation for Deformable Wiring Harness Bags Segmentation

Bare Luka Žagar, +6 more

- 28 Jun 2023

TL;DR: An approach to generate a dataset of a specific object of interest, i.e. deformable wiring harness bags, with minimal effort employing the copy and paste technique is proposed, and the obtained dataset is validated on the semantic segmentation task in a real-world test setup.

2

Proceedings Article•10.1109/icra48891.2023.10160496

Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation

29 May 2023

TL;DR: M2SKD is a multi-to-single knowledge distillation framework for 3D point cloud semantic segmentation that boosts the performance of hard classes: rather than fusing all the points of multi-scans directly, it fuses only the instances that belong to the previously defined hard classes.

2

Journal Article•10.48550/arxiv.2407.21452

Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments

Haodong Hong, +4 more

- 31 Jul 2024

TL;DR: This study introduces R2R-UNO, a dataset with obstructed navigation graphs and visual observations, to evaluate Vision-and-Language Navigation methods' adaptability to unexpected obstructions, proposing ObVLN, a novel method that achieves robust performance in both obstructed and unobstructed scenarios.

1

Proceedings Article•10.1109/ICRA48891.2023.10160496

Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation

Shoumeng Qiu, +4 more

- 28 Apr 2023

TL;DR: M2SKD is a multi-to-single knowledge distillation framework for 3D point cloud semantic segmentation that boosts the performance of hard classes: rather than fusing all the points of multi-scans directly, it fuses only the instances that belong to the previously defined hard classes.

1

References

•Journal Article•10.1186/S40537-019-0197-0

A survey on Image Data Augmentation for Deep Learning

Connor Shorten, +1 more

- 06 Jul 2019

- Journal of Big Data

TL;DR: This survey presents existing methods for Data Augmentation, promising developments, and meta-level decisions for implementing Data Augmentation, a data-space solution to the problem of limited data.

10.6K

•Proceedings Article•10.1109/CVPR42600.2020.01164

nuScenes: A Multimodal Dataset for Autonomous Driving

Holger Caesar, +9 more

- 14 Jun 2020

TL;DR: nuScenes is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with a full 360-degree field of view.

4K

•Journal Article•10.3390/S18103337

SECOND: Sparsely Embedded Convolutional Detection

Yan Yan, +2 more

- 06 Oct 2018

- Sensors

TL;DR: The paper investigates an improved sparse convolution method for voxel-based 3D convolutional networks, which significantly increases the speed of both training and inference, and introduces a new form of angle loss regression to improve orientation estimation performance.

3.2K

•Proceedings Article•10.1109/ICCV.2019.00651

KPConv: Flexible and Deformable Convolution for Point Clouds

Hugues Thomas, +5 more

- 18 Apr 2019

TL;DR: KPConv is a new design of point convolution that operates on point clouds without any intermediate representation and outperforms state-of-the-art classification and segmentation approaches on several datasets.

3.1K

•Proceedings Article•10.1109/ICCV.2017.97

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Chen Sun, +3 more

- 10 Jul 2017

TL;DR: The authors investigate how the performance of current vision tasks would change if much larger datasets were used for representation learning, and find that performance on vision tasks increases logarithmically with the volume of training data.

3K

Related Papers (5)

Research on Fusion Method of Lidar and Visual Image Based on Surface Vehicle

Real-time object segmentation for visual object detection in dynamic scenes

Teat pose estimation via RGBD segmentation for automated milking

First contact: an active vision approach to segmentation

Identification of occlusion regions based on background rebuilding for automatic video object segmentation