Proceedings Article10.1109/icra46639.2022.9811816
Object Insertion Based Data Augmentation for Semantic Segmentation
Yuan Zeng Ren,Siyan Zhao,Bingbing Liu +2 more
- 23 May 2022
pp 359-365
14
TL;DR: An object insertion based data augmentation method is proposed which can increase the performance of the semantic segmentation network remarkably and an object library is created by using the labeled LiDAR point clouds.
read more
Abstract: Neural network used for the LiDAR semantic segmentation task needs the point-wise labeled point clouds for training, which is more expensive than bounding box annotations. Enhancing the diversity of training data through object insertion is an effective method to reduce labeling costs. The existing object insertion methods are mainly divided into two categories. First is “copy” the clusters from a LiDAR frame and “paste” it to other frames or positions. Second is inserting CAD models into the background then using LiDAR simulator to generate laser points of the inserted CAD models. “Copy-paste” method cannot generate realistic scanning lines and shadows, and the CAD models, especially the CAD models of flexible objects, are hard to obtain. We propose an object insertion based data augmentation method which can increase the performance of the semantic segmentation network remarkably. First, an object library is created by using the labeled LiDAR point clouds. Then, these objects are inserted into the LiDAR point clouds dynamically during the training. Finally, the realistic scanning lines and shadows are simulated according to the real LiDAR parameters. The experimental results show that the proposed augmentation method can increase the performance of different semantic segmentation frameworks remarkably.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Few-Shot Point Cloud Semantic Segmentation via Contrastive Self-Supervision and Multi-Resolution Attention
29 May 2023
TL;DR: Zhang et al. as mentioned in this paper proposed a contrastive self-supervision framework for few-shot learning pretrain, which aims to eliminate the feature extraction bias through class-agnostic contrastive supervision.
4
Copy and Paste Augmentation for Deformable Wiring Harness Bags Segmentation
Bare Luka Žagar,Alessio Caporali,Amadeusz Szymko,Piotr Kicki,Krzysztof Walas,Gianluca Palli,Alois Knoll +6 more
- 28 Jun 2023
TL;DR: An approach to generate a dataset of a specific object of interest, i.e. deformable wiring harness bags, with minimal effort employing the copy and paste technique is proposed, and the obtained dataset is validated on the semantic segmentation task in a real-world test setup.
2
Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation
29 May 2023
TL;DR: M2SKD as mentioned in this paper proposes a multi-to-single knowledge distillation framework for the 3D point cloud semantic segmentation task to boost the performance of those hard classes by fusing all the points of multi-scans directly, only the instances that belong to the previously defined hard classes are fused.
2
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments
Haodong Hong,Sen Wang,Zi Huang,Qi Wu,Jiajun Liu +4 more
- 31 Jul 2024
TL;DR: This study introduces R2R-UNO, a dataset with obstructed navigation graphs and visual observations, to evaluate Vision-and-Language Navigation methods' adaptability to unexpected obstructions, proposing ObVLN, a novel method that achieves robust performance in both obstructed and unobstructed scenarios.
Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation
Shoumeng Qiu,Feng Jiang,Haiqiang Zhang,Xiangyang Xue,Jian Pu +4 more
- 28 Apr 2023
TL;DR: M2SKD as mentioned in this paper proposes a multi-to-single knowledge distillation framework for the 3D point cloud semantic segmentation task to boost the performance of those hard classes by fusing all the points of multi-scans directly, only the instances that belong to the previously defined hard classes are fused.
1
References
A survey on Image Data Augmentation for Deep Learning
TL;DR: This survey will present existing methods for Data Augmentation, promising developments, and meta-level decisions for implementing DataAugmentation, a data-space solution to the problem of limited data.
nuScenes: A Multimodal Dataset for Autonomous Driving
Holger Caesar,Varun Bankiti,Alex H. Lang,Sourabh Vora,Venice Erin Liong,Qiang Xu,Anush Krishnan,Yu Pan,Giancarlo Baldan,Oscar Beijbom +9 more
- 14 Jun 2020
TL;DR: nuScenes as discussed by the authors is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view.
SECOND: Sparsely Embedded Convolutional Detection
Yan Yan,Yuxing Mao,Bo Li +2 more
TL;DR: An improved sparse convolution method for Voxel-based 3D convolutional networks is investigated, which significantly increases the speed of both training and inference and introduces a new form of angle loss regression to improve the orientation estimation performance.
3.2K
KPConv: Flexible and Deformable Convolution for Point Clouds
Hugues Thomas,Charles R. Qi,Jean-Emmanuel Deschaud,Beatriz Marcotegui,François Goulette,Leonidas J. Guibas +5 more
- 18 Apr 2019
TL;DR: KPConv is a new design of point convolution, i.e. that operates on point clouds without any intermediate representation, that outperform state-of-the-art classification and segmentation approaches on several datasets.
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
Chen Sun,Abhinav Shrivastava,Saurabh Singh,Abhinav Gupta +3 more
- 10 Jul 2017
TL;DR: In this paper, the authors investigated how the performance of current vision tasks would change if this data was used for representation learning and found that the performance on vision tasks increases logarithmically based on volume of training data size.