Context‐guided ground truth sampling for multi‐modality data augmentation in autonomous driving
5
About: This article is published in Iet Intelligent Transport Systems. The article was published on 08 Sep 2022. and is currently open access. The article focuses on the topics: Ground truth & Context (archaeology).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Object-Level Data Augmentation for Deep Learning-Based Obstacle Detection in Railways
TL;DR: A novel deep learning (DL)-based rail track detection that enables context- and scale-aware obstacle-level data augmentation and demonstrates significant improvement of detection of distant objects by augmentation of training dataset with synthetic images.
6
PSCO: A Point Cloud Scene Classification Model Based on Contrast Learning
Nuo Cheng,Xiaohan Li,Chuanyu Luo,Xiaotong Liu,Han Li,Sheng Lei,Pu Li +6 more
- 08 Oct 2023
TL;DR: A self-supervised method and a corresponding package to classify driving scenes in large-scale point cloud data using a contrastive learning approach which encodes each single frame of the collected raw data into a feature vector that can be effectively classified.
U2D2PCB: Uncertainty-aware Unsupervised Defect Detection on PCB Images Using Reconstructive and Discriminative Models
Qiman Wu,Jin Zhang,H. Xia,Pengrong Lin,Yong Wang,Mengke Tian,Rencheng Song +6 more
TL;DR: The defect detection of printed circuit board (PCB) images faces challenges such as limited sample number, imbalanced sample types, and varying detection reliability, so an uncertainty-aware unsupervised detection model on PCB images is proposed.
Multimodal Seed Data Augmentation for Low-Resource Audio Latin Cuengh Language
Lanlan Jiang,Xingguo Qin,Jingwei Zhang,Jun Li +3 more
TL;DR: This study introduces a multimodal seed data augmentation model to enhance intelligent recognition and comprehension of Latin Cuengh, a low-resource dialect, by leveraging speech and bilingual textual data, achieving a 3% accuracy increase and improved training efficiency.
Exploring 3D Object Detection for Autonomous Factory Driving: Advanced Research on Handling Limited Annotations with Ground Truth Sampling Augmentation
Matthias Reuse,Karl Amende,Martin Simon,Bernhard Sick +3 more
- 18 Feb 2024
TL;DR: Limited annotations in 3D object detection for autonomous factory driving are addressed through ground truth sampling augmentation. The effectiveness of various augmentation techniques is explored, revealing that polygon-based augmentation yields the best results.
References
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin,Michael Maire,Serge Belongie,James Hays,Pietro Perona,Deva Ramanan,Piotr Dollár,C. Lawrence Zitnick +7 more
- 06 Sep 2014
TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Are we ready for autonomous driving? The KITTI vision benchmark suite
Andreas Geiger,Philip Lenz,Raquel Urtasun +2 more
- 16 Jun 2012
TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts,Mohamed Omran,Sebastian Ramos,Timo Rehfeld,Markus Enzweiler,Rodrigo Benenson,Uwe Franke,Stefan Roth,Bernt Schiele +8 more
- 01 Jun 2016
TL;DR: This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.
11.5K
nuScenes: A Multimodal Dataset for Autonomous Driving
Holger Caesar,Varun Bankiti,Alex H. Lang,Sourabh Vora,Venice Erin Liong,Qiang Xu,Anush Krishnan,Yu Pan,Giancarlo Baldan,Oscar Beijbom +9 more
- 14 Jun 2020
TL;DR: nuScenes as discussed by the authors is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view.