nuScenes: A Multimodal Dataset for Autonomous Driving

doi:10.1109/CVPR42600.2020.01164

Open Access•Proceedings Article•10.1109/CVPR42600.2020.01164

Holger Caesar, +9 more

- 14 Jun 2020

- pp 11621-11631

4K

TL;DR: nuScenes is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with a full 360-degree field of view.

Citations

Proceedings Article•10.1109/iccv51070.2023.00028

Rethinking Range View Representation for LiDAR Segmentation

Ling‐Dong Kong, +8 more

- 01 Oct 2023

TL;DR: Rethinking range view representation for LiDAR segmentation yields state-of-the-art performance by addressing key factors like many-to-one mapping, semantic incoherence, and shape deformation.

48

Journal Article•10.1109/tits.2022.3156011

Scenario Understanding and Motion Prediction for Autonomous Vehicles—Review and Comparison

- 01 Oct 2022

- IEEE Transactions on Intelligent Transpo...

TL;DR: This article reviews the state of the art in scenario understanding and motion prediction for autonomous driving and compares three specific prediction methods with respect to specific functional aspects and general requirements of applicability.

48

Journal Article•10.1109/mits.2023.3298534

Collaborative Perception in Autonomous Driving: Methods, Datasets, and Challenges

Yushan Han, +5 more

- 01 Nov 2023

- IEEE Intelligent Transportation Systems ...

TL;DR: Collaborative perception is crucial for autonomous driving because it addresses occlusion and sensor failure. Research in the area has grown recently, but few reviews have systematically covered collaboration modules and datasets. This article reviews recent achievements to bridge this gap and motivate future research.

48

Journal Article•10.1109/TIP.2021.3074306

MLDA-Net: Multi-Level Dual Attention-Based Network for Self-Supervised Monocular Depth Estimation

Xibin Song, +6 more

- 26 Apr 2021

- IEEE Transactions on Image Processing

TL;DR: This paper proposes a multi-level feature extraction (MLFE) strategy that can learn rich hierarchical representations, and a dual-attention strategy, combining global attention and structure attention, that intensifies the obtained features both globally and locally, resulting in improved depth maps with sharper boundaries.

48

Journal Article•10.3390/s22249577

A Survey on Deep-Learning-Based LiDAR 3D Object Detection for Autonomous Driving

Simegnew Yihunie Alaba, +1 more

- 07 Dec 2022

- Sensors

TL;DR: LiDAR is a commonly used sensor in autonomous driving that supports accurate, robust, and fast decision-making; it is used in the perception system, especially for object detection, to understand the driving environment.

48

References

Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: The authors propose a residual learning framework that eases the training of networks substantially deeper than those used previously; the approach won 1st place on the ILSVRC 2015 classification task.

198.7K

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The authors train a large, deep convolutional neural network that achieved state-of-the-art ImageNet classification results; it consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

88.4K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

75.9K

Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, enabling nearly cost-free region proposals, and further merges RPN and Fast R-CNN into a single network by sharing their convolutional features.

64.4K

Proceedings Article•10.1109/CVPR.2005.177

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

- 20 Jun 2005

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

36.7K

Related Papers (5)

Are we ready for autonomous driving? The KITTI vision benchmark suite

Vision meets robotics: The KITTI dataset

Deep Residual Learning for Image Recognition

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space