nuScenes: A Multimodal Dataset for Autonomous Driving

doi:10.1109/CVPR42600.2020.01164

Open AccessProceedings Article10.1109/CVPR42600.2020.01164

nuScenes: A Multimodal Dataset for Autonomous Driving

Holger Caesar, +9 more

- 14 Jun 2020

- pp 11621-11631

4K

TL;DR: nuScenes as discussed by the authors is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/iccv51070.2023.00033

HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative Perception with Vision Transformer

Hao Xiang, +2 more

- 01 Oct 2023

TL;DR: HM-ViT is a novel framework for multi-agent hetero-modal cooperative perception in V2V scenarios, enabling collaborative object prediction with distinct sensor modalities.

...read moreread less

16

•Proceedings Article•10.1109/IV47402.2020.9304819

Sense–Assess–eXplain (SAX): Building Trust in Autonomous Vehicles in Challenging Real-World Driving Scenarios

Matthew Gadd, +4 more

- 19 Oct 2020

TL;DR: In this article, the authors present how to build robots that can robustly sense and interpret their environment using traditional as well as unconventional sensors; assess their own capabilities; and vitally in the purpose of assurance and trust, can provide causal explanations of their interpretations and assessments.

...read moreread less

16

Journal Article•10.1109/tits.2022.3149370

Towards Compact Autonomous Driving Perception With Balanced Learning and Multi-Sensor Fusion

01 Sep 2022

- IEEE Transactions on Intelligent Transpo...

TL;DR: Zhang et al. as mentioned in this paper proposed a compact deep multi-task learning model to handle various autonomous driving perception tasks in one forward pass, which performs multiple views of semantic segmentation, depth estimation, light detection and ranging (LiDAR) segmentation and bird's eye view projection simultaneously without being supported by other models.

...read moreread less

16

Review•10.1109/access.2023.3312382

Radars for Autonomous Driving: A Review of Deep Learning Methods and Challenges

Arvind Srivastav, +1 more

- 01 Jan 2023

- IEEE Access

TL;DR: Radar data presents challenges for deep learning due to its low resolution, sparsity, clutter, and lack of datasets. Under-utilization of radar capabilities limits autonomous perception. This review aims to encourage further research on autonomous radar data.

...read moreread less

16

•Journal Article•10.1007/s11263-021-01554-9

SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds

Qingyong Hu, +5 more

- 04 Jan 2022

- International Journal of Computer Vision

TL;DR: In this paper , the authors introduce SensatUrban, an urban-scale UAV photogrammetry point cloud dataset consisting of nearly three billion points collected from three UK cities, covering 7.6 km.

...read moreread less

16

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

64.4K

•Proceedings Article•10.1109/CVPR.2005.177

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

- 20 Jun 2005

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

36.7K

...

Expand

nuScenes: A Multimodal Dataset for Autonomous Driving

Chat with Paper

AI Agents for this Paper

Citations

HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative Perception with Vision Transformer

Sense–Assess–eXplain (SAX): Building Trust in Autonomous Vehicles in Challenging Real-World Driving Scenarios

Towards Compact Autonomous Driving Perception With Balanced Learning and Multi-Sensor Fusion

Radars for Autonomous Driving: A Review of Deep Learning Methods and Challenges

SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds

References

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet: A large-scale hierarchical image database

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Histograms of oriented gradients for human detection

Related Papers (5)

Are we ready for autonomous driving? The KITTI vision benchmark suite

Vision meets robotics: The KITTI dataset

Deep Residual Learning for Image Recognition

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space