nuScenes: A Multimodal Dataset for Autonomous Driving

doi:10.1109/CVPR42600.2020.01164

Open AccessProceedings Article10.1109/CVPR42600.2020.01164

nuScenes: A Multimodal Dataset for Autonomous Driving

Holger Caesar, +9 more

- 14 Jun 2020

- pp 11621-11631

4K

TL;DR: nuScenes as discussed by the authors is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds

Rui Huang, +6 more

- 24 Jul 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes a sparse LSTM-based multi-frame 3d object detection algorithm that outperforms the traditional frame by frame approach by 7.5% mAP@0.7 and other multi- frame approaches by 1.2% while using less memory and computation per frame.

...read moreread less

56

Journal Article•10.1016/j.robot.2024.104630

Path planning algorithms in the autonomous driving system: A comprehensive review

M. Reda, +3 more

- 01 Jan 2024

- Robotics and Autonomous Systems

TL;DR: This comprehensive review of autonomous driving systems focuses on path planning, categorizing techniques into traditional, machine/deep learning, and meta-heuristic optimization methods, highlighting their advantages, drawbacks, and future trends in autonomous vehicle development.

...read moreread less

55

Proceedings Article•10.1145/3503161.3547859

Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection

Zehui Chen, +5 more

- 25 Apr 2022

TL;DR: Graph-DETR3D is proposed to automatically aggregate multi-view imagery information through graph structure learning and benefits from a novel depth-invariant multi-scale training strategy, which maintains the visual depth consistency by simultaneously scaling the image size and the object depth.

...read moreread less

55

•Journal Article•10.1145/3579642

A Survey on Automated Driving System Testing: Landscapes and Trends

Shuncheng Tang, +10 more

- 13 Jun 2022

- ACM Transactions on Software Engineering...

TL;DR: A threat model is built that reveals the potential safety threats for each module of an ADS, and the challenges and opportunities in ADS testing are identified, which facilitates the future research in this field.

...read moreread less

55

Journal Article•10.1109/cvpr52729.2023.00678

CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP

Runnan Chen, +8 more

- 01 Jun 2023

TL;DR: CLIP2Scene transfers CLIP knowledge to 3D scene understanding, achieving impressive performance on annotation-free and fine-tuning tasks.

...read moreread less

55

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

64.4K

•Proceedings Article•10.1109/CVPR.2005.177

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

- 20 Jun 2005

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

36.7K

...

Expand

nuScenes: A Multimodal Dataset for Autonomous Driving

Chat with Paper

AI Agents for this Paper

Citations

An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds

Path planning algorithms in the autonomous driving system: A comprehensive review

Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection

A Survey on Automated Driving System Testing: Landscapes and Trends

CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP

References

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet: A large-scale hierarchical image database

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Histograms of oriented gradients for human detection

Related Papers (5)

Are we ready for autonomous driving? The KITTI vision benchmark suite

Vision meets robotics: The KITTI dataset

Deep Residual Learning for Image Recognition

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space