Online Vectorized HD Map Construction using Geometry

doi:10.48550/arxiv.2312.03341

Journal Article10.48550/arxiv.2312.03341

Online Vectorized HD Map Construction using Geometry

Zhixin Zhang, +4 more

- 06 Dec 2023

- arXiv.org

- Vol. abs/2312.03341

12

TL;DR: This work proposes GeMap, which end-to-end learns Euclidean shapes and relations of map instances beyond basic perception and achieves new state-of-the-art performance on the NuScenes and Argoverse 2 datasets.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arxiv.2311.15599

UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Xiaohan Ding, +6 more

- 27 Nov 2023

- arXiv.org

TL;DR: It is discovered that large kernels are the key to unlocking the exceptional performance of ConvNets in domains where they were originally not proficient, and the proposed model achieves state-of-the-art performance on time-series forecasting and audio recognition tasks even without modality-specific customization to the architecture.

...read moreread less

66

Journal Article•10.48550/arxiv.2403.15951

MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping

Jiacheng Chen, +4 more

- 23 Mar 2024

- arXiv.org

TL;DR: MapTracker is an algorithm for consistent vector HD mapping that uses strided memory fusion to ensure consistent reconstructions over time. It accumulates sensor streams into memory buffers of raster and vector latents, and leverages query propagation to associate tracked road elements from the previous frame to the current frame.

...read moreread less

8

Journal Article•10.48550/arxiv.2407.08726

Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data

Cherie Ho, +9 more

- 11 Jul 2024

TL;DR: This study introduces Map It Anywhere (MIA), a data engine that leverages large-scale public maps to enable generalizable Bird's Eye View (BEV) map prediction, outperforming baselines by 35% with zero-shot performance, and paving the way for robust autonomous navigation.

...read moreread less

1

Preprint•10.48550/arxiv.2406.13988

LGmap: Local-to-Global Mapping Network for Online Long-Range Vectorized HD Map Construction

Kaihua Wu, +4 more

- 20 Jun 2024

TL;DR: LGmap is an online mapping pipeline for constructing long-range HD maps. It utilizes SVT, HTF, and ped-crossing resampling techniques to achieve high stability and accuracy.

...read moreread less

1

Journal Article•10.1145/3690624.3709383

LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating

Deguo Xia, +8 more

- 06 Jan 2025

TL;DR: LDMapNet-U is an end-to-end system for city-scale lane-level map updating, leveraging a Prior-Map Encoding module and Instance Change Prediction module to simultaneously generate vectorized maps and detect changes, significantly reducing update cycles and improving map accuracy.

...read moreread less

References

•Posted Content

Focal Loss for Dense Object Detection

Tsung-Yi Lin, +4 more

- 07 Aug 2017

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes to address the extreme foreground-background class imbalance encountered during training of dense detectors by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples, and develops a novel Focal Loss, which focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training.

...read moreread less

16.7K

•Posted Content

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Mingxing Tan, +1 more

- 28 May 2019

- arXiv: Learning

TL;DR: A new scaling method is proposed that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient and is demonstrated the effectiveness of this method on scaling up MobileNets and ResNet.

...read moreread less

12K

•Posted Content

SGDR: Stochastic Gradient Descent with Warm Restarts

Ilya Loshchilov, +1 more

- 13 Aug 2016

- arXiv: Learning

TL;DR: In this paper, a simple warm restart technique for stochastic gradient descent was proposed to improve its anytime performance when training deep neural networks, which achieved state-of-the-art results on both the CIFAR-10 and CifAR-100 datasets.

...read moreread less

5.9K

•Posted Content

Deformable DETR: Deformable Transformers for End-to-End Object Detection

Xizhou Zhu, +5 more

- 08 Oct 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: Deformable DETR, whose attention modules only attend to a small set of key sampling points around a reference, can achieve better performance than DETR (especially on small objects) with 10$\times less training epochs.

...read moreread less

4.5K

•Posted Content

nuScenes: A multimodal dataset for autonomous driving

Holger Caesar, +9 more

- 26 Mar 2019

- arXiv: Learning

TL;DR: nuScenes as mentioned in this paper is the first dataset to carry the full autonomous vehicle sensor suite: 6 cameras, 5 radars and 1 lidar, all with full 360 degree field of view.

...read moreread less

3.7K

...

Expand