Objects as Points

Open Access • Posted Content

- 16 Apr 2019

- arXiv: Computer Vision and Pattern Recog...

2.6K

TL;DR: CenterNet, a center-point-based approach, is end-to-end differentiable and is simpler, faster, and more accurate than corresponding bounding-box-based detectors; it performs competitively with sophisticated multi-stage methods and runs in real time.
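
To make the center-point idea concrete, here is a minimal sketch of how boxes might be read off a predicted center heatmap. It assumes a single class, a heatmap given as nested lists, and a per-location (width, height) prediction. The function name `decode_centers`, the `threshold` value, and the data layout are illustrative assumptions, not the paper's code; sub-pixel offsets and output stride are left out.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float, float]  # (x1, y1, x2, y2, score)


def decode_centers(
    heatmap: List[List[float]],
    sizes: List[List[Tuple[float, float]]],
    threshold: float = 0.3,  # hypothetical score cut-off
) -> List[Box]:
    """Turn local peaks of a center heatmap into boxes.

    heatmap[y][x] is the center score at a location and sizes[y][x] is the
    predicted (width, height) there. A location is kept when its score is
    at least `threshold` and no smaller than any score in its 3x3
    neighbourhood, which takes the place of box-level NMS.
    """
    rows, cols = len(heatmap), len(heatmap[0])
    boxes: List[Box] = []
    for y in range(rows):
        for x in range(cols):
            score = heatmap[y][x]
            if score < threshold:
                continue
            # The window includes the location itself, so a peak equals the max.
            window = [
                heatmap[ny][nx]
                for ny in range(max(0, y - 1), min(rows, y + 2))
                for nx in range(max(0, x - 1), min(cols, x + 2))
            ]
            if score < max(window):
                continue
            w, h = sizes[y][x]
            boxes.append((x - w / 2, y - h / 2, x + w / 2, y + h / 2, score))
    # Highest-scoring detections first.
    return sorted(boxes, key=lambda b: -b[4])


heat = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.9, 0.5, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.6],
]
size = [[(2.0, 2.0)] * 4 for _ in range(4)]
detections = decode_centers(heat, size)
```

In this toy input the 0.5 response next to the 0.9 peak is above the threshold but is discarded because it is not a local maximum, so only two boxes remain.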

Citations

•Posted Content

End-to-End Object Detection with Transformers

Nicolas Carion, +5 more

- 26 May 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a new method that views object detection as a direct set prediction problem, and demonstrates accuracy and run-time performance on par with the well-established and highly-optimized Faster RCNN baseline on the challenging COCO object detection dataset.

9.1K

•Book Chapter•10.1007/978-3-030-58452-8_13

End-to-End Object Detection with Transformers

Nicolas Carion, +5 more

- 23 Aug 2020

TL;DR: DETR proposes a set-based global loss that forces unique predictions via bipartite matching, and a transformer encoder-decoder architecture that directly outputs the final set of predictions in parallel.

8.5K
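
The bipartite matching mentioned in the summary above can be illustrated on a tiny example. The sketch below assumes boxes are plain tuples and uses an L1 distance as the only matching cost; it searches all assignments by brute force, which is a stand-in for the Hungarian algorithm and only practical for a handful of boxes. The name `match` and the cost choice are assumptions made for illustration.

```python
from itertools import permutations
from typing import Sequence, Tuple


def match(
    preds: Sequence[Sequence[float]],
    targets: Sequence[Sequence[float]],
) -> Tuple[Tuple[int, ...], float]:
    """Find the one-to-one assignment of predictions to targets with minimal cost.

    Returns (assignment, total_cost), where assignment[j] is the index of the
    prediction matched to target j. Each prediction is used at most once,
    which is what forces unique predictions per object.
    Assumes len(preds) >= len(targets).
    """

    def cost(p: Sequence[float], t: Sequence[float]) -> float:
        # Hypothetical cost: L1 distance between box coordinates only.
        return sum(abs(a - b) for a, b in zip(p, t))

    best: Tuple[int, ...] = ()
    best_cost = float("inf")
    for perm in permutations(range(len(preds)), len(targets)):
        total = sum(cost(preds[i], targets[j]) for j, i in enumerate(perm))
        if total < best_cost:
            best, best_cost = perm, total
    return best, best_cost


predictions = [(0.5, 0.5, 0.2, 0.2), (0.1, 0.1, 0.1, 0.1), (0.9, 0.9, 0.3, 0.3)]
ground_truth = [(0.1, 0.1, 0.1, 0.1), (0.5, 0.5, 0.2, 0.2)]
assignment, total_cost = match(predictions, ground_truth)
```

Predictions left unmatched (index 2 here) would be trained toward a "no object" label in a set-prediction setup.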

•Proceedings Article•10.1109/CVPR42600.2020.01079

EfficientDet: Scalable and Efficient Object Detection

Mingxing Tan, +2 more

- 14 Jun 2020

TL;DR: EfficientDet proposes a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multi-scale feature fusion, and a compound scaling method that uniformly scales the resolution, depth, and width of the backbone, feature network, and box/class prediction networks at the same time.

7.2K
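
As a rough illustration of weighted feature fusion, the sketch below combines several same-sized feature vectors using learnable non-negative weights normalized by their sum. It assumes features are flat lists that have already been resized to a common shape; the function name `fuse` and the `eps` value are illustrative choices, and the convolution that would follow fusion in a real network is omitted.

```python
from typing import List, Sequence


def fuse(
    features: Sequence[Sequence[float]],
    weights: Sequence[float],
    eps: float = 1e-4,  # small constant to avoid division by zero
) -> List[float]:
    """Weighted fusion: out = sum_i(w_i * f_i) / (eps + sum_i(w_i)).

    Weights are clamped at zero (a ReLU) so that each normalized weight
    lies between 0 and 1 without needing a softmax.
    """
    w = [max(0.0, x) for x in weights]
    norm = eps + sum(w)
    length = len(features[0])
    return [
        sum(wi * f[k] for wi, f in zip(w, features)) / norm
        for k in range(length)
    ]


coarse = [1.0, 2.0]  # e.g. an upsampled coarser-level feature
fine = [3.0, 4.0]    # e.g. the same-level input feature
equal = fuse([coarse, fine], [1.0, 1.0])
only_coarse = fuse([coarse, fine], [1.0, -5.0])  # negative weight is clamped to 0
```

With equal weights the result is close to the element-wise mean; when one weight is driven negative, that input simply drops out of the fusion.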

•Proceedings Article•10.1109/CVPRW50498.2020.00203

CSPNet: A New Backbone that can Enhance Learning Capability of CNN

Chien-Yao Wang, +5 more

- 14 Jun 2020

TL;DR: Cross Stage Partial Network (CSPNet) integrates feature maps from the beginning and the end of a network stage to mitigate the problem of duplicate gradient information during network optimization.

4.2K

•Journal Article•10.48550/arXiv.2207.02696

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Chien-Yao Wang, +2 more

- 06 Jul 2022

- arXiv.org

TL;DR: YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 160 FPS and has the highest accuracy 56.8% AP among all known real-time object detectors with 30 FPS or higher on GPU V100.

3.7K

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: This work proposes a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the resulting residual networks won 1st place on the ILSVRC 2015 classification task.

198.7K

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

138.5K
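
The "adaptive estimates of lower-order moments" in the summary above refer to running averages of the gradient and of its square. A minimal scalar sketch of the update follows; the helper names `adam_step` and `minimize` are hypothetical, and the learning rate of 0.1 is chosen only to make the toy problem converge quickly, not as a recommended setting.

```python
import math
from typing import Callable, Tuple


def adam_step(
    x: float, grad: float, m: float, v: float, t: int,
    lr: float = 0.1, beta1: float = 0.9, beta2: float = 0.999, eps: float = 1e-8,
) -> Tuple[float, float, float]:
    """One Adam update for a scalar parameter; t is the 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad          # first-moment (mean) estimate
    v = beta2 * v + (1 - beta2) * grad * grad   # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                # bias correction for zero init
    v_hat = v / (1 - beta2 ** t)
    x = x - lr * m_hat / (math.sqrt(v_hat) + eps)
    return x, m, v


def minimize(grad_fn: Callable[[float], float], x0: float, steps: int) -> float:
    """Run Adam from x0 for a fixed number of steps and return the final x."""
    x, m, v = x0, 0.0, 0.0
    for t in range(1, steps + 1):
        x, m, v = adam_step(x, grad_fn(x), m, v, t)
    return x


# Toy objective f(x) = (x - 3)^2, whose gradient is 2 * (x - 3).
first_x, _, _ = adam_step(0.0, -6.0, 0.0, 0.0, 1)
final_x = minimize(lambda x: 2.0 * (x - 3.0), 0.0, 200)
```

Because of the bias correction, the very first step has magnitude close to the learning rate regardless of the gradient's scale, which is one reason the method is insensitive to gradient rescaling.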

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

117.9K

•Book Chapter•10.1007/978-3-319-10602-1_48

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

- 06 Sep 2014

TL;DR: This work presents a new dataset that aims to advance the state of the art in object recognition by placing it in the context of the broader question of scene understanding, gathering images of complex everyday scenes that contain common objects in their natural context.

51.7K

•Proceedings Article•10.1109/CVPR.2016.91

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

- 27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

45.7K

Related Papers (5)

Deep Residual Learning for Image Recognition

SSD: Single Shot MultiBox Detector

Microsoft COCO: Common Objects in Context

Feature Pyramid Networks for Object Detection

You Only Look Once: Unified, Real-Time Object Detection