Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

Open AccessPosted Content

Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

- 05 Dec 2019

- arXiv: Computer Vision and Pattern Recog...

1K

TL;DR: An Adaptive Training Sample Selection (ATSS) to automatically select positive and negative samples according to statistical characteristics of object significantly improves the performance of anchor-based and anchor-free detectors and bridges the gap between them.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Book Chapter•10.1007/978-981-16-5912-6_88

Data Enhancement for Deep Learning-Based Wrist Fracture Detection

Weijie Huang, +4 more

- 23 Aug 2021

TL;DR: Wang et al. as discussed by the authors proposed a data enhancement method based on image mosaic, which is embedded into several existing deep learning frameworks for verification, and the experimental results show that the proposed data enhancing method is universal to the existing DNN frameworks, and their AP value will be improved by 3%.

...read moreread less

1

Journal Article•10.48550/arXiv.2206.05730

Object Occlusion of Adding New Categories in Objection Detection

Boyang Deng, +2 more

- 12 Jun 2022

- arXiv.org

TL;DR: This work performs a systematic study of the Object Occlusion data collection and augmentation methods where it is shown that the simple mechanism of object occlusion is good enough and can provide acceptable accuracy in real scenarios adding new category.

...read moreread less

1

Journal Article•10.1007/s11042-023-14990-1

Surgical action detection based on path aggregation adaptive spatial network

Zhen Chao, +4 more

- 08 Mar 2023

- Multimedia Tools and Applications

TL;DR: A path aggregation adaptive spatial feature pyramid network (PAAS-FPN), which combines bottom-up path enhancement and an adaptive spatial fusion mechanism, which achieves the highest detection accuracy in several experiments, thereby confirming its effectiveness in surgeon action detection.

...read moreread less

1

Journal Article•10.48550/arxiv.2310.03456

Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization

Edward Fish, +2 more

- 05 Oct 2023

- arXiv.org

TL;DR: The Multi-Resolution Audio-Visual Feature Fusion (MRAV-FF) is introduced, an innovative method to merge audio-visual data across different temporal resolutions and is compatible with existing FPN TAL architectures and offering a significant enhancement in performance when audio data is available.

...read moreread less

1

Journal Article•10.48550/arxiv.2401.18032

DROP: Decouple Re-Identification and Human Parsing with Task-specific Features for Occluded Person Re-identification

Shuguang Dou, +6 more

- 31 Jan 2024

- arXiv.org

TL;DR: The paper introduces the Decouple Re-identificatiOn and human Parsing (DROP) method for occluded person re-identification (ReID), arguing that the inferior performance of the former is due to distinct granularity requirements for ReID and human parsing features.

...read moreread less

1

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

64.4K

•Book Chapter•10.1007/978-3-319-10602-1_48

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

- 06 Sep 2014

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

51.7K

•Proceedings Article•10.1109/CVPR.2016.91

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

- 27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

45.7K

•Journal Article•10.1007/S11263-015-0816-Y

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015

- International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

41.6K