AugFPN: Improving Multi-Scale Feature Learning for Object Detection

doi:10.1109/CVPR42600.2020.01261

Open AccessProceedings Article10.1109/CVPR42600.2020.01261

AugFPN: Improving Multi-Scale Feature Learning for Object Detection

Chaoxu Guo, +4 more

- 14 Jun 2020

- pp 12595-12604

398

TL;DR: Guo et al. as discussed by the authors proposed a new feature pyramid architecture named AugFPN, which consists of three components: Consistent Supervision, Residual Feature Augmentation, and Soft RoI Selection.

Abstract: Current state-of-the-art detectors typically exploit feature pyramid to detect objects at different scales. Among them, FPN is one of the representative works that build a feature pyramid by multi-scale features summation. However, the design defects behind prevent the multi-scale features from being fully exploited. In this paper, we begin by first analyzing the design defects of feature pyramid in FPN, and then introduce a new feature pyramid architecture named AugFPN to address these problems. Specifically, AugFPN consists of three components: Consistent Supervision, Residual Feature Augmentation, and Soft RoI Selection. AugFPN narrows the semantic gaps between features of different scales before feature fusion through Consistent Supervision. In feature fusion, ratio-invariant context information is extracted by Residual Feature Augmentation to reduce the information loss of feature map at the highest pyramid level. Finally, Soft RoI Selection is employed to learn a better RoI feature adaptively after feature fusion. By replacing FPN with AugFPN in Faster R-CNN, our models achieve 2.3 and 1.6 points higher Average Precision (AP) when using ResNet50 and MobileNet-v2 as backbone respectively. Furthermore, AugFPN improves RetinaNet by 1.6 points AP and FCOS by 0.9 points AP when using ResNet50 as backbone. Codes are available on https://github.com/Gus-Guo/AugFPN.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/CVPR46437.2021.01008

DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution

Siyuan Qiao, +2 more

- 01 Jun 2021

TL;DR: DetectoRS as mentioned in this paper proposes recursive feature pyramid, which incorporates extra feedback connections from Feature Pyramid Networks into the bottom-up backbone layers, and switchable atrous convolution which convolves the features with different atrous rates and gathers the results using switch functions.

...read moreread less

739

•Posted Content

DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution.

Siyuan Qiao, +2 more

- 03 Jun 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes Recursive Feature Pyramid, which incorporates extra feedback connections from Feature Pyramid Networks into the bottom-up backbone layers and proposes Switchable Atrous Convolution, which convolves the features with different atrous rates and gathers the results using switch functions.

...read moreread less

609

Journal Article•10.1007/978-3-031-72751-1_1

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Chien-Yao Wang, +2 more

- 25 Oct 2024

- Lecture Notes in Computer Science

379

Journal Article•10.1109/tgrs.2021.3133956

ABNet: Adaptive Balanced Network for Multiscale Object Detection in Remote Sensing Imagery

01 Jan 2022

- IEEE Transactions on Geoscience and Remo...

TL;DR: Zhang et al. as mentioned in this paper proposed an adaptive balanced network (ABNet) to improve the feature representation ability of the backbone, which can alleviate the obstacles of complex background on foreground objects.

...read moreread less

142

Journal Article•10.1109/smc53992.2023.10394415

AFPN: Asymptotic Feature Pyramid Network for Object Detection

Guoyu Yang, +5 more

- 01 Oct 2023

TL;DR: AFPN is a novel feature pyramid network that incorporates asymptotic feature fusion and adaptive spatial fusion to enhance object detection performance.

...read moreread less

140

...

Expand

References

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

64.4K

•Proceedings Article•10.1109/CVPR.2016.91

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

- 27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

45.7K

•Book Chapter•10.1007/978-3-319-46448-0_2

SSD: Single Shot MultiBox Detector

Wei Liu, +6 more

- 08 Oct 2016

TL;DR: The approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location, which makes SSD easy to train and straightforward to integrate into systems that require a detection component.

...read moreread less

35.5K

•Proceedings Article•10.1109/CVPR.2014.81

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, +3 more

- 23 Jun 2014

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.

...read moreread less

33.7K

•Proceedings Article•10.1109/CVPR.2017.106

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, +5 more

- 21 Jul 2017

TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.

...read moreread less

29.5K

...

Expand

AugFPN: Improving Multi-Scale Feature Learning for Object Detection

Chat with Paper

AI Agents for this Paper

Citations

DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution

DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution.

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

ABNet: Adaptive Balanced Network for Multiscale Object Detection in Remote Sensing Imagery

AFPN: Asymptotic Feature Pyramid Network for Object Detection

References

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

You Only Look Once: Unified, Real-Time Object Detection

SSD: Single Shot MultiBox Detector

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Feature Pyramid Networks for Object Detection

Related Papers (5)

Feature Pyramid Networks for Object Detection

Deep Residual Learning for Image Recognition

SSD: Single Shot MultiBox Detector

Focal Loss for Dense Object Detection

Microsoft COCO: Common Objects in Context