NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
Golnaz Ghiasi,Tsung-Yi Lin,Quoc V. Le +2 more
- 16 Apr 2019
- pp 7036-7045
TL;DR: The adopted Neural Architecture Search is adopted and a new feature pyramid architecture in a novel scalable search space covering all cross-scale connections is discovered, named NAS-FPN, which achieves better accuracy and latency tradeoff compared to state-of-the-art object detection models.
read more
Abstract: Current state-of-the-art convolutional architectures for object detection are manually designed. Here we aim to learn a better architecture of feature pyramid network for object detection. We adopt Neural Architecture Search and discover a new feature pyramid architecture in a novel scalable search space covering all cross-scale connections. The discovered architecture, named NAS-FPN, consists of a combination of top-down and bottom-up connections to fuse features across scales. NAS-FPN, combined with various backbone models in the RetinaNet framework, achieves better accuracy and latency tradeoff compared to state-of-the-art object detection models. NAS-FPN improves mobile detection accuracy by 2 AP compared to state-of-the-art SSDLite with MobileNetV2 model in [32] and achieves 48.3 AP which surpasses Mask R-CNN [10] detection accuracy with less computation time.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Self-attention and long-range relationship capture network for underwater object detection
Ziran Gao,Yanli Shi,Sha Li +2 more
TL;DR: This paper proposes a self-attention and long-range relationship capture network for underwater object detection, achieving state-of-the-art performance on four datasets with a highest mAP of 88.7% on RUOD, outperforming prior methods by 1.2-8.5 percentage points.
6
Multi-scale Attention-Based Feature Pyramid Networks for Object Detection.
Xiaodong Zhao,Junliang Chen,Minmin Liu,Kai Ye,Linlin Shen +4 more
- 06 Aug 2021
TL;DR: Wang et al. as discussed by the authors proposed the Multi-scale Attention-based Feature Pyramid Networks (MAFPN) to exploit spatial and channel information and generate a better feature representation for each level from multi-scale features.
5
Light-YOLOv5: A Lightweight Drone Detector for Resource-Constrained Cameras
Jing Han,Ran Cao,Alessandro Brighente,Mauro Conti +3 more
TL;DR: Light-YOLOv5 is a lightweight drone detector for resource-constrained cameras that achieves better classification than other state-of-the-art solutions while being more efficient.
5
An Improved Tiered Head Pose Estimation Network with Self-Adjust Loss Function
Xiaoliang Zhu,Qiaolai Yang,Liang Zhao,Zhicheng Dai,Zili He,Wenting Rong,Junyi Sun,Gendong Liu +7 more
TL;DR: A THESL-Net (tiered head pose estimation with self-adjustment loss network) model is proposed, gaining a greater freedom during angle estimation and outperforms the state-of-the-art approaches.
Bidirectional Parallel Feature Pyramid Network for Object Detection
01 Jan 2022
TL;DR: BPFPN as mentioned in this paper proposes a cross-level Channel Attention-Refinement (ClCSAR) block and Weighted Parallel Feature Aggregation (WPFA) block to capture multi-scale spatial information from each level of FPN effectively.
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger,Philipp Fischer,Thomas Brox +2 more
- 05 Oct 2015
TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
Densely Connected Convolutional Networks
Gao Huang,Zhuang Liu,Laurens van der Maaten,Kilian Q. Weinberger +3 more
- 21 Jul 2017
TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.
SSD: Single Shot MultiBox Detector
Wei Liu,Dragomir Anguelov,Dumitru Erhan,Christian Szegedy,Scott Reed,Cheng-Yang Fu,Alexander C. Berg +6 more
- 08 Oct 2016
TL;DR: The approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location, which makes SSD easy to train and straightforward to integrate into systems that require a detection component.
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin,Piotr Dollár,Ross Girshick,Kaiming He,Bharath Hariharan,Serge Belongie +5 more
- 21 Jul 2017
TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
Tsung-Yi Lin,Priya Goyal,Ross Girshick,Kaiming He,Piotr Dollár +4 more
- 07 Aug 2017