Open AccessPosted Content
R-FCN: Object Detection via Region-based Fully Convolutional Networks
TL;DR: This work presents region-based, fully convolutional networks for accurate and efficient object detection, and proposes position-sensitive score maps to address a dilemma between translation-invariance in image classification and translation-variance in object detection.
read more
Abstract: We present region-based, fully convolutional networks for accurate and efficient object detection. In contrast to previous region-based detectors such as Fast/Faster R-CNN that apply a costly per-region subnetwork hundreds of times, our region-based detector is fully convolutional with almost all computation shared on the entire image. To achieve this goal, we propose position-sensitive score maps to address a dilemma between translation-invariance in image classification and translation-variance in object detection. Our method can thus naturally adopt fully convolutional image classifier backbones, such as the latest Residual Networks (ResNets), for object detection. We show competitive results on the PASCAL VOC datasets (e.g., 83.6% mAP on the 2007 set) with the 101-layer ResNet. Meanwhile, our result is achieved at a test-time speed of 170ms per image, 2.5-20x faster than the Faster R-CNN counterpart. Code is made publicly available at: this https URL
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning
Xiaopeng Yan,Ziliang Chen,Anni Xu,Xiaoxi Wang,Xiaodan Liang,Liang Lin +5 more
- 28 Sep 2019
TL;DR: Meta R-CNN as discussed by the authors proposes meta-learning over RoI (Region-of-Interest) features instead of a full image feature, which disentangles multi-object information merged with the background, without bells and whistles.
Style Aggregated Network for Facial Landmark Detection
Xuanyi Dong,Yan Yan,Wanli Ouyang,Yi Yang +3 more
- 18 Jun 2018
TL;DR: Zhang et al. as discussed by the authors proposed a style-aggregated approach to deal with the large intrinsic variance of image styles for facial landmark detection, where the original face images accompanying with styleaggaggaggated ones play a duet to train a landmark detector which is complementary to each other.
Adaptive NMS: Refining Pedestrian Detection in a Crowd
Songtao Liu,Di Huang,Yunhong Wang +2 more
- 07 Apr 2019
TL;DR: This paper proposes adaptive-NMS, which applies a dynamic suppression threshold to an instance, according to the target density, and designs an efficient subnetwork to learn density scores, which can be conveniently embedded into both the single-stage and two-stage detectors.
Grounded Language-Image Pre-training
01 Jun 2022
TL;DR: GLIP as mentioned in this paper unifies object detection and phrase grounding for pre-training, which can leverage massive image-text pairs by generating grounding boxes in a self-training fashion, making the learned representations semantic-rich.
405
AugFPN: Improving Multi-Scale Feature Learning for Object Detection
Chaoxu Guo,Bin Fan,Qian Zhang,Shiming Xiang,Chunhong Pan +4 more
- 14 Jun 2020
TL;DR: Guo et al. as discussed by the authors proposed a new feature pyramid architecture named AugFPN, which consists of three components: Consistent Supervision, Residual Feature Augmentation, and Soft RoI Selection.
References
•Proceedings Article
Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs
Liang-Chieh Chen,George Papandreou,Iasonas Kokkinos,Kevin Murphy,Alan L. Yuille +4 more
- 07 May 2015
TL;DR: DeepLab as mentioned in this paper combines the responses at the final layer with a fully connected CRF to localize segment boundaries at a level of accuracy beyond previous methods, achieving 71.6% IOU accuracy in the test set.
2.4K
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Sean Bell,C. Lawrence Zitnick,Kavita Bala,Ross Girshick +3 more
- 01 Jun 2016
TL;DR: The Inside-Outside Net (ION), an object detector that exploits information both inside and outside the region of interest, provides strong evidence that context and multi-scale representations improve small object detection.
•Posted Content
Instance-sensitive Fully Convolutional Networks
TL;DR: This paper develops FCNs that are capable of proposing instance-level segment candidates that do not have any high-dimensional layer related to the mask resolution, but instead exploits image local coherence for estimating instances.
288
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014