R-FCN: Object Detection via Region-based Fully Convolutional Networks

Open AccessPosted Content

R-FCN: Object Detection via Region-based Fully Convolutional Networks

- 20 May 2016

- arXiv: Computer Vision and Pattern Recog...

2.4K

TL;DR: This work presents region-based, fully convolutional networks for accurate and efficient object detection, and proposes position-sensitive score maps to address a dilemma between translation-invariance in image classification and translation-variance in object detection.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1109/ICCV.2019.00967

Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning

Xiaopeng Yan, +5 more

- 28 Sep 2019

TL;DR: Meta R-CNN as discussed by the authors proposes meta-learning over RoI (Region-of-Interest) features instead of a full image feature, which disentangles multi-object information merged with the background, without bells and whistles.

...read moreread less

436

•Proceedings Article•10.1109/CVPR.2018.00047

Style Aggregated Network for Facial Landmark Detection

Xuanyi Dong, +3 more

- 18 Jun 2018

TL;DR: Zhang et al. as discussed by the authors proposed a style-aggregated approach to deal with the large intrinsic variance of image styles for facial landmark detection, where the original face images accompanying with styleaggaggaggated ones play a duet to train a landmark detector which is complementary to each other.

...read moreread less

417

•Proceedings Article•10.1109/CVPR.2019.00662

Adaptive NMS: Refining Pedestrian Detection in a Crowd

Songtao Liu, +2 more

- 07 Apr 2019

TL;DR: This paper proposes adaptive-NMS, which applies a dynamic suppression threshold to an instance, according to the target density, and designs an efficient subnetwork to learn density scores, which can be conveniently embedded into both the single-stage and two-stage detectors.

...read moreread less

407

•Proceedings Article•10.1109/cvpr52688.2022.01069

Grounded Language-Image Pre-training

01 Jun 2022

TL;DR: GLIP as mentioned in this paper unifies object detection and phrase grounding for pre-training, which can leverage massive image-text pairs by generating grounding boxes in a self-training fashion, making the learned representations semantic-rich.

...read moreread less

405

•Proceedings Article•10.1109/CVPR42600.2020.01261

AugFPN: Improving Multi-Scale Feature Learning for Object Detection

Chaoxu Guo, +4 more

- 14 Jun 2020

TL;DR: Guo et al. as discussed by the authors proposed a new feature pyramid architecture named AugFPN, which consists of three components: Consistent Supervision, Residual Feature Augmentation, and Soft RoI Selection.

...read moreread less

401

...

Expand

References

•Proceedings Article

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Liang-Chieh Chen, +4 more

- 07 May 2015

TL;DR: DeepLab as mentioned in this paper combines the responses at the final layer with a fully connected CRF to localize segment boundaries at a level of accuracy beyond previous methods, achieving 71.6% IOU accuracy in the test set.

...read moreread less

2.4K

•Proceedings Article•10.1109/CVPR.2016.314

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

Sean Bell, +3 more

- 01 Jun 2016

TL;DR: The Inside-Outside Net (ION), an object detector that exploits information both inside and outside the region of interest, provides strong evidence that context and multi-scale representations improve small object detection.

...read moreread less

1.6K

•Posted Content

Instance-sensitive Fully Convolutional Networks

Jifeng Dai, +4 more

- 29 Mar 2016

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper develops FCNs that are capable of proposing instance-level segment candidates that do not have any high-dimensional layer related to the mask resolution, but instead exploits image local coherence for estimating instances.

...read moreread less

288

R-FCN: Object Detection via Region-based Fully Convolutional Networks

Chat with Paper

AI Agents for this Paper

Citations

Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning

Style Aggregated Network for Facial Landmark Detection

Adaptive NMS: Refining Pedestrian Detection in a Crowd

Grounded Language-Image Pre-training

AugFPN: Improving Multi-Scale Feature Learning for Object Detection

References

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

Instance-sensitive Fully Convolutional Networks

Related Papers (5)

SSD: Single Shot MultiBox Detector

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

You Only Look Once: Unified, Real-Time Object Detection

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks