Light-Head R-CNN: In Defense of Two-Stage Object Detector.

Open AccessPosted Content

Light-Head R-CNN: In Defense of Two-Stage Object Detector.

- 20 Nov 2017

- arXiv: Computer Vision and Pattern Recog...

406

TL;DR: The authors' ResNet-101 based light-head R-CNN outperforms state-of-art object detectors on COCO while keeping time efficiency and significantly outperforming the single-stage, fast detectors like YOLO and SSD on both speed and accuracy.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book Chapter•10.1007/978-3-030-01264-9_8

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Ningning Ma, +3 more

- 08 Sep 2018

TL;DR: ShuffleNet V2 as discussed by the authors proposes to evaluate the direct metric on the target platform, beyond only considering FLOPs, based on a series of controlled experiments, and derives several practical guidelines for efficient network design.

...read moreread less

6.6K

•Proceedings Article•10.1109/CVPRW50498.2020.00203

CSPNet: A New Backbone that can Enhance Learning Capability of CNN

Chien-Yao Wang, +5 more

- 14 Jun 2020

TL;DR: Cross Stage Partial Network (CSPNet) as discussed by the authors integrates feature maps from the beginning and the end of a network stage to mitigate the problem of duplicate gradient information within network optimization.

...read moreread less

4.2K

•Journal Article•10.1007/S11263-019-01247-4

Deep Learning for Generic Object Detection: A Survey

Li Liu, +7 more

- 01 Feb 2020

- International Journal of Computer Vision

TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.

...read moreread less

2.9K

•Posted Content

Object Detection in 20 Years: A Survey

Zhengxia Zou, +3 more

- 13 May 2019

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper extensively reviews 400+ papers of object detection in the light of its technical evolution, spanning over a quarter-century's time (from the 1990s to 2019), and makes an in-deep analysis of their challenges as well as technical improvements in recent years.

...read moreread less

1.8K

•Journal Article•10.1016/J.ISPRSJPRS.2019.11.023

Object detection in optical remote sensing images: A survey and a new benchmark

Ke Li, +4 more

- 01 Jan 2020

- Isprs Journal of Photogrammetry and Remo...

TL;DR: A comprehensive review of the recent deep learning based object detection progress in both the computer vision and earth observation communities is provided and a large-scale, publicly available benchmark for object DetectIon in Optical Remote sensing images is proposed, which is named as DIOR.

...read moreread less

1.6K

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

102.6K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Journal Article•10.1109/TPAMI.2016.2577031

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

64.4K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

...

Expand

Light-Head R-CNN: In Defense of Two-Stage Object Detector.

Chat with Paper

AI Agents for this Paper

Citations

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

CSPNet: A New Backbone that can Enhance Learning Capability of CNN

Deep Learning for Generic Object Detection: A Survey

Object Detection in 20 Years: A Survey

Object detection in optical remote sensing images: A survey and a new benchmark

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Going deeper with convolutions

Related Papers (5)

SSD: Single Shot MultiBox Detector

Deep Residual Learning for Image Recognition

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Feature Pyramid Networks for Object Detection

Microsoft COCO: Common Objects in Context