Open AccessPosted Content
R-FCN: Object Detection via Region-based Fully Convolutional Networks
TL;DR: This work presents region-based, fully convolutional networks for accurate and efficient object detection, and proposes position-sensitive score maps to address a dilemma between translation-invariance in image classification and translation-variance in object detection.
read more
Abstract: We present region-based, fully convolutional networks for accurate and efficient object detection. In contrast to previous region-based detectors such as Fast/Faster R-CNN that apply a costly per-region subnetwork hundreds of times, our region-based detector is fully convolutional with almost all computation shared on the entire image. To achieve this goal, we propose position-sensitive score maps to address a dilemma between translation-invariance in image classification and translation-variance in object detection. Our method can thus naturally adopt fully convolutional image classifier backbones, such as the latest Residual Networks (ResNets), for object detection. We show competitive results on the PASCAL VOC datasets (e.g., 83.6% mAP on the 2007 set) with the 101-layer ResNet. Meanwhile, our result is achieved at a test-time speed of 170ms per image, 2.5-20x faster than the Faster R-CNN counterpart. Code is made publicly available at: this https URL
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Pruning Blocks for CNN Compression and Acceleration via Online Ensemble Distillation
TL;DR: This paper proposes an online ensemble distillation method to automatically prune blocks/layers of a target network by transferring the knowledge from a strong teacher in an end-to-end manner, and employs the fast iterative shrinkage-thresholding algorithm to fast and reliably remove the redundant blocks.
25
Training a deep learning architecture for vehicle detection using limited heterogeneous traffic data
Deepak Mittal,Avinash Reddy,Gitakrishnan Ramadurai,Kaushik Mitra,Balaraman Ravindran +4 more
- 01 Jan 2018
TL;DR: This work combines an existing large general (non-traffic) dataset with a small low-resolution heterogeneous traffic dataset and obtains state-of-the-art vehicle detection performance.
25
Focal loss dense detector for vehicle surveillance
Xiaoliang Wang,Peng Cheng,Xinchuan Liu,Benedict M. Uzochukwu +3 more
- 02 Apr 2018
TL;DR: In this article, focal loss based RetinaNet is utilized to match the speed of regular one-stage detectors and also defeat two-stage detector in accuracy for vehicle detection, achieving state-of-the-art performance.
25
TDIOT: Target-Driven Inference for Deep Video Object Tracking
TL;DR: TDIOT as mentioned in this paper applies an appearance similarity-based temporal matching for data association and incorporates a local search and matching module into the inference head layer that exploits SiamFC to tackle tracking discontinuities, and a scale adaptive region proposal network that enables to search for the target at an adaptively enlarged spatial neighborhood specified by the trace of the target.
25
Hands and Faces, Fast: Mono-Camera User Detection Robust Enough to Directly Control a UAV in Flight
Sepehr MohaimenianPour,Richard T. Vaughan +1 more
- 01 Oct 2018
TL;DR: This work presents a robust real-time system for simultaneous detection of hands and faces in RGB and gray-scale images, and a novel dataset used for training that gives state of the art accuracy and speed in a hand detection benchmark and competitive results in a face detection benchmark.
25
References
•Proceedings Article
Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs
Liang-Chieh Chen,George Papandreou,Iasonas Kokkinos,Kevin Murphy,Alan L. Yuille +4 more
- 07 May 2015
TL;DR: DeepLab as mentioned in this paper combines the responses at the final layer with a fully connected CRF to localize segment boundaries at a level of accuracy beyond previous methods, achieving 71.6% IOU accuracy in the test set.
2.4K
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Sean Bell,C. Lawrence Zitnick,Kavita Bala,Ross Girshick +3 more
- 01 Jun 2016
TL;DR: The Inside-Outside Net (ION), an object detector that exploits information both inside and outside the region of interest, provides strong evidence that context and multi-scale representations improve small object detection.
•Posted Content
Instance-sensitive Fully Convolutional Networks
TL;DR: This paper develops FCNs that are capable of proposing instance-level segment candidates that do not have any high-dimensional layer related to the mask resolution, but instead exploits image local coherence for estimating instances.
288
Related Papers (5)
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
Karen Simonyan,Andrew Zisserman +1 more
- 04 Sep 2014