Beyond spatial pooling: Fine-grained representation learning in multiple domains

doi:10.1109/cvpr.2015.7299125

Journal Article10.1109/cvpr.2015.7299125

Beyond spatial pooling: Fine-grained representation learning in multiple domains

Chi Li, +2 more

- 01 Jun 2015

pp 4913-4922

5

TL;DR: This paper forms a probabilistic framework for analyzing the performance of pooling, and applies multiple scales of filters coupled with different pooling granularities, and makes use of color as an additional pooling domain, thereby reducing the sensitivity to spatial deformations.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1117/1.JRS.11.042609

Comprehensive survey of deep learning in remote sensing: theories, tools, and challenges for the community

John E. Ball, +2 more

- 23 Sep 2017

- Journal of Applied Remote Sensing

TL;DR: In this article, the authors provide a comprehensive survey of state-of-the-art remote sensing deep learning research for remote sensing applications, focusing on theories, tools, and challenges for the remote sensing community.

...read moreread less

705

Journal Article•10.1007/S10044-018-0736-X

Fractal dimension of bag-of-visual words

Lucas Correia Ribas, +5 more

- 01 Feb 2019

- Pattern Analysis and Applications

TL;DR: This paper proposes a new method to describe the visual words using the fractal dimension through box-counting method, and the experimental results reveal that the proposed method leads to highly discriminative features of theVisual words.

...read moreread less

7

•Posted Content

cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey

Hirokatsu Kataoka, +11 more

- 26 May 2016

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This review focused on reading the ALL 602 conference papers presented at the CVPR2015, the premier annual computer vision event held in June 2015, and proposed "DeepSurvey" as a mechanism embodying the entire process from the reading through all the papers, the generation of ideas, and to the writing of paper.

...read moreread less

cvpaper.challenge in CVPR2015 -- A review of CVPR2015

Kataoka Hirokatsu, +11 more

- 14 Dec 2015

TL;DR: This challenge aims to simultaneously read papers and create documents for easy understanding top conference papers in Japanese in the fields of computer vision, image processing, pattern recognition and machine learning.

...read moreread less

References

•Journal Article•10.1145/3065386

ImageNet classification with deep convolutional neural networks

Alex Krizhevsky, +2 more

- 24 May 2017

- Communications of The ACM

TL;DR: A large, deep convolutional neural network was trained to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes and employed a recently developed regularization method called "dropout" that proved to be very effective.

...read moreread less

98.2K

•Proceedings Article•10.1109/CVPR.2006.68

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Svetlana Lazebnik, +2 more

- 17 Jun 2006

TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.

...read moreread less

9.2K

•Book Chapter•10.1007/978-3-319-10578-9_23

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Kaiming He, +3 more

- 18 Jun 2014

- arXiv: Computer Vision and Pattern Recog...

TL;DR: SPP-Net as mentioned in this paper proposes a spatial pyramid pooling strategy, which can generate a fixed-length representation regardless of image size/scale, and achieves state-of-the-art performance in object detection.

...read moreread less

8.6K

•Proceedings Article

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

Pierre Sermanet, +5 more

- 23 Feb 2014

TL;DR: In this article, a multiscale and sliding window approach is proposed to predict object boundaries, which is then accumulated rather than suppressed in order to increase detection confidence, and OverFeat is the winner of the ImageNet Large Scale Visual Recognition Challenge 2013.

...read moreread less

4.8K

•Proceedings Article

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

Jeff Donahue, +6 more

- 21 Jun 2014

TL;DR: DeCAF as discussed by the authors is an open-source implementation of these deep convolutional activation features, along with all associated network parameters, to enable vision researchers to conduct experimentation with deep representations across a range of visual concept learning paradigms.

...read moreread less

4.7K

...

Expand

Beyond spatial pooling: Fine-grained representation learning in multiple domains

Chat with Paper

AI Agents for this Paper

Citations

Comprehensive survey of deep learning in remote sensing: theories, tools, and challenges for the community

Fractal dimension of bag-of-visual words

cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey

cvpaper.challenge in CVPR2015 -- A review of CVPR2015

cvpaper.challenge in 2016: Futuristic Computer Vision through 1, 600 Papers Survey.

References

ImageNet classification with deep convolutional neural networks

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition