Pose pooling kernels for sub-category recognition

doi:10.1109/CVPR.2012.6248364

Proceedings Article10.1109/CVPR.2012.6248364

Pose pooling kernels for sub-category recognition

Ning Zhang, +2 more

- 16 Jun 2012

- pp 3665-3672

167

TL;DR: This work develops representations for poselet-based pose normalization using both explicit warping and implicit pooling as mechanisms and defines a pose normalized similarity or kernel function that is suitable for nearest-neighbor or kernel-based learning methods.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

CNN Features off-the-shelf: an Astounding Baseline for Recognition

Ali Sharif Razavian, +3 more

- 23 Mar 2014

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13 suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

...read moreread less

4.5K

•Proceedings Article•10.1109/CVPRW.2014.131

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

Ali Sharif Razavian, +3 more

- 23 Jun 2014

TL;DR: In this paper, features extracted from the OverFeat network are used as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets.

...read moreread less

4.4K

Proceedings Article•10.1109/ICCVW.2013.77

3D Object Representations for Fine-Grained Categorization

Jonathan Krause, +3 more

- 02 Dec 2013

TL;DR: This paper lifts two state-of-the-art 2D object representations to 3D, on the level of both local feature appearance and location, and shows their efficacy for estimating 3D geometry from images via ultra-wide baseline matching and 3D reconstruction.

...read moreread less

4.4K

Proceedings Article•10.1109/ICCV.2015.170

Bilinear CNN Models for Fine-Grained Visual Recognition

Tsung-Yu Lin, +2 more

- 07 Dec 2015

TL;DR: Blinear models, a recognition architecture that consists of two feature extractors whose outputs are multiplied using outer product at each location of the image and pooled to obtain an image descriptor, are proposed.

...read moreread less

2.4K

•Book Chapter•10.1007/978-3-319-10590-1_54

Part-Based R-CNNs for Fine-Grained Category Detection

Ning Zhang, +3 more

- 06 Sep 2014

TL;DR: In this article, the authors propose a model for fine-grained categorization by leveraging deep convolutional features computed on bottom-up region proposals, which learns whole-object and part detectors, enforces learned geometric constraints between them, and predicts a finegrained category from a pose normalized representation.

...read moreread less

1.3K

...

Expand

References

•Proceedings Article•10.1109/CVPR.2006.68

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Svetlana Lazebnik, +2 more

- 17 Jun 2006

TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.

...read moreread less

9.2K

•Journal Article•10.1109/34.927467

Active appearance models

Timothy F. Cootes, +2 more

- 01 Jun 2001

- IEEE Transactions on Pattern Analysis an...

Abstract: We describe a new method of matching statistical models of appearance to images. A set of model parameters control modes of shape and gray-level variation learned from a training set. We construct an efficient iterative matching algorithm by learning the relationship between perturbations in the model parameters and the induced image errors.

...read moreread less

6.4K

•Book Chapter•10.1007/BFB0054760

Active Appearance Models

Timothy F. Cootes, +2 more

- 02 Jun 1998

TL;DR: A novel method of interpreting images using an Active Appearance Model (AAM), a statistical model of the shape and grey-level appearance of the object of interest which can generalise to almost any valid example.

...read moreread less

6.4K

The Caltech-UCSD Birds-200-2011 Dataset

Catherine Wah, +4 more

- 01 Jul 2011

TL;DR: CUB-200-2011 as mentioned in this paper is an extended version of CUB200, which roughly doubles the number of images per category and adds new part localization annotations, annotated with bounding boxes, part locations, and at-ribute labels.

...read moreread less

5.6K

Proceedings Article•10.1109/ICVGIP.2008.47

Automated Flower Classification over a Large Number of Classes

M.-E. Nilsback, +1 more

- 16 Dec 2008

TL;DR: Results show that learning the optimum kernel combination of multiple features vastly improves the performance, from 55.1% for the best single feature to 72.8% forThe combination of all features.

...read moreread less

4.3K

...

Expand

Pose pooling kernels for sub-category recognition

Chat with Paper

AI Agents for this Paper

Citations

CNN Features off-the-shelf: an Astounding Baseline for Recognition

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

3D Object Representations for Fine-Grained Categorization

Bilinear CNN Models for Fine-Grained Visual Recognition

Part-Based R-CNNs for Fine-Grained Category Detection

References

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Active appearance models

Active Appearance Models

The Caltech-UCSD Birds-200-2011 Dataset

Automated Flower Classification over a Large Number of Classes

Related Papers (5)

The Caltech-UCSD Birds-200-2011 Dataset

Object Detection with Discriminatively Trained Part-Based Models

ImageNet Classification with Deep Convolutional Neural Networks

Cats and dogs

Histograms of oriented gradients for human detection