Proceedings Article10.1109/CVPR.2012.6248364
Pose pooling kernels for sub-category recognition
Ning Zhang,Ryan Farrell,Trever Darrell +2 more
- 16 Jun 2012
- pp 3665-3672
167
TL;DR: This work develops representations for poselet-based pose normalization using both explicit warping and implicit pooling as mechanisms and defines a pose normalized similarity or kernel function that is suitable for nearest-neighbor or kernel-based learning methods.
read more
Abstract: The ability to normalize pose based on super-category landmarks can significantly improve models of individual categories when training data are limited. Previous methods have considered the use of volumetric or morphable models for faces and for certain classes of articulated objects. We consider methods which impose fewer representational assumptions on categories of interest, and exploit contemporary detection schemes which consider the ensemble of responses of detectors trained for specific posekeypoint configurations. We develop representations for poselet-based pose normalization using both explicit warping and implicit pooling as mechanisms. Our method defines a pose normalized similarity or kernel function that is suitable for nearest-neighbor or kernel-based learning methods.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
CNN Features off-the-shelf: an Astounding Baseline for Recognition
TL;DR: A series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13 suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.
4.5K
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
Ali Sharif Razavian,Hossein Azizpour,Josephine Sullivan,Stefan Carlsson +3 more
- 23 Jun 2014
TL;DR: In this paper, features extracted from the OverFeat network are used as a generic image representation to tackle the diverse range of recognition tasks of object image classification, scene recognition, fine grained recognition, attribute detection and image retrieval applied to a diverse set of datasets.
3D Object Representations for Fine-Grained Categorization
Jonathan Krause,Michael Stark,Jia Deng,Li Fei-Fei +3 more
- 02 Dec 2013
TL;DR: This paper lifts two state-of-the-art 2D object representations to 3D, on the level of both local feature appearance and location, and shows their efficacy for estimating 3D geometry from images via ultra-wide baseline matching and 3D reconstruction.
Bilinear CNN Models for Fine-Grained Visual Recognition
Tsung-Yu Lin,Aruni RoyChowdhury,Subhransu Maji +2 more
- 07 Dec 2015
TL;DR: Blinear models, a recognition architecture that consists of two feature extractors whose outputs are multiplied using outer product at each location of the image and pooled to obtain an image descriptor, are proposed.
Part-Based R-CNNs for Fine-Grained Category Detection
Ning Zhang,Jeff Donahue,Ross Girshick,Trevor Darrell +3 more
- 06 Sep 2014
TL;DR: In this article, the authors propose a model for fine-grained categorization by leveraging deep convolutional features computed on bottom-up region proposals, which learns whole-object and part detectors, enforces learned geometric constraints between them, and predicts a finegrained category from a pose normalized representation.
References
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
Svetlana Lazebnik,Cordelia Schmid,Jean Ponce +2 more
- 17 Jun 2006
TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.
Active appearance models
Abstract: We describe a new method of matching statistical models of appearance to images. A set of model parameters control modes of shape and gray-level variation learned from a training set. We construct an efficient iterative matching algorithm by learning the relationship between perturbations in the model parameters and the induced image errors.
Active Appearance Models
Timothy F. Cootes,G.J. Edwards,Christopher J. Taylor +2 more
- 02 Jun 1998
TL;DR: A novel method of interpreting images using an Active Appearance Model (AAM), a statistical model of the shape and grey-level appearance of the object of interest which can generalise to almost any valid example.
The Caltech-UCSD Birds-200-2011 Dataset
Catherine Wah,Steve Branson,Peter Welinder,Pietro Perona,Serge Belongie +4 more
- 01 Jul 2011
TL;DR: CUB-200-2011 as mentioned in this paper is an extended version of CUB200, which roughly doubles the number of images per category and adds new part localization annotations, annotated with bounding boxes, part locations, and at-ribute labels.
Automated Flower Classification over a Large Number of Classes
M.-E. Nilsback,Andrew Zisserman +1 more
- 16 Dec 2008
TL;DR: Results show that learning the optimum kernel combination of multiple features vastly improves the performance, from 55.1% for the best single feature to 72.8% forThe combination of all features.
4.3K
Related Papers (5)
[...]
Omkar M. Parkhi,Andrea Vedaldi,Andrew Zisserman,C. V. Jawahar +3 more
- 16 Jun 2012
Navneet Dalal,Bill Triggs +1 more
- 20 Jun 2005