Fisher kernel

Topic Tools

Papers published on a yearly basis

Papers

Proceedings Article•10.1109/CVPR.2010.5540039•

Aggregating local descriptors into a compact image representation

[...]

Herve Jegou¹, Matthijs Douze¹, Cordelia Schmid¹, Patrick Pérez•Institutions (1)

French Institute for Research in Computer Science and Automation¹

13 Jun 2010

TL;DR: This work proposes a simple yet efficient way of aggregating local image descriptors into a vector of limited dimension, which can be viewed as a simplification of the Fisher kernel representation, and shows how to jointly optimize the dimension reduction and the indexing algorithm.

...read moreread less

Abstract: We address the problem of image search on a very large scale, where three constraints have to be considered jointly: the accuracy of the search, its efficiency, and the memory usage of the representation. We first propose a simple yet efficient way of aggregating local image descriptors into a vector of limited dimension, which can be viewed as a simplification of the Fisher kernel representation. We then show how to jointly optimize the dimension reduction and the indexing algorithm, so that it best preserves the quality of vector comparison. The evaluation shows that our approach significantly outperforms the state of the art: the search accuracy is comparable to the bag-of-features approach for an image representation that fits in 20 bytes. Searching a 10 million image dataset takes about 50ms.

...read moreread less

3,226 citations

Book Chapter•10.1007/978-3-642-15561-1_11•

Improving the fisher kernel for large-scale image classification

[...]

Florent Perronnin¹, Jorge Sanchez¹, Thomas Mensink¹•Institutions (1)

Xerox¹

5 Sep 2010

TL;DR: In an evaluation involving hundreds of thousands of training images, it is shown that classifiers learned on Flickr groups perform surprisingly well and that they can complement classifier learned on more carefully annotated datasets.

...read moreread less

Abstract: The Fisher kernel (FK) is a generic framework which combines the benefits of generative and discriminative approaches. In the context of image classification the FK was shown to extend the popular bag-of-visual-words (BOV) by going beyond count statistics. However, in practice, this enriched representation has not yet shown its superiority over the BOV. In the first part we show that with several well-motivated modifications over the original framework we can boost the accuracy of the FK. On PASCAL VOC 2007 we increase the Average Precision (AP) from 47.9% to 58.3%. Similarly, we demonstrate state-of-the-art accuracy on CalTech 256. A major advantage is that these results are obtained using only SIFT descriptors and costless linear classifiers. Equipped with this representation, we can now explore image classification on a larger scale. In the second part, as an application, we compare two abundant resources of labeled images to learn classifiers: ImageNet and Flickr groups. In an evaluation involving hundreds of thousands of training images we show that classifiers learned on Flickr groups perform surprisingly well (although they were not intended for this purpose) and that they can complement classifiers learned on more carefully annotated datasets.

...read moreread less

3,220 citations

Proceedings Article•10.1109/NNSP.1999.788121•

Fisher discriminant analysis with kernels

[...]

Sebastian Mika, Gunnar Rätsch¹, Jason Weston¹, Bernhard Schölkopf¹, K.R. Mullers² - Show less +1 more•Institutions (2)

Max Planck Society¹, Fraunhofer Institute for Open Communication Systems²

23 Aug 1999

TL;DR: In this article, a non-linear classification technique based on Fisher's discriminant is proposed and the main ingredient is the kernel trick which allows the efficient computation of Fisher discriminant in feature space.

...read moreread less

Abstract: A non-linear classification technique based on Fisher's discriminant is proposed. The main ingredient is the kernel trick which allows the efficient computation of Fisher discriminant in feature space. The linear classification in feature space corresponds to a (powerful) non-linear decision function in input space. Large scale simulations demonstrate the competitiveness of our approach.

...read moreread less

3,144 citations

Proceedings Article•10.1109/CVPR.2007.383266•

Fisher Kernels on Visual Vocabularies for Image Categorization

[...]

Florent Perronnin¹, Christopher R. Dance¹•Institutions (1)

Xerox¹

17 Jun 2007

TL;DR: This work shows that Fisher kernels can actually be understood as an extension of the popular bag-of-visterms, and proposes to apply this framework to image categorization where the input signals are images and where the underlying generative model is a visual vocabulary: a Gaussian mixture model which approximates the distribution of low-level features in images.

...read moreread less

Abstract: Within the field of pattern classification, the Fisher kernel is a powerful framework which combines the strengths of generative and discriminative approaches. The idea is to characterize a signal with a gradient vector derived from a generative probability model and to subsequently feed this representation to a discriminative classifier. We propose to apply this framework to image categorization where the input signals are images and where the underlying generative model is a visual vocabulary: a Gaussian mixture model which approximates the distribution of low-level features in images. We show that Fisher kernels can actually be understood as an extension of the popular bag-of-visterms. Our approach demonstrates excellent performance on two challenging databases: an in-house database of 19 object/scene categories and the recently released VOC 2006 database. It is also very practical: it has low computational needs both at training and test time and vocabularies trained on one set of categories can be applied to another set without any significant loss in performance.

...read moreread less

2,086 citations

Proceedings Article•10.1145/1273496.1273592•

Self-taught learning: transfer learning from unlabeled data

[...]

Rajat Raina¹, Alexis Battle¹, Honglak Lee¹, Benjamin Packer¹, Andrew Y. Ng¹ - Show less +1 more•Institutions (1)

Stanford University¹

20 Jun 2007

TL;DR: An approach to self-taught learning that uses sparse coding to construct higher-level features using the unlabeled data to form a succinct input representation and significantly improve classification performance.

...read moreread less

Abstract: We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabeled data follows the same class labels or generative distribution as the labeled data. Thus, we would like to use a large number of unlabeled images (or audio samples, or text documents) randomly downloaded from the Internet to improve performance on a given image (or audio, or text) classification task. Such unlabeled data is significantly easier to obtain than in typical semi-supervised or transfer learning settings, making self-taught learning widely applicable to many practical learning problems. We describe an approach to self-taught learning that uses sparse coding to construct higher-level features using the unlabeled data. These features form a succinct input representation and significantly improve classification performance. When using an SVM for classification, we further show how a Fisher kernel can be learned for this representation.

...read moreread less

1,970 citations

...

Expand

Year	Papers
2025	1
2024	6
2023	12
2022	28
2021	3
2020	16

Topic Tools

Papers published on a yearly basis

Papers

Aggregating local descriptors into a compact image representation

Improving the fisher kernel for large-scale image classification

Fisher discriminant analysis with kernels

Fisher Kernels on Visual Vocabularies for Image Categorization

Self-taught learning: transfer learning from unlabeled data

Related Topics (5)

Performance Metrics