In defense of Nearest-Neighbor based image classification

doi:10.1109/CVPR.2008.4587598

Proceedings Article•10.1109/CVPR.2008.4587598

Oren Boiman, +2 more

- 23 Jun 2008

- pp 1-8

1.3K

TL;DR: It is argued that two practices commonly used in image classification methods have led to the inferior performance of NN-based image classifiers: quantization of local image descriptors (used to generate "bags-of-words", codebooks) and computation of 'image-to-image' distance instead of 'image-to-class' distance.

Abstract: State-of-the-art image classification methods require an intensive learning/training stage (using SVM, Boosting, etc.). In contrast, non-parametric nearest-neighbor (NN) based image classifiers require no training time and have other favorable properties. However, the large performance gap between these two families of approaches rendered NN-based image classifiers useless. We claim that the effectiveness of non-parametric NN-based image classification has been considerably undervalued. We argue that two practices commonly used in image classification methods have led to the inferior performance of NN-based image classifiers: (i) Quantization of local image descriptors (used to generate "bags-of-words", codebooks). (ii) Computation of 'image-to-image' distance, instead of 'image-to-class' distance. We propose a trivial NN-based classifier, NBNN (Naive-Bayes nearest-neighbor), which employs NN-distances in the space of the local image descriptors (and not in the space of images). NBNN computes direct 'image-to-class' distances without descriptor quantization. We further show that under the Naive-Bayes assumption, the theoretically optimal image classifier can be accurately approximated by NBNN. Although NBNN is extremely simple, efficient, and requires no learning/training phase, its performance ranks among the top leading learning-based image classifiers. Empirical comparisons are shown on several challenging databases (Caltech-101, Caltech-256 and Graz-01).
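The 'image-to-class' distance described in the abstract can be illustrated with a minimal sketch. It assumes each class is represented by the pooled, unquantized local descriptors of its labeled images, and that the image-to-class distance is the sum, over the query image's descriptors, of the squared Euclidean distance to the nearest descriptor of that class. The function names, the toy data, and the brute-force search are illustrative assumptions, not the authors' implementation; a practical system would likely use an approximate nearest-neighbor index instead.

```python
import numpy as np


def image_to_class_distance(query_descriptors, class_descriptors):
    """Sum of squared NN distances from each query descriptor to one class.

    query_descriptors: array of shape (n_query, dim)
    class_descriptors: array of shape (n_class, dim), pooled over all
        labeled images of the class (no quantization, no codebook).
    """
    # Pairwise squared Euclidean distances, shape (n_query, n_class).
    diffs = query_descriptors[:, None, :] - class_descriptors[None, :, :]
    sq_dists = np.sum(diffs ** 2, axis=2)
    # Nearest class descriptor for every query descriptor, then sum.
    return float(np.sum(sq_dists.min(axis=1)))


def nbnn_classify(query_descriptors, descriptors_by_class):
    """Return the label with the smallest image-to-class distance."""
    distances = {
        label: image_to_class_distance(query_descriptors, descs)
        for label, descs in descriptors_by_class.items()
    }
    return min(distances, key=distances.get), distances


# Toy data (hypothetical): two classes of 4-D descriptors around 0 and 1.
rng = np.random.default_rng(0)
descriptors_by_class = {
    "a": rng.normal(loc=0.0, scale=0.1, size=(50, 4)),
    "b": rng.normal(loc=1.0, scale=0.1, size=(50, 4)),
}
query = rng.normal(loc=1.0, scale=0.1, size=(10, 4))
predicted, distances = nbnn_classify(query, descriptors_by_class)
print(predicted, distances)
```

Note that nothing is trained here: the labeled descriptors are stored as-is, and all computation happens at query time in descriptor space rather than in the space of whole images.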

Citations

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

75.9K

Book

Computer Vision: Algorithms and Applications

Richard Szeliski

- 30 Sep 2010

TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.

5.4K

Proceedings Article•10.1109/CVPR.2010.5540018

Locality-constrained Linear Coding for image classification

Jinjun Wang, +5 more

- 13 Jun 2010

TL;DR: This paper presents a simple but effective coding scheme called Locality-constrained Linear Coding (LLC) in place of the VQ coding in traditional SPM, using the locality constraints to project each descriptor into its local-coordinate system, and the projected coordinates are integrated by max pooling to generate the final representation.

3.7K

Book Chapter•10.1007/978-3-642-15561-1_16

Adapting visual category models to new domains

Kate Saenko, +3 more

- 05 Sep 2010

TL;DR: This paper introduces a method that adapts object models acquired in a particular visual domain to new imaging conditions by learning a transformation that minimizes the effect of domain-induced changes in the feature distribution.

3.5K

Proceedings Article•10.1109/CVPR.2009.5206757

Linear spatial pyramid matching using sparse coding for image classification

Jianchao Yang, +3 more

- 20 Jun 2009

TL;DR: An extension of the SPM method is developed, by generalizing vector quantization to sparse coding followed by multi-scale spatial max pooling, and a linear SPM kernel based on SIFT sparse codes is proposed, leading to state-of-the-art performance on several benchmarks by using a single type of descriptors.

3.4K

References

Proceedings Article•10.1109/ICCV.2007.4409065

Support Kernel Machines for Object Recognition

A. Kumar, +1 more

- 26 Dec 2007

TL;DR: Recent kernel learning techniques are exploited that show how learning SKMs can be formulated as a convex optimization problem, which can be solved efficiently using Sequential Minimal Optimization.

86

Journal Article•10.1016/J.IMAVIS.2004.03.012

Unifying statistical texture classification frameworks

Manik Varma, +1 more

- 01 Dec 2004

- Image and Vision Computing

TL;DR: There is a correspondence between the two common representations of filter outputs (textons and binned histograms), and it is shown that two classification methodologies, nearest neighbour matching and Bayesian classification, are equivalent for particular choices of the distance measure.

79

Journal Article•10.1137/S0097539799366340

Expected-Case Complexity of Approximate Nearest Neighbor Searching

Sunil Arya, +1 more

- 01 Mar 2003

- SIAM Journal on Computing

TL;DR: It is shown that with a simple partition tree, called the sliding-midpoint tree, it is possible to achieve linear space and logarithmic query time in the expected case; in contrast, the data structures known to achieve linear space and logarithmic query time in the worst case are complex, and algorithms on them run more slowly in practice.

31

Proceedings Article•10.5555/338219.338583

Expected-case complexity of approximate nearest neighbor searching

Sunil Arya, +1 more

- 01 Feb 2000

TL;DR: In this paper, a simple partition tree, called the sliding-midpoint tree, is proposed to achieve linear space and logarithmic query time in the expected case, which is the best known algorithm for the approximate nearest neighbor problem.

18

Proceedings Article•10.1109/CVPR.2004.312

Class-Based Matching of Object Parts

Evgeniy Bart, +1 more

- 27 Jun 2004

TL;DR: A novel technique is developed for class-based matching of object parts across large changes in viewing conditions; it is based on the equivalence of corresponding features in different viewing conditions and is not restricted to planar components or affine transformations.

14