Proceedings Article10.1109/CVPR.2008.4587598
In defense of Nearest-Neighbor based image classification
Oren Boiman,E. Shechtman,Michal Irani +2 more
- 23 Jun 2008
- pp 1-8
TL;DR: It is argued that two practices commonly used in image classification methods, have led to the inferior performance of NN-based image classifiers: Quantization of local image descriptors (used to generate "bags-of-words ", codebooks) and Computation of 'image-to-image' distance, instead of ' image- to-class' distance.
read more
Abstract: State-of-the-art image classification methods require an intensive learning/training stage (using SVM, Boosting, etc.) In contrast, non-parametric nearest-neighbor (NN) based image classifiers require no training time and have other favorable properties. However, the large performance gap between these two families of approaches rendered NN-based image classifiers useless. We claim that the effectiveness of non-parametric NN-based image classification has been considerably undervalued. We argue that two practices commonly used in image classification methods, have led to the inferior performance of NN-based image classifiers: (i) Quantization of local image descriptors (used to generate "bags-of-words ", codebooks). (ii) Computation of 'image-to-image' distance, instead of 'image-to-class' distance. We propose a trivial NN-based classifier - NBNN, (Naive-Bayes nearest-neighbor), which employs NN- distances in the space of the local image descriptors (and not in the space of images). NBNN computes direct 'image- to-class' distances without descriptor quantization. We further show that under the Naive-Bayes assumption, the theoretically optimal image classifier can be accurately approximated by NBNN. Although NBNN is extremely simple, efficient, and requires no learning/training phase, its performance ranks among the top leading learning-based image classifiers. Empirical comparisons are shown on several challenging databases (Caltech-101 ,Caltech-256 and Graz-01).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
ImageNet: A large-scale hierarchical image database
Jia Deng,Wei Dong,Richard Socher,Li-Jia Li,Kai Li,Li Fei-Fei +5 more
- 20 Jun 2009
TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
•Book
Computer Vision: Algorithms and Applications
Richard Szeliski
- 30 Sep 2010
TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.
Locality-constrained Linear Coding for image classification
Jinjun Wang,Jianchao Yang,Kai Yu,Fengjun Lv,Thomas S. Huang,Yihong Gong +5 more
- 13 Jun 2010
TL;DR: This paper presents a simple but effective coding scheme called Locality-constrained Linear Coding (LLC) in place of the VQ coding in traditional SPM, using the locality constraints to project each descriptor into its local-coordinate system, and the projected coordinates are integrated by max pooling to generate the final representation.
3.7K
Adapting visual category models to new domains
Kate Saenko,Brian Kulis,Mario Fritz,Trevor Darrell +3 more
- 05 Sep 2010
TL;DR: This paper introduces a method that adapts object models acquired in a particular visual domain to new imaging conditions by learning a transformation that minimizes the effect of domain-induced changes in the feature distribution.
Linear spatial pyramid matching using sparse coding for image classification
Jianchao Yang,Kai Yu,Yihong Gong,Thomas S. Huang +3 more
- 20 Jun 2009
TL;DR: An extension of the SPM method is developed, by generalizing vector quantization to sparse coding followed by multi-scale spatial max pooling, and a linear SPM kernel based on SIFT sparse codes is proposed, leading to state-of-the-art performance on several benchmarks by using a single type of descriptors.
References
Support Kernel Machines for Object Recognition
A. Kumar,Cristian Sminchisescu +1 more
- 26 Dec 2007
TL;DR: Recent kernel learning techniques are exploited that show how learning SKMs can be formulated as a convex optimization problem, which can be solved efficiently using Sequential Minimal Optimization.
86
Unifying statistical texture classification frameworks
Manik Varma,Andrew Zisserman +1 more
TL;DR: There is a correspondence between the two common representations of filter outputs—textons and binned histograms and it is shown that two classification methodologies, nearest neighbour matching and Bayesian classification, are equivalent for particular choices of the distance measure.
79
Expected-Case Complexity of Approximate Nearest Neighbor Searching
Sunil Arya,Ho-Yam Addy Fu +1 more
TL;DR: It is shown that with a simple partition tree, called the sliding-midpoint tree, it is possible to achieve linear space and logarithmic query time in the expected case; in contrast, the data structures known to achievelinear space and Logarithic queryTime in the worst case are complex, and algorithms on them run more slowly in practice.
Expected-case complexity of approximate nearest neighbor searching
Sunil Arya,Ho-Yam Addy Fu +1 more
- 01 Feb 2000
TL;DR: In this paper, a simple partition tree, called the sliding-midpoint tree, is proposed to achieve linear space and logarithmic query time in the expected case, which is the best known algorithm for the approximate nearest neighbor problem.
18
Class-Based Matching of Object Parts
Evgeniy Bart,Shimon Ullman +1 more
- 27 Jun 2004
TL;DR: A novel technique for class-based matching of object parts across large changes in viewing conditions based on using the equivalence of corresponding features in different viewing conditions is developed, not restricted to planar components or affine transformations.