Distance Metric Learning for Large Margin Nearest Neighbor Classification
5.6K
TL;DR: This paper shows how to learn a Mahalanobis distance metric for kNN classification from labeled examples in a globally integrated manner and finds that metrics trained in this way lead to significant improvements in kNN classification.
Abstract: The accuracy of k-nearest neighbor (kNN) classification depends significantly on the metric used to compute distances between different examples. In this paper, we show how to learn a Mahalanobis distance metric for kNN classification from labeled examples. The Mahalanobis metric can equivalently be viewed as a global linear transformation of the input space that precedes kNN classification using Euclidean distances. In our approach, the metric is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin. As in support vector machines (SVMs), the margin criterion leads to a convex optimization based on the hinge loss. Unlike learning in SVMs, however, our approach requires no modification or extension for problems in multiway (as opposed to binary) classification. In our framework, the Mahalanobis distance metric is obtained as the solution to a semidefinite program. On several data sets of varying size and difficulty, we find that metrics trained in this way lead to significant improvements in kNN classification. Sometimes these results can be further improved by clustering the training examples and learning an individual metric within each cluster. We show how to learn and combine these local metrics in a globally integrated manner.
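The equivalence stated in the abstract, between a Mahalanobis metric and a global linear transformation followed by Euclidean distance, can be checked numerically. The sketch below is illustrative only and is not the authors' implementation: the function names, the factorization M = L^T L, and the unit margin in the hinge term are assumptions chosen for the example, and the semidefinite program that actually learns the metric is not shown.

```python
import numpy as np


def mahalanobis_sq(x, y, M):
    """Squared Mahalanobis distance (x - y)^T M (x - y) for a PSD matrix M."""
    d = x - y
    return float(d @ M @ d)


def transformed_euclidean_sq(x, y, L):
    """Squared Euclidean distance after mapping both points through L."""
    d = L @ (x - y)
    return float(d @ d)


def triplet_hinge(x, same, other, L, margin=1.0):
    """Hinge penalty for one triplet (hypothetical helper, assumed unit margin).

    The penalty is zero once `other` (a differently labeled example) is at
    least `margin` farther from `x` than `same` (a same-class neighbor),
    measured in the space transformed by L.
    """
    d_same = transformed_euclidean_sq(x, same, L)
    d_other = transformed_euclidean_sq(x, other, L)
    return max(0.0, margin + d_same - d_other)


# Any real matrix L induces a positive semidefinite M = L^T L.
rng = np.random.default_rng(0)
L = rng.normal(size=(3, 3))
M = L.T @ L
x, y = rng.normal(size=3), rng.normal(size=3)

# The two views of the metric agree up to floating-point error.
assert np.isclose(mahalanobis_sq(x, y, M), transformed_euclidean_sq(x, y, L))
```

Because the hinge term is zero whenever the differently labeled example already lies outside the margin, only nearby violators contribute to the objective, which is the same mechanism that makes the SVM hinge loss sparse.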
Citations
A survey and analysis on automatic image annotation
TL;DR: A deep review of state-of-the-art AIA methods is presented, synthesizing 138 papers published during the past two decades, dividing AIA methods into five categories, and comparing their performance on benchmark datasets and standard evaluation metrics.
125
Fast neighborhood component analysis
TL;DR: Experimental results show that, compared with NCA, FNCA not only significantly increases the training speed but also obtains higher classification accuracy, and comparative studies with the state-of-the-art approaches on various real-world datasets also verify the effectiveness of the proposed linear and nonlinear FNCA methods.
125
Wasserstein Discriminant Analysis
TL;DR: It is shown that WDA leverages a mechanism that induces neighborhood preservation, and the optimization problem of WDA can be tackled using automatic differentiation of Sinkhorn’s fixed-point iterations.
ENN: Extended Nearest Neighbor Method for Pattern Recognition [Research Frontier]
TL;DR: In this paper, the extended nearest neighbor (ENN) method is proposed to predict input patterns according to the maximum gain of intra-class coherence, which considers not only which samples are the nearest neighbors of the test sample, but also which samples count the test sample among their own nearest neighbors.
124
A Survey of Techniques for the Identification of Mobile Phones Using the Physical Fingerprints of the Built-In Components
Gianmarco Baldini,Gary Steri +1 more
TL;DR: The potential privacy risks associated with tracking a mobile phone on the basis of its fingerprints, and the related mitigation techniques, are analyzed; the main issues, research opportunities, and potential future trends are identified.
124