Distance Metric Learning for Large Margin Nearest Neighbor Classification

doi:10.5555/1577069.1577078

Open Access • Journal Article • 10.5555/1577069.1577078

Kilian Q. Weinberger, +1 more

- 01 Dec 2009

- Journal of Machine Learning Research

- Vol. 10, Iss: 9, pp 207-244

5.6K

TL;DR: This paper shows how to learn a Mahalanobis distance metric for kNN classification from labeled examples, and finds that metrics trained in this way lead to significant improvements in kNN classification.

Abstract: The accuracy of k-nearest neighbor (kNN) classification depends significantly on the metric used to compute distances between different examples. In this paper, we show how to learn a Mahalanobis distance metric for kNN classification from labeled examples. The Mahalanobis metric can equivalently be viewed as a global linear transformation of the input space that precedes kNN classification using Euclidean distances. In our approach, the metric is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin. As in support vector machines (SVMs), the margin criterion leads to a convex optimization based on the hinge loss. Unlike learning in SVMs, however, our approach requires no modification or extension for problems in multiway (as opposed to binary) classification. In our framework, the Mahalanobis distance metric is obtained as the solution to a semidefinite program. On several data sets of varying size and difficulty, we find that metrics trained in this way lead to significant improvements in kNN classification. Sometimes these results can be further improved by clustering the training examples and learning an individual metric within each cluster. We show how to learn and combine these local metrics in a globally integrated manner.
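The two central ideas in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: it optimizes nothing and only evaluates quantities. The function names, the `mu` weight between the two terms, the unit `margin`, and the choice of target neighbors as the k nearest same-class points under plain Euclidean distance are all assumptions made for the example. The first function shows that a Mahalanobis distance with M = LᵀL equals a squared Euclidean distance after the linear map x → Lx. The second evaluates a hinge-style objective that pulls target neighbors close and penalizes differently labeled points that come within the margin.

```python
import numpy as np


def mahalanobis_sq(x, y, L):
    """Squared Mahalanobis distance for M = L^T L, computed as the
    squared Euclidean distance between L @ x and L @ y."""
    diff = L @ (x - y)
    return float(diff @ diff)


def large_margin_loss(X, labels, L, k=1, mu=0.5, margin=1.0):
    """Evaluate a pull/push hinge objective for a linear map L.

    Assumptions (hypothetical, for illustration only):
      * target neighbors are the k nearest same-class points under
        Euclidean distance in the original input space;
      * `mu` trades off the pull term against the push term.
    """
    n = len(X)
    Z = X @ L.T  # every example mapped through L

    # Pairwise squared distances in the transformed and original spaces.
    d_new = ((Z[:, None, :] - Z[None, :, :]) ** 2).sum(axis=-1)
    d_old = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)

    pull, push = 0.0, 0.0
    for i in range(n):
        same = np.flatnonzero((labels == labels[i]) & (np.arange(n) != i))
        other = np.flatnonzero(labels != labels[i])
        targets = same[np.argsort(d_old[i, same])[:k]]
        for j in targets:
            # Pull term: target neighbors should be close.
            pull += d_new[i, j]
            # Push term: hinge penalty for each differently labeled point
            # that is not at least `margin` farther away than neighbor j.
            push += np.maximum(0.0, margin + d_new[i, j] - d_new[i, other]).sum()
    return (1.0 - mu) * pull + mu * push
```

Because the hinge term is zero whenever differently labeled points already lie outside the margin, only examples near class boundaries contribute to the push penalty. The abstract's semidefinite program is not reproduced here.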

Citations

•Journal Article•10.1093/NAR/GKAB829

BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.

Hong-Liang Li, +2 more

- 28 Sep 2021

- Nucleic Acids Research

TL;DR: This article discusses 155 biological language models (BLMs) for DNA, RNA and protein sequence analysis, which can extract the linguistic properties of the 'book of life', and extends them into a system called BioSeq-BLM for automatically representing and analyzing sequence data.

143

•Journal Article•10.1016/J.PATREC.2015.09.010

A survey on representation-based classification and detection in hyperspectral remote sensing imagery

Wei Li, +1 more

- 01 Nov 2016

- Pattern Recognition Letters

TL;DR: This paper reviews state-of-the-art representation-based classification and detection approaches for hyperspectral remote sensing imagery, including sparse representation-based classification (SRC), collaborative representation-based classification (CRC), and their extensions.

143

•Proceedings Article

Multilingual Distributed Representations without Word Alignment

Karl Moritz Hermann, +1 more

- 01 Jan 2014

TL;DR: The authors proposed a method for learning distributed representations in a multilingual setup, which learns to assign similar embeddings to aligned sentences and dissimilar ones to sentences which are not aligned while not requiring word alignments.

142

•Book•10.1007/978-3-319-50478-0

Machine Learning for Health Informatics: State-of-the-Art and Future Challenges: Lecture Notes in Artificial Intelligence

Andreas Holzinger

- 12 Dec 2016

TL;DR: Successful application of ML for HI needs an integrated approach, fostering a concerted effort of four areas: (1) data science, (2) algorithms (with a focus on networks and topology (structure), and entropy (time)), (3) data visualization, and last but not least (4) privacy, data protection, safety & security.

142

•Proceedings Article•10.1109/BTAS.2016.7791205

Triplet Probabilistic Embedding for Face Verification and Clustering

Swami Sankaranarayanan, +3 more

- 19 Apr 2016

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes an approach that couples a deep CNN-based approach with a low-dimensional discriminative embedding step, learned using triplet probability constraints to address the unconstrained face verification problem.

142


Related Papers (5)

FaceNet: A unified embedding for face recognition and clustering

Deep Residual Learning for Image Recognition

Learning a similarity metric discriminatively, with application to face verification

ImageNet Classification with Deep Convolutional Neural Networks

Dimensionality Reduction by Learning an Invariant Mapping