Open Access
High-Dimensional Pattern Recognition using Low-Dimensional Embedding and Earth Mover's Distance
Linh Lieu,Naoki Saito +1 more
- 01 Jan 2009
TL;DR: An algorithm is proposed that combines existing techniques in a novel way to do classification of datasets consisting of high-dimension data (e.g., sets of signals or images) and sets up a framework for application of the Earth Mover’s Distance as a discriminant between datasets.
read more
Abstract: We propose an algorithm that combines existing techniques in a novel way to do classification of datasets consisting of high-dimension al data (e.g., sets of signals or images). Furthermore, our algorithm sets up a framework for application of the Earth Mover’s Distance (EMD) [1, 2] as a discriminant m easure between datasets. We show how to prepare a compact representation ‐ a signature ‐ for each dataset so that computation of EMD between datasets can be done efficiently. This signature-construction step requires the tasks of dim ension reduction, automatic determination of the data’s intrinsic dimensionalit y, out-of-sample extension, and point clustering. We will show how to apply some existing methods (which include Laplacian eigenmaps [3, 4, 5], diffusion maps framework [6, 7, 8], and elongated K-means [9]) to perform these tasks successfully. We will also provide two examples of applications of our proposed algorithm.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Journal Article
Nonlinear dimension reduction via local tangent space alignment
Zhenyue Zhang,Hongyuan Zha +1 more
TL;DR: In this article, the local geometry of the manifold is represented by tangent spaces learned by fitting an affine subspace in a neighborhood of each data point, aligned to give the internal global coordinates of the data points with respect to the underlying manifold by way of a partial eigendecomposition of the neighborhood connection matrix.
21
References
•Book
Graph theory
Frank Harary
- 01 Jan 1969
TL;DR: This project focuses on Tutte’s work in cryptography, which enabled the British to read high-level German army messages and has been described as one of the greatest intellectual feats of the war.
18K
Nonlinear dimensionality reduction by locally linear embedding.
Sam T. Roweis,Lawrence K. Saul +1 more
TL;DR: Locally linear embedding (LLE) is introduced, an unsupervised learning algorithm that computes low-dimensional, neighborhood-preserving embeddings of high-dimensional inputs that learns the global structure of nonlinear manifolds.
Normalized cuts and image segmentation
Jianbo Shi,Jitendra Malik +1 more
TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
Normalized cuts and image segmentation
Jianbo Shi,Jitendra Malik +1 more
- 17 Jun 1997
TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
A tutorial on spectral clustering
TL;DR: In this article, the authors present the most common spectral clustering algorithms, and derive those algorithms from scratch by several different approaches, and discuss the advantages and disadvantages of these algorithms.