Open AccessPosted Content
Ground Metric Learning
Marco Cuturi,David Avis +1 more
TL;DR: The problem of learning the ground metric is formulated as the minimization of the difference of two convex polyhedral functions over a convex set of metric matrices and it is shown that this approach is useful both for retrieval and binary/multiclass classification tasks.
read more
Abstract: Transportation distances have been used for more than a decade now in machine learning to compare histograms of features. They have one parameter: the ground metric, which can be any metric between the features themselves. As is the case for all parameterized distances, transportation distances can only prove useful in practice when this parameter is carefully chosen. To date, the only option available to practitioners to set the ground metric parameter was to rely on a priori knowledge of the features, which limited considerably the scope of application of transportation distances. We propose to lift this limitation and consider instead algorithms that can learn the ground metric using only a training set of labeled histograms. We call this approach ground metric learning. We formulate the problem of learning the ground metric as the minimization of the difference of two polyhedral convex functions over a convex set of distance matrices. We follow the presentation of our algorithms with promising experimental results on binary classification tasks using GIST descriptors of images taken in the Caltech-256 set.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Computational Optimal Transport
Gabriel Peyré,Marco Cuturi +1 more
TL;DR: This short book reviews OT with a bias toward numerical methods and their applications in data sciences, and sheds lights on the theoretical properties of OT that make it particularly useful for some of these applications.
1.9K
Optimal Transport for Domain Adaptation
TL;DR: A regularized unsupervised optimal transportation model to perform the alignment of the representations in the source and target domains, that consistently outperforms state of the art approaches and can be easily adapted to the semi-supervised case where few labeled samples are available in the target domain.
1K
•Posted Content
A Survey on Metric Learning for Feature Vectors and Structured Data
TL;DR: A systematic review of the metric learning literature is proposed, highlighting the pros and cons of each approach and presenting a wide range of methods that have recently emerged as powerful alternatives, including nonlinear metric learning, similarity learning and local metric learning.
768
•Proceedings Article
Fast Computation of Wasserstein Barycenters
Marco Cuturi,Arnaud Doucet +1 more
- 21 Jun 2014
TL;DR: Cuturi et al. as discussed by the authors proposed two original algorithms to compute Wasserstein barycenters that build upon the subgradient method, which can be used to visualize a large family of images and solve a constrained clustering problem.
•Posted Content
Fast Computation of Wasserstein Barycenters
Marco Cuturi,Arnaud Doucet +1 more
TL;DR: The Wasserstein distance is proposed to be smoothed with an entropic regularizer and recover in doing so a strictly convex objective whose gradients can be computed for a considerably cheaper computational cost using matrix scaling algorithms.
414
References
Support-Vector Networks
Corinna Cortes,Vladimir Vapnik +1 more
TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Latent dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
•Proceedings Article
Latent Dirichlet Allocation
David M. Blei,Andrew Y. Ng,Michael I. Jordan +2 more
- 03 Jan 2001
TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Object recognition from local scale-invariant features
David G. Lowe
- 20 Sep 1999
TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.
•Book
Network Flows: Theory, Algorithms, and Applications
Ravindra K. Ahuja,Thomas L. Magnanti,James B. Orlin +2 more
- 01 Jan 1993
TL;DR: In-depth, self-contained treatments of shortest path, maximum flow, and minimum cost flow problems, including descriptions of polynomial-time algorithms for these core models are presented.