Open AccessProceedings Article
Structural Maxent Models
Corinna Cortes,Vitaly Kuznetsov,Mehryar Mohri,Umar Syed +3 more
- 06 Jul 2015
- pp 391-399
TL;DR: A new class of density estimation models, Structural Maxent models, with feature functions selected from a union of possibly very complex sub-families and yet benefiting from strong learning guarantees are presented, based on a new principle supported by uniform convergence bounds and taking into consideration the complexity of the different sub- families composing the full set of features.
read more
Abstract: We present a new class of density estimation models, Structural Maxent models, with feature functions selected from a union of possibly very complex sub-families and yet benefiting from strong learning guarantees. The design of our models is based on a new principle supported by uniform convergence bounds and taking into consideration the complexity of the different sub-families composing the full set of features. We prove new data-dependent learning bounds for our models, expressed in terms of the Rademacher complexities of these sub-families. We also prove a duality theorem, which we use to derive our Structural Maxent algorithm. We give a full description of our algorithm, including the details of its derivation, and report the results of several experiments demonstrating that its performance improves on that of existing L1-norm regularized Maxent algorithms. We further similarly define conditional Structural Maxent models for multi-class classification problems. These are conditional probability models also making use of a union of possibly complex feature subfamilies. We prove a duality theorem for these models as well, which reveals their connection with existing binary and multi-class deep boosting algorithms.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Generalized Maximum Entropy for Supervised Classification.
TL;DR: This paper establishes a framework for supervised classification based on the generalized maximum entropy principle that leads to minimax risk classifiers (MRCs), and develops learning techniques that determine MRCs for general entropy functions and provide performance guarantees by means of convex optimization.
Journal Article
Minimax risk classifiers with 0-1 loss
TL;DR: It is shown that MRCs are strongly universally consistent using feature mappings given by characteristic kernels and can provide accurate classification together with tight performance guarantees in practice.
Variational Annealing on Graphs for Combinatorial Optimization
Sebastian Sanokowski,Wilhelm Berghammer,Sepp Hochreiter,Sebastian Lehner +3 more
TL;DR: It is corroborated that an autoregressive approach which captures statistical dependencies among solution variables yields superior performance on many popular CO problems and introduces subgraph tokenization in which the configuration of a set of solution variables is represented by a single token.
Maximum Weight Entropy
Antoine de Mathelin,Franccois Deheeger,Mathilde Mougeot,Nicolas Vayatis +3 more
TL;DR: A novel weight parameterization for the stochastic model, based on the singular value decomposition of the neural network's hidden representations, which enables a large increase of the weight entropy for a small empirical risk penalization.
References
•Book
Elements of information theory
Thomas M. Cover,Joy A. Thomas +1 more
- 01 Jan 1991
TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
Statistical learning theory
Vladimir Vapnik
- 01 Jan 1998
TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.
30.4K
Maximum entropy modeling of species geographic distributions
TL;DR: In this paper, the use of the maximum entropy method (Maxent) for modeling species geographic distributions with presence-only data was introduced, which is a general-purpose machine learning method with a simple and precise mathematical formulation.
16.5K
Information Theory and Statistical Mechanics. II
TL;DR: In this article, the authors consider statistical mechanics as a form of statistical inference rather than as a physical theory, and show that the usual computational rules, starting with the determination of the partition function, are an immediate consequence of the maximum-entropy principle.
14K
A statistical explanation of MaxEnt for ecologists
TL;DR: A new statistical explanation of MaxEnt is described, showing that the model minimizes the relative entropy between two probability densities defined in covariate space, which is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts.
5.9K
Related Papers (5)
Yann LeCun,Ayse Erkan +1 more
- 01 Jan 2010
Shuyang Gao,Greg Ver Steeg,Aram Galstyan +2 more
- 05 Dec 2016