Structural Maxent Models

Open AccessProceedings Article

Structural Maxent Models

- 06 Jul 2015

- pp 391-399

7

TL;DR: A new class of density estimation models, Structural Maxent models, with feature functions selected from a union of possibly very complex sub-families and yet benefiting from strong learning guarantees are presented, based on a new principle supported by uniform convergence bounds and taking into consideration the complexity of the different sub- families composing the full set of features.

Abstract: We present a new class of density estimation models, Structural Maxent models, with feature functions selected from a union of possibly very complex sub-families and yet benefiting from strong learning guarantees. The design of our models is based on a new principle supported by uniform convergence bounds and taking into consideration the complexity of the different sub-families composing the full set of features. We prove new data-dependent learning bounds for our models, expressed in terms of the Rademacher complexities of these sub-families. We also prove a duality theorem, which we use to derive our Structural Maxent algorithm. We give a full description of our algorithm, including the details of its derivation, and report the results of several experiments demonstrating that its performance improves on that of existing L1-norm regularized Maxent algorithms. We further similarly define conditional Structural Maxent models for multi-class classification problems. These are conditional probability models also making use of a union of possibly complex feature subfamilies. We prove a duality theorem for these models as well, which reveals their connection with existing binary and multi-class deep boosting algorithms.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Convex Analysisの二,三の進展について

徹丸山

- 01 Feb 1977

5.9K

•Posted Content

Generalized Maximum Entropy for Supervised Classification.

Santiago Mazuelas, +2 more

- 10 Jul 2020

- arXiv: Machine Learning

TL;DR: This paper establishes a framework for supervised classification based on the generalized maximum entropy principle that leads to minimax risk classifiers (MRCs), and develops learning techniques that determine MRCs for general entropy functions and provide performance guarantees by means of convex optimization.

...read moreread less

21

Journal Article

Minimax risk classifiers with 0-1 loss

Santiago Mazuelas, +2 more

- 17 Jan 2022

- arXiv.org

TL;DR: It is shown that MRCs are strongly universally consistent using feature mappings given by characteristic kernels and can provide accurate classiﬁcation together with tight performance guarantees in practice.

...read moreread less

4

Journal Article•10.48550/arxiv.2311.14156

Variational Annealing on Graphs for Combinatorial Optimization

Sebastian Sanokowski, +3 more

- 23 Nov 2023

- arXiv.org

TL;DR: It is corroborated that an autoregressive approach which captures statistical dependencies among solution variables yields superior performance on many popular CO problems and introduces subgraph tokenization in which the configuration of a set of solution variables is represented by a single token.

...read moreread less

Journal Article•10.48550/arxiv.2309.15704

Maximum Weight Entropy

Antoine de Mathelin, +3 more

- 27 Sep 2023

- arXiv.org

TL;DR: A novel weight parameterization for the stochastic model, based on the singular value decomposition of the neural network's hidden representations, which enables a large increase of the weight entropy for a small empirical risk penalization.

...read moreread less

References

•Book

Elements of information theory

Thomas M. Cover, +1 more

- 01 Jan 1991

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.

...read moreread less

52.2K

Statistical learning theory

Vladimir Vapnik

- 01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

30.4K

•Journal Article•10.1016/J.ECOLMODEL.2005.03.026

Maximum entropy modeling of species geographic distributions

Steven J. Phillips, +3 more

- 25 Jan 2006

- Ecological Modelling

TL;DR: In this paper, the use of the maximum entropy method (Maxent) for modeling species geographic distributions with presence-only data was introduced, which is a general-purpose machine learning method with a simple and precise mathematical formulation.

...read moreread less

16.5K

Journal Article•10.1103/PHYSREV.106.620

Information Theory and Statistical Mechanics. II

E. T. Jaynes

- 15 Oct 1957

- Physical Review

TL;DR: In this article, the authors consider statistical mechanics as a form of statistical inference rather than as a physical theory, and show that the usual computational rules, starting with the determination of the partition function, are an immediate consequence of the maximum-entropy principle.

...read moreread less

14K

•Journal Article•10.1111/J.1472-4642.2010.00725.X

A statistical explanation of MaxEnt for ecologists

Jane Elith, +5 more

- 01 Jan 2011

- Diversity and Distributions

TL;DR: A new statistical explanation of MaxEnt is described, showing that the model minimizes the relative entropy between two probability densities defined in covariate space, which is likely to be a more accessible way to understand the model than previous ones that rely on machine learning concepts.

...read moreread less

5.9K

...

Expand

Structural Maxent Models

Chat with Paper

AI Agents for this Paper

Citations

Convex Analysisの二,三の進展について

Generalized Maximum Entropy for Supervised Classification.

Minimax risk classifiers with 0-1 loss

Variational Annealing on Graphs for Combinatorial Optimization

Maximum Weight Entropy

References

Elements of information theory

Statistical learning theory

Maximum entropy modeling of species geographic distributions

Information Theory and Statistical Mechanics. II

A statistical explanation of MaxEnt for ecologists

Related Papers (5)

Semi-supervised learning via generalized maximum entropy

Multiple-source adaptation theory and algorithms

Large-deviation analysis and applications Of learning tree-structured graphical models

Variational information maximization for feature selection

Learning Kolmogorov Models for Binary Random Variables