Open AccessDissertation
A family of algorithms for approximate bayesian inference
Tom Minka,Rosalind W. Picard +1 more
- 01 Jan 2001
1.1K
TL;DR: This thesis presents an approximation technique that can perform Bayesian inference faster and more accurately than previously possible, and is found to be convincingly better than rival approximation techniques: Monte Carlo, Laplace's method, and variational Bayes.
read more
Abstract: One of the major obstacles to using Bayesian methods for pattern recognition has been its computational expense. This thesis presents an approximation technique that can perform Bayesian inference faster and more accurately than previously possible. This method, “Expectation Propagation,” unifies and generalizes two previous techniques: assumed-density filtering, an extension of the Kalman filter, and loopy belief propagation, an extension of belief propagation in Bayesian networks. The unification shows how both of these algorithms can be viewed as approximating the true posterior distribution with simpler distribution, which is close in the sense of KL-divergence. Expectation Propagation exploits the best of both algorithms: the generality of assumed-density filtering and the accuracy of loopy belief propagation.
Loopy belief propagation, because it propagates exact belief states, is useful for limited types of belief networks, such as purely discrete networks. Expectation Propagation approximates the belief states with expectations, such as means and variances, giving it much wider scope. Expectation Propagation also extends belief propagation in the opposite direction—propagating richer belief states which incorporate correlations between variables.
This framework is demonstrated in a variety of statistical models using synthetic and real-world data. On Gaussian mixture problems, Expectation Propagation is found, for the same amount of computation, to be convincingly better than rival approximation techniques: Monte Carlo, Laplace's method, and variational Bayes. For pattern recognition, Expectation Propagation provides an algorithm for training Bayes Point Machine classifiers that is faster and more accurate than any previously known. The resulting classifiers outperform Support Vector Machines on several standard datasets, in addition to having a comparable training time. Expectation Propagation can also be used to choose an appropriate feature set for classification, via Bayesian model selection. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Bayesian Time Series Models: Expectation maximisation methods for solving (PO)MDPs and optimal control problems
Marc Toussaint,Amos Storkey,Stefan Harmeling +2 more
- 01 Jan 2011
TL;DR: This chapter shows that efficient probabilistic inference techniques can be used also for solving Markov Decision Processes or partial observable MDPs when formulated in terms of a structured dynamic Bayesian network (DBN).
Kernel-Based Copula Processes
Sebastian Jaimungal,Eddie K. H. Ng +1 more
- 30 Aug 2009
TL;DR: Kernel-based Copula Processes (KCPs) are proposed here as a unifying framework to model the interdependency across multiple time- series and the long-range dependency within an individual time-series.
Information-Theoretic Transfer Learning Framework for Bayesian Optimisation
Anil Ramachandran,Sunil Gupta,Santu Rana,Svetha Venkatesh +3 more
- 10 Sep 2018
TL;DR: A robust transfer learning based approach that transfer knowledge of the optima using a consistent probabilistic framework and offers desirable “no bias” transfer learning in the limit is proposed.
19
Doubly Iterative Turbo Equalization: Optimization through Deep Unfolding
Serdar Sahin,Charly Poulliat,Antonio Maria Cipriano,Marie-Laure Boucheret +3 more
- 08 Sep 2019
TL;DR: A novel, mutual-information dependent learning cost function is proposed, suited to turbo detectors, and through learning, the detection performance of the deep EP network is optimized.
A Variational Bayesian Inference-Inspired Unrolled Deep Network for MIMO Detection
TL;DR: In this paper , a model-driven DL detector based on variational Bayesian inference is proposed for multi-input multi-output (MIMO) systems, which circumvents matrix inversion via maximizing a relaxed evidence lower bound.
References
Factor graphs and the sum-product algorithm
TL;DR: A generic message-passing algorithm, the sum-product algorithm, that operates in a factor graph, that computes-either exactly or approximately-various marginal functions derived from the global function.
Sampling-Based Approaches to Calculating Marginal Densities
TL;DR: In this paper, three sampling-based approaches, namely stochastic substitution, the Gibbs sampler, and the sampling-importance-resampling algorithm, are compared and contrasted in relation to various joint probability structures frequently encountered in applications.
•Journal Article
Sampling-based approaches to calculating marginal densities
TL;DR: Stochastic substitution, the Gibbs sampler, and the sampling-importance-resampling algorithm can be viewed as three alternative sampling- (or Monte Carlo-) based approaches to the calculation of numerical estimates of marginal probability distributions.
6.6K
An introduction to variational methods for graphical models
TL;DR: This paper presents a tutorial introduction to the use of variational methods for inference and learning in graphical models (Bayesian networks and Markov random fields), and describes a general framework for generating variational transformations based on convex duality.