Bert Kappen
Radboud University Nijmegen
46 Papers
667 Citations
Bert Kappen is an academic researcher from Radboud University Nijmegen. The author has contributed to research in topics: Artificial neural network & Graphical model. The author has an hindex of 20, co-authored 46 publications.
Chat about Author
Papers
•Proceedings Article
On the Sample Complexity of Reinforcement Learning with a Generative Model
Mohammad Gheshlaghi Azar,Bert Kappen,R mi Munos +2 more
- 26 Jun 2012
TL;DR: This work considers the problems of learning the optimal action-value function and the optimal policy in discounted-reward Markov decision processes (MDPs) and proves new PAC bounds on the sample-complexity of two well-known model-based reinforcement learning (RL) algorithms in the presence of a generative model of the MDP.
Optimal control as a graphical model inference problem
TL;DR: In this paper, the authors reformulate a class of non-linear stochastic optimal control problems as a Kullback-Leibler (KL) minimization problem and apply approximate inference methods to efficiently compute approximate optimal controls.
•Proceedings Article
Approximate inference and constrained optimization
Tom Heskes,Kees Albers,Bert Kappen +2 more
- 07 Aug 2002
TL;DR: In this article, the authors describe a class of algorithms that solve this typically non-convex constrained minimization problem through a sequence of convex constrained minimumizations of upper bounds on the Kikuchi free energy.
Graphical model inference in optimal control of stochastic multi-agent systems
TL;DR: This article uses naive mean field approximation and belief propagation to approximate the optimal control in systems with linear dynamics, and compares the approximate inference methods with the exact solution, and shows that they can accurately compute the optimalControl.
•Proceedings Article
Risk sensitive path integral control
Bart van den Broek,Wim Wiegerinck,Bert Kappen +2 more
- 08 Jul 2010
TL;DR: In this article, the authors show that path integral methods generalize directly to risk sensitive stochastic optimal control with non-linear dynamics in continuous space-time, and demonstrate the effect of multi-modal control with risk sensitivity.