Eigenfunction approximation methods for linearly-solvable optimal control problems

doi:10.1109/ADPRL.2009.4927540

Proceedings Article10.1109/ADPRL.2009.4927540

Eigenfunction approximation methods for linearly-solvable optimal control problems

Emanuel Todorov

- 15 May 2009

- pp 161-168

63

TL;DR: A general class of nonlinear stochastic optimal control problems which can be reduced to computing the principal eigenfunction of a linear operator is identified and function approximation methods exploiting this inherent linearity are developed.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1109/CDC.2008.4739438

General duality between optimal control and estimation

Emanuel Todorov

- 01 Dec 2008

TL;DR: This work obtains a more natural form of LQG duality by replacing the Kalman-Bucy filter with the information filter and generalizes this result to non-linear stochastic systems, discrete stochastics systems, and deterministic systems.

...read moreread less

392

•Proceedings Article

Inverse Optimal Control with Linearly-Solvable MDPs

Krishnamurthy Dvijotham, +1 more

- 21 Jun 2010

TL;DR: Unlike all prior IRL algorithms which assume pre-existing features, this work study feature adaptation and shows that such adaptation is essential in continuous state spaces.

...read moreread less

168

•Proceedings Article•10.15607/RSS.2011.VII.010

Infinite-Horizon Model Predictive Control for Periodic Tasks with Contacts

Tom Erez, +2 more

- 27 Jun 2011

TL;DR: This paper uses offline optimization to find the limit-cycle solution of an infinite-horizon average-cost optimal-control task, and compute a local quadratic approximation of the Value function around this limit cycle that is used as the terminal cost of an online MPC.

...read moreread less

99

•Journal Article•10.1109/TIV.2019.2904417

Merging in Congested Freeway Traffic Using Multipolicy Decision Making and Passive Actor-Critic Learning

Tomoki Nishi, +2 more

- 20 Mar 2019

TL;DR: In this article, a method for freeway merge based on multipolicy decision making coupled with a reinforcement learning technique called passive actor-critic (pAC) is presented, which learns with less knowledge of the system and without active exploration.

...read moreread less

45

•Journal Article•10.1109/TIV.2019.2904417

Freeway Merging in Congested Traffic based on Multipolicy Decision Making with Passive Actor Critic.

Tomoki Nishi, +2 more

- 14 Jul 2017

- arXiv: Artificial Intelligence

TL;DR: A method for the freeway merging based on multi-policy decision making with a reinforcement learning method called pAC, which learns with less knowledge of the system and without active exploration to achieve 92% success rate to merge into a freeway, which is comparable to human decision making.

...read moreread less

40

...

Expand

References

•Proceedings Article

Linearly-solvable Markov decision problems

Emanuel Todorov

- 04 Dec 2006

TL;DR: A class of MPDs which greatly simplify Reinforcement Learning, which have discrete state spaces and continuous control spaces and enable efficient approximations to traditional MDPs.

...read moreread less

522

Proceedings Article•10.1109/CDC.2008.4739438

General duality between optimal control and estimation

Emanuel Todorov

- 01 Dec 2008

TL;DR: This work obtains a more natural form of LQG duality by replacing the Kalman-Bucy filter with the information filter and generalizes this result to non-linear stochastic systems, discrete stochastics systems, and deterministic systems.

...read moreread less

392

Journal Article•10.1137/1036162

Numerical Methods for Stochastic Control Problems in Continuous Time (Harold J. Kushner and Paul G. Dupuis)

Tyrone E. Duncan

- 01 Dec 1994

- Siam Review

TL;DR: Numerical methods for stochastic control problems in continuous time to help people facing with some harmful virus inside their desktop computer to read a good book with a cup of coffee in the afternoon.

...read moreread less

138

Journal Article•10.1137/S0363012901393894

A Variational Approach to Nonlinear Estimation

Sanjoy K. Mitter, +1 more

- 01 May 2003

- Siam Journal on Control and Optimization

TL;DR: Regular conditional versions of the forward and inverse Bayes formula are shown to have dual variational characterizations involving the minimization of apparent information and the maximization of compatible information, according to which Bayes' formula and its inverse are optimal information processors.

...read moreread less

117

•Journal Article•10.1103/PHYSREVLETT.95.200201

Linear theory for control of nonlinear stochastic systems.

Hilbert J. Kappen

- 07 Nov 2005

- Physical Review Letters

TL;DR: The role of noise and the issue of efficient computation in stochastic optimal control problems are addressed and a class of nonlinear control problems that can be formulated as a path integral and where the noise plays the role of temperature is considered.

...read moreread less