A planning algorithm for predictive state representations

Open Access•Proceedings Article

- 09 Aug 2003

- pp 1520-1521

18

TL;DR: This paper presents a policy iteration algorithm for finding policies using PSRs; in preliminary experiments, the algorithm produced good solutions.

Citations

Proceedings Article•10.1145/1015330.1015441

Learning low dimensional predictive representations

Matthew Rosencrantz, +2 more

- 04 Jul 2004

TL;DR: This work provides an efficient principal-components-based algorithm for learning transformed predictive state representations (TPSRs), and shows that TPSRs can perform well in comparison to Hidden Markov Models learned with Baum-Welch in a real-world robot tracking task for low-dimensional representations and long prediction horizons.

129

Proceedings Article•10.1109/ICMLA.2004.1383528

Planning with predictive state representations

Michael James, +2 more

- 01 Dec 2004

TL;DR: This paper develops and evaluates two general planning algorithms for PSR models that exploit the piecewise linear property of value functions for finite-horizon problems, and shows how traditional reinforcement learning algorithms such as Q-learning can be extended to PSR models.

50

Book Chapter•10.1007/978-3-540-68825-9_13

Point-based planning for predictive state representations

Masoumeh T. Izadi, +1 more

- 28 May 2008

TL;DR: An algorithm for approximate planning in PSRs is presented, based on an approach similar to point-based value iteration in POMDPs, which turns out to be a natural match for the PSR state representation.

21

•Proceedings Article•10.1109/ICMLA.2009.36

Sensitivity Analysis of POMDP Value Functions

Stephane Ross, +3 more

- 13 Dec 2009

TL;DR: This paper addresses two types of perturbations in POMDP model parameters, namely additive and multiplicative, and provides theoretical bounds for the impact of these changes in the value function.

14

•Proceedings Article

Planning in models that combine memory with predictive representations of state

Michael James, +1 more

- 09 Jul 2005

TL;DR: This paper demonstrates that the structure captured by mPSRs can be exploited quite naturally for stochastic planning based on value-iteration algorithms, and adapts the incremental-pruning (IP) algorithm defined for planning in POMDPs to mPSRs.

12

References

•Proceedings Article

Predictive Representations of State

Michael L. Littman, +1 more

- 03 Jan 2001

TL;DR: This is the first specific formulation of the predictive idea that includes both stochasticity and actions (controls) and it is shown that any system has a linear predictive state representation with number of predictions no greater than the number of states in its minimal POMDP model.

600

•Proceedings Article

Approximating optimal policies for partially observable stochastic domains

Ronald Parr, +1 more

- 20 Aug 1995

TL;DR: Smooth Partially Observable Value Approximation (SPOVA) is introduced, a new approximation method that can quickly yield good approximations which can improve over time and can be combined with reinforcement learning methods, a combination that was very effective in test cases.

•Proceedings Article

Acting Optimally in Partially Observable Stochastic Domains

Anthony R. Cassandra, +2 more

- 01 Aug 1994

TL;DR: The existing algorithms for computing optimal control strategies for partially observable stochastic environments are found to be highly computationally inefficient and a new algorithm is developed that is empirically more efficient.

Algorithms for Sequential Decision Making

Michael L. Littman

- 01 Jan 1996

TL;DR: This thesis shows how to answer the question ``What should I do now?

Related Papers (5)

Predictive Representations of State

Planning with predictive state representations

Reinforcement Learning: An Introduction

Planning and Acting in Partially Observable Stochastic Domains

Incremental pruning: a simple, fast, exact method for partially observable Markov decision processes