State Space Compression with Predictive Representations

Open AccessProceedings Article

State Space Compression with Predictive Representations

- 01 May 2008

- pp 41-46

TL;DR: This approach aims to minimize the potential error that may be caused by missing a number of core tests and provides analysis of the error caused by this compression and presents an empirical evaluation illustrating the performance of this approach.

Abstract: Current studies have demonstrated that the representational power of predictive state representations (PSRs) is at least equal to the one of partially observable Markov decision processes (POMDPs). This is while early steps in planning and generalization with PSRs suggest substantial improvements compared to POMDPs. However, lack of practical algorithms for learning these representations severely restricts their applicability. The computational inefficiency of exact PSR learning methods naturally leads to the exploration of various approximation methods that can provide a good set of core tests through less computational effort. In this paper, we address this problem in an optimization framework. In particular, our approach aims to minimize the potential error that may be caused by missing a number of core tests. We provide analysis of the error caused by this compression and present an empirical evaluation illustrating the performance of this approach.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

References

•Proceedings Article

Predictive Representations of State

Michael L. Littman, +1 more

- 03 Jan 2001

TL;DR: This is the first specific formulation of the predictive idea that includes both stochasticity and actions (controls) and it is shown that any system has a linear predictive state representation with number of predictions no greater than the number of states in its minimal POMDP model.

...read moreread less

600

Exact and approximate algorithms for partially observable markov decision processes

Leslie Pack Kaelbling, +1 more

- 01 Jan 1998

TL;DR: This work looks at sequential decision making in environments where the actions have probabilistic outcomes and in which the system state is only partially observable and considers a number of approaches for deriving policies that yield sub-optimal control and empirically explore their performance on a range of problems.

...read moreread less

461

Proceedings Article•10.1145/1102351.1102475

Learning predictive state representations in dynamical systems without reset

Britton Wolfe, +2 more

- 07 Aug 2005

TL;DR: Two algorithms can learn models for systems without requiring a reset action as was needed by the previously available general PSR-model learning algorithm: a Monte Carlo algorithm and a temporal difference algorithm.

...read moreread less

95

Proceedings Article•10.1145/1015330.1015359

Learning and discovery of predictive state representations in dynamical systems with reset

Michael James, +1 more

- 04 Jul 2004

TL;DR: The first discovery algorithm and a new learning algorithm for linear PSR-based models for the special class of controlled dynamical systems that have a reset operation are provided and experimental verification of these algorithms are provided.

...read moreread less

95

•Proceedings Article

Online Discovery and Learning of Predictive State Representations

Peter McCracken, +1 more

- 05 Dec 2005

TL;DR: This paper presents a new algorithm for discovery and learning of PSRs that uses a gradient descent approach to compute the predictions for the current state, and takes advantage of the large amount of structure inherent in a valid prediction matrix to constrain its predictions.

...read moreread less

64