Open AccessProceedings Article
State Space Compression with Predictive Representations
Abdeslam Boularias,Masoumeh T. Izadi,Brahim Chaib-draa,D. C. Wilson H. C. Lane +3 more
- 01 May 2008
- pp 41-46
TL;DR: This approach aims to minimize the potential error that may be caused by missing a number of core tests and provides analysis of the error caused by this compression and presents an empirical evaluation illustrating the performance of this approach.
read more
Abstract: Current studies have demonstrated that the representational power of predictive state representations (PSRs) is at least equal to the one of partially observable Markov decision processes (POMDPs). This is while early steps in planning and generalization with PSRs suggest substantial improvements compared to POMDPs. However, lack of practical algorithms for learning these representations severely restricts their applicability. The computational inefficiency of exact PSR learning methods naturally leads to the exploration of various approximation methods that can provide a good set of core tests through less computational effort. In this paper, we address this problem in an optimization framework. In particular, our approach aims to minimize the potential error that may be caused by missing a number of core tests. We provide analysis of the error caused by this compression and present an empirical evaluation illustrating the performance of this approach.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
References
•Proceedings Article
Predictive Representations of State
Michael L. Littman,Richard S. Sutton +1 more
- 03 Jan 2001
TL;DR: This is the first specific formulation of the predictive idea that includes both stochasticity and actions (controls) and it is shown that any system has a linear predictive state representation with number of predictions no greater than the number of states in its minimal POMDP model.
Exact and approximate algorithms for partially observable markov decision processes
Leslie Pack Kaelbling,Anthony R. Cassandra +1 more
- 01 Jan 1998
TL;DR: This work looks at sequential decision making in environments where the actions have probabilistic outcomes and in which the system state is only partially observable and considers a number of approaches for deriving policies that yield sub-optimal control and empirically explore their performance on a range of problems.
461
Learning predictive state representations in dynamical systems without reset
Britton Wolfe,Michael James,Satinder Singh +2 more
- 07 Aug 2005
TL;DR: Two algorithms can learn models for systems without requiring a reset action as was needed by the previously available general PSR-model learning algorithm: a Monte Carlo algorithm and a temporal difference algorithm.
95
Learning and discovery of predictive state representations in dynamical systems with reset
Michael James,Satinder Singh +1 more
- 04 Jul 2004
TL;DR: The first discovery algorithm and a new learning algorithm for linear PSR-based models for the special class of controlled dynamical systems that have a reset operation are provided and experimental verification of these algorithms are provided.
•Proceedings Article
Online Discovery and Learning of Predictive State Representations
Peter McCracken,Michael Bowling +1 more
- 05 Dec 2005
TL;DR: This paper presents a new algorithm for discovery and learning of PSRs that uses a gradient descent approach to compute the predictions for the current state, and takes advantage of the large amount of structure inherent in a valid prediction matrix to constrain its predictions.
64
Related Papers (5)
Masoumeh T. Izadi,Doina Precup +1 more
- 28 May 2008
Michael James,Satinder Singh,Michael L. Littman +2 more
- 01 Dec 2004
Masoumeh T. Izadi,Doina Precup +1 more
- 09 Aug 2003