Relocatable Action Models for Autonomous Navigation

Open Access

Relocatable Action Models for Autonomous Navigation

Bethany R. Leer, +1 more

- 01 Jan 2007

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

References

•Journal Article•10.1023/A:1022633531479

Learning to Predict by the Methods of Temporal Differences

Richard S. Sutton

- 01 Aug 1988

- Machine Learning

TL;DR: This article introduces a class of incremental learning procedures specialized for prediction – that is, for using past experience with an incompletely known system to predict its future behavior – and proves their convergence and optimality for special cases and relation to supervised-learning methods.

...read moreread less

5.2K

•Journal Article•10.3233/ICG-1995-18207

Temporal Difference Learning and TD-Gammon

Gerald Tesauro

- 01 Jan 1995

- ICGA Journal

TL;DR: TD-GAMMON is a neural network that trains itself to be an evaluation function for the game of backgammon by playing against itself and learning from the outcome.

...read moreread less

1.6K

Journal Article•10.1145/203330.203343

Temporal difference learning and TD-Gammon

Gerald Tesauro

- 01 Mar 1995

- Communications of The ACM

TL;DR: The domain of complex board games such as Go, chess, checkers, Othello, and backgammon has been widely regarded as an ideal testing ground for exploring a variety of concepts and approaches in artificial intelligence and machine learning.

...read moreread less

1.6K

•Journal Article•10.1162/153244303765208377

R-max - a general polynomial time algorithm for near-optimal reinforcement learning

Ronen I. Brafman, +1 more

- 01 Mar 2003

- Journal of Machine Learning Research

TL;DR: R-MAX is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time and formally justifies the ``optimism under uncertainty'' bias used in many RL algorithms.

...read moreread less

1.3K

•Proceedings Article

R-MAX: a general polynomial time algorithm for near-optimal reinforcement learning

Ronen I. Brafman, +1 more

- 04 Aug 2001

TL;DR: R-MAX as mentioned in this paper is a model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time, where the agent always maintains a complete, but possibly inaccurate model of its environment and acts based on the optimal policy derived from this model.

...read moreread less

1K

Relocatable Action Models for Autonomous Navigation

Chat with Paper

AI Agents for this Paper

References

Learning to Predict by the Methods of Temporal Differences

Temporal Difference Learning and TD-Gammon

Temporal difference learning and TD-Gammon

R-max - a general polynomial time algorithm for near-optimal reinforcement learning

R-MAX: a general polynomial time algorithm for near-optimal reinforcement learning

Related Papers (5)

The Design of Ultra-High Integrity Navigation System for Large Autonomous Vehicles

Navigation of an Autonomous Vehicle

Autonomous navigation - Where we are in 1984

Autonomous decision making in local navigation

Agent and model-based simulation framework for deep space navigation analysis and design