Curious model-building control systems

doi:10.1109/IJCNN.1991.170605

Open Access•Proceedings Article•10.1109/IJCNN.1991.170605

Jürgen Schmidhuber

- 18 Nov 1991

- pp 1458-1463

726

TL;DR: A novel curious model-building control system is described that actively tries to provoke situations in which it has learned to expect to learn something about the environment. The system is based on Watkins' Q-learning algorithm.
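To make the idea in the summary concrete, here is a minimal sketch of curiosity-driven Q-learning. It is not the architecture from the paper. It assumes a tabular Q-function, a hypothetical ring-shaped toy environment, and a curiosity reward defined as the reduction in a simple world model's prediction error after each update. All names (`run_curious_q_learning`, `model_error`, `step`) and parameter values are illustrative assumptions.

```python
import random


def run_curious_q_learning(n_states=5, n_actions=2, steps=2000, alpha=0.1,
                           gamma=0.9, epsilon=0.1, model_lr=0.5, seed=0):
    """Tabular Q-learning whose only reward is world-model improvement."""
    rng = random.Random(seed)

    # Hypothetical deterministic environment: states form a ring;
    # action 0 stays in place, action 1 moves one state to the right.
    def step(state, action):
        return (state + action) % n_states

    # World model: for each (state, action), a predicted distribution
    # over next states, initialised to uniform (maximal ignorance).
    model = [[[1.0 / n_states] * n_states for _ in range(n_actions)]
             for _ in range(n_states)]
    q = [[0.0] * n_actions for _ in range(n_states)]

    def model_error(s, a, s_next):
        # Squared error between the prediction and the observed one-hot outcome.
        return sum((p - (1.0 if i == s_next else 0.0)) ** 2
                   for i, p in enumerate(model[s][a]))

    state = 0
    total_curiosity = 0.0
    for _ in range(steps):
        # Epsilon-greedy action selection with random tie-breaking.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            best = max(q[state])
            action = rng.choice(
                [a for a in range(n_actions) if q[state][a] == best])
        next_state = step(state, action)

        # Assumed curiosity signal: how much did this experience
        # improve the model's prediction for (state, action)?
        error_before = model_error(state, action, next_state)
        for i in range(n_states):
            target = 1.0 if i == next_state else 0.0
            model[state][action][i] += model_lr * (target - model[state][action][i])
        error_after = model_error(state, action, next_state)
        curiosity_reward = error_before - error_after

        # Standard one-step Q-learning update, driven by the curiosity reward.
        td_target = curiosity_reward + gamma * max(q[next_state])
        q[state][action] += alpha * (td_target - q[state][action])

        total_curiosity += curiosity_reward
        state = next_state
    return q, model, total_curiosity


if __name__ == "__main__":
    q_table, world_model, total = run_curious_q_learning()
    print("total curiosity reward:", round(total, 3))
```

In this sketch the reward shrinks toward zero once a transition is well predicted, so the controller has less and less reason to revisit situations it already understands. That is the qualitative behaviour the summary describes, under the assumptions stated above.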

Citations

•Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

- 01 Jan 1988

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

39.7K

•Journal Article•10.1016/J.NEUNET.2014.09.003

Deep learning in neural networks

Jürgen Schmidhuber

- 01 Jan 2015

- Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, reviewing deep supervised learning, unsupervised learning, reinforcement learning and evolutionary computation, and indirect search for short programs encoding deep and large networks.

18.7K

•Journal Article•10.1613/JAIR.301

Reinforcement learning: a survey

Leslie Pack Kaelbling, +2 more

- 01 Jan 1996

- Journal of Artificial Intelligence Resea...

TL;DR: Central issues of reinforcement learning are discussed, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

9K

•Posted Content

Reinforcement Learning: A Survey

Leslie Pack Kaelbling, +2 more

- 01 May 1996

- arXiv: Artificial Intelligence

TL;DR: A survey of reinforcement learning from a computer science perspective can be found in this article, where the authors discuss the central issues of RL, including trading off exploration and exploitation, establishing the foundations of RL via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

5.9K

•Journal Article•10.1016/J.NEUNET.2019.01.012

Continual lifelong learning with neural networks: A review.

German Ignacio Parisi, +4 more

- 01 May 2019

- Neural Networks

TL;DR: This review critically summarizes the main challenges linked to lifelong learning for artificial learning systems and compares existing neural network approaches that alleviate, to different extents, catastrophic forgetting.

3.2K

References

Learning from delayed rewards

Chris Watkins

- 01 Jan 1989

5.9K

Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences

Paul J. Werbos

- 01 Jan 1974

4.8K

Journal Article•10.1109/TSMC.1983.6313077

Neuronlike adaptive elements that can solve difficult learning control problems

Andrew G. Barto, +2 more

- 01 Sep 1983

TL;DR: This article shows that a system consisting of two neuron-like adaptive elements can solve a difficult learning control problem: balancing a pole that is hinged to a movable cart by applying forces to the cart base.

3.4K

•Book

Neuronlike adaptive elements that can solve difficult learning control problems

Andrew G. Barto, +2 more

- 03 Jan 1990

TL;DR: This article shows that a system consisting of two neuron-like adaptive elements can solve a difficult learning control problem: balancing a pole that is hinged to a movable cart by applying forces to the cart base.

2.1K

•Book Chapter•10.1016/B978-1-55860-141-3.50030-4

Integrated architecture for learning, planning, and reacting based on approximating dynamic programming

Richard S. Sutton

- 01 Jun 1990

TL;DR: This paper extends previous work with Dyna, a class of architectures for intelligent systems based on approximating dynamic programming methods, and presents and shows results for two Dyna architectures, based on Watkins's Q-learning, a new kind of reinforcement learning.

1.8K

Related Papers (5)

Intrinsic Motivation Systems for Autonomous Mental Development

Reinforcement Learning: An Introduction

Intrinsically Motivated Reinforcement Learning

Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

Reinforcement learning: a survey