Data-based Suboptimal Neuro-control Design with Reinforcement Learning for Dissipative Spatially Distributed Processes

doi:10.1021/IE4031743

Journal Article•10.1021/IE4031743

Biao Luo, +4 more

- 01 May 2014

- Industrial & Engineering Chemistry Resea...

- Vol. 53, Iss: 19, pp 8106-8119

32

TL;DR: This paper considers partially unknown spatially distributed processes (SDPs) described by general highly dissipative nonlinear partial differential equations (PDEs) and develops a data-based adaptive suboptimal neuro-control method by introducing the idea of reinforcement learning (RL).

Citations

•Journal Article•10.1109/TCYB.2014.2319577

Off-Policy Reinforcement Learning for $H_\infty$ Control Design

Biao Luo, +2 more

- 01 Jan 2015

- IEEE Transactions on Systems, Man, and C...

TL;DR: An off-policy reinforcement learning (RL) method is introduced to learn the solution of the HJI equation from real system data instead of a mathematical system model, and its convergence is proved.

339

•Journal Article•10.1109/TNNLS.2016.2585520

Model-Free Optimal Tracking Control via Critic-Only Q-Learning

Biao Luo, +3 more

- 12 Jul 2016

- IEEE Transactions on Neural Networks and Learning Systems

TL;DR: This paper aims to solve the model-free optimal tracking control problem of nonaffine nonlinear discrete-time systems with a critic-only Q-learning (CoQL) method, which avoids solving the tracking Hamilton-Jacobi-Bellman equation.

331

•Journal Article•10.1016/J.AUTOMATICA.2014.10.056

Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design

Biao Luo, +3 more

- 01 Dec 2014

- Automatica

TL;DR: This paper addresses the model-free nonlinear optimal control problem by introducing a reinforcement learning (RL) technique: a data-based approximate policy iteration (API) method that uses real system data rather than a system model.

282

•Journal Article•10.1109/TCYB.2016.2623859

Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control

Biao Luo, +4 more

- 01 Oct 2017

- IEEE Transactions on Systems, Man, and C...

TL;DR: The model-free optimal control problem of general discrete-time nonlinear systems is considered, and a data-based policy gradient adaptive dynamic programming (PGADP) algorithm is developed to design an adaptive optimal controller.

206

•Journal Article•10.1109/TCYB.2018.2821369

Adaptive $Q$-Learning for Data-Based Optimal Output Regulation With Experience Replay

Biao Luo, +2 more

- 27 Apr 2018

- IEEE Transactions on Systems, Man, and C...

TL;DR: The experience replay technique is employed in the learning process, which leads to a simple and convenient implementation of the adaptive Q-learning (QL) method, and the effectiveness of the developed adaptive QL method is verified through numerical simulations.

153

References

•Journal Article•10.1613/JAIR.301

Reinforcement learning: a survey

Leslie Pack Kaelbling, +2 more

- 01 Jan 1996

- Journal of Artificial Intelligence Resea...

TL;DR: Central issues of reinforcement learning are discussed, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

9K

•Posted Content

Reinforcement Learning: A Survey

Leslie Pack Kaelbling, +2 more

- 01 May 1996

- arXiv: Artificial Intelligence

TL;DR: A survey of reinforcement learning from a computer science perspective can be found in this article, where the authors discuss the central issues of RL, including trading off exploration and exploitation, establishing the foundations of RL via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

5.9K

•Proceedings Article•10.1109/ACC.1984.4171550

Robust adaptive control

Petros Ioannou, +1 more

- 15 Oct 1995

TL;DR: In this article, the authors present a model for dynamic control systems based on Adaptive Control System Design Steps (ACDS) with Adaptive Observers and Parameter Identifiers.

5.9K

•Journal Article•10.1023/A:1022676722315

Technical Note Q-Learning

Chris Watkins, +1 more

- 01 May 1992

- Machine Learning

TL;DR: In this article, it is shown that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action values are represented discretely.

3.8K

•Book

An introduction to infinite-dimensional linear systems theory

Ruth F. Curtain, +1 more

- 23 Jun 1995

TL;DR: This book presents Semigroup Theory, a treatment of systems theory concepts in finite dimensions with a focus on Hankel Operators and the Nehari Problem.

3.5K