Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems

doi:10.1109/TNNLS.2013.2281663

Journal Article•10.1109/TNNLS.2013.2281663

Derong Liu, +1 more

- 01 Mar 2014

- IEEE Transactions on Neural Networks and Learning Systems

- Vol. 25, Iss: 3, pp 621-634

688

TL;DR: The iterative performance index function is shown to be monotonically nonincreasing and to converge to the optimal solution of the Hamilton-Jacobi-Bellman equation, and every iterative control law is proven to stabilize the nonlinear system.
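The two properties in the summary above can be illustrated on a much simpler problem than the one the paper treats. The following is a minimal sketch, assuming a scalar linear system x_{k+1} = a*x_k + b*u_k with quadratic cost q*x^2 + r*u^2 and linear feedback u = -K*x. It is not the paper's neural-network implementation, and the function name, parameters, and numeric values are hypothetical, chosen only for illustration. In this special case policy evaluation has a closed form, so the nonincreasing value sequence and the stabilizing gains can be checked directly.

```python
def policy_iteration_lq(a, b, q, r, k0, iterations=20):
    """Policy iteration for the scalar system x_{k+1} = a*x_k + b*u_k, u = -K*x.

    Illustrative linear-quadratic special case (an assumption of this sketch),
    not the nonlinear neural-network scheme of the paper.
    """
    if abs(a - b * k0) >= 1.0:
        raise ValueError("initial gain must be stabilizing (admissible)")
    gain = k0
    values, gains = [], []
    for _ in range(iterations):
        closed_loop = a - b * gain
        # Policy evaluation: V(x) = p*x^2 solves p = q + r*K^2 + (a - b*K)^2 * p.
        p = (q + r * gain**2) / (1.0 - closed_loop**2)
        values.append(p)
        gains.append(gain)
        # Policy improvement: minimize q*x^2 + r*u^2 + p*(a*x + b*u)^2 over u.
        gain = a * b * p / (r + b**2 * p)
    return values, gains


# Hypothetical example: open-loop unstable plant (a > 1) with a stabilizing start.
values, gains = policy_iteration_lq(a=1.2, b=1.0, q=1.0, r=1.0, k0=1.0)
```

Here `values` holds the coefficient p_i of each iterative performance index function V_i(x) = p_i*x^2, and `gains` holds the corresponding feedback gains. The sequence p_i should not increase from one iteration to the next, and every gain should keep |a - b*K| below 1.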

Citations

Journal Article•10.1109/TNNLS.2015.2508926

Neural Network Control-Based Adaptive Learning Design for Nonlinear Systems With Full-State Constraints

Yan-Jun Liu, +3 more

- 09 Mar 2016

- IEEE Transactions on Neural Networks and Learning Systems

TL;DR: In order to stabilize a class of uncertain nonlinear strict-feedback systems with full-state constraints, an adaptive neural network control method is investigated and it is proved that all the signals in the closed-loop system are semiglobal uniformly ultimately bounded and the output is well driven to follow the desired output.

509

Journal Article•10.1109/TSMC.2020.3042876

Adaptive Dynamic Programming for Control: A Survey and Recent Advances

Derong Liu, +4 more

- 01 Jan 2021

- IEEE Transactions on Systems, Man, and C...

TL;DR: In this article, adaptive dynamic programming (ADP) and its applications in control are reviewed, and the use of ADP to solve game problems, mainly nonzero-sum game problems, is elaborated.

500

Journal Article•10.1109/TCYB.2015.2492242

Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems

Qinglai Wei, +2 more

- 01 Mar 2016

- IEEE Transactions on Systems, Man, and C...

TL;DR: In this paper, the admissibility properties of the iterative control laws are developed for value iteration algorithms for the first time, and new termination criteria are established to guarantee the effectiveness of the iterative control laws.

443

Journal Article•10.1109/TFUZZ.2015.2418000

Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone

Yan-Jun Liu, +3 more

- 01 Feb 2016

- IEEE Transactions on Fuzzy Systems

TL;DR: An adaptive fuzzy optimal control design is addressed for a class of unknown nonlinear discrete-time systems that contain unknown functions and nonsymmetric dead-zone and can be proved based on the difference Lyapunov function method.

409

Journal Article•10.1109/TIE.2016.2542134

Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method

Huaguang Zhang, +3 more

- 01 May 2017

- IEEE Transactions on Industrial Electron...

TL;DR: A data-based adaptive dynamic programming method is presented that uses current and past system data instead of an accurate system model or a traditional identification scheme, which would introduce approximation residual errors.

401

References

Journal Article•10.1126/science.153.3731.34

Dynamic Programming

Richard Bellman

- 21 Oct 1957

- Science

TL;DR: The study of brain processes has been spurred by the development of the digital computer. Understanding the ability of the human mind to make effective decisions in complex and uncertain situations would significantly improve the effectiveness of computers.

7.3K

Learning from delayed rewards

Chris Watkins

- 01 Jan 1989

5.9K

Neuro-Dynamic Programming.

Dimitri P. Bertsekas

- 01 Jan 2009

TL;DR: In this article, the authors present the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.

4.7K

Journal Article•10.1016/0921-8890(95)00026-C

Learning from delayed rewards

Ben Kröse

- 01 Oct 1995

- Robotics and Autonomous Systems

TL;DR: The invention relates to a circuit for use in a receiver which can receive two-tone/stereo signals which is intended to make a choice between mono or stereo reproduction of signal A or of signal B and vice versa.

3.9K

Book•10.1007/978-94-009-9907-7

Spacecraft attitude determination and control

James R. Wertz

- 01 Jan 1978

- Astrophysics and space science library

TL;DR: In this paper, the first comprehensive presentation of data, theory, and practice in attitude analysis is presented, including orthographic globe projections to eliminate confusion in vector drawings and a presentation of new geometrical procedures for mission analysis and attitude accuracy studies which can eliminate many complex simulations.

2.6K