Adaptive Markov control processes

Open AccessBook

Adaptive Markov control processes

- 01 May 1989

411

TL;DR: In this paper, the authors present an inventory/production system for control of water reservoir management, and a semi-Markov control model for estimating the value of a water reservoir.

Abstract: 1 Controlled Markov Processes.- 1.1 Introduction.- 1.2 Stochastic Control Problems.- Control Models.- Policies.- Performance Criteria.- Control Problems.- 1.3 Examples.- An Inventory/Production System.- Control of Water Reservoirs.- Fisheries Management.- Nonstationary MCM's.- Semi-Markov Control Models.- 1.4 Further Comments.- 2 Discounted Reward Criterion.- 2.1 Introduction.- Summary.- 2.2 Optimality Conditions.- Continuity of ?*.- 2.3 Asymptotic Discount Optimality.- 2.4 Approximation of MCM's.- Nonstationary Value-Iteration.- Finite-State Approximations.- 2.5 Adaptive Control Models.- Preliminaries.- Nonstationary Value-Iteration.- The Principle of Estimation and Control.- Adaptive Policies.- 2.6 Nonparametric Adaptive Control.- The Parametric Approach.- New Setting.- The Empirical Distribution Process.- Nonparametric Adaptive Policies.- 2.7 Comments and References.- 3 Average Reward Criterion.- 3.1 Introduction.- Summary.- 3.2 The Optimality Equation.- 3.3 Ergodicity Conditions.- 3.4 Value Iteration.- Uniform Approximations.- Successive Averagings.- 3.5 Approximating Models.- 3.6 Nonstationary Value Iteration.- Nonstationary Successive Averagings.- Discounted-Like NVI.- 3.7 Adaptive Control Models.- Preliminaries.- The Principle of Estimation and Control (PEC).- Nonstationary Value Iteration (NVI).- 3.8 Comments and References.- 4 Partially Observable Control Models.- 4.1 Introduction.- Summary.- 4.2 PO-CM: Case of Known Parameters.- The PO Control Problem.- 4.3 Transformation into a CO Control Problem.- I-Policies.- The New Control Model.- 4.4 Optimal I-Policies.- 4.5 PO-CM's with Unknown Parameters.- PEC and NVI I-Policies.- 4.6 Comments and References.- 5 Parameter Estimation in MCM's.- 5.1 Introduction.- Summary.- 5.2 Contrast Functions.- 5.3 Minimum Contrast Estimators.- 5.4 Comments and References.- 6 Discretization Procedures.- 6.1 Introduction.- Summary.- 6.2 Preliminaries.- 6.3 The Non-Adaptive Case.- A Non-Recursive Procedure.- A Recursive Procedure.- 6.4 Adaptive Control Problems.- Preliminaries.- Discretization of the PEC Adaptive Policy.- Discretization of the NVI Adaptive Policy.- 6.5 Proofs.- The Non-Adaptive Case.- The Adaptive Case.- 6.6 Comments and References.- Appendix A. Contraction Operators.- Appendix B. Probability Measures.- Total Variation Norm.- Weak Convergence.- Appendix C. Stochastic Kernels.- Appendix D. Multifunctions and Measurable Selectors.- The Hausdorff Metric.- Multifunctions.- References.- Author Index.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

•Journal Article•10.1109/TSMCB.2008.2007630

Reinforcement Learning Versus Model Predictive Control: A Comparison on a Power System Problem

Damien Ernst, +3 more

- 01 Apr 2009

TL;DR: This paper compares reinforcement learning with model predictive control in a unified framework and reports experimental results of their application to the synthesis of a controller for a nonlinear and deterministic electrical power oscillations damping problem.

...read moreread less

254

Journal Article•10.1109/9.133184

An optimal one-way multigrid algorithm for discrete-time stochastic control

C.-S. Chow, +1 more

- 01 Aug 1991

- IEEE Transactions on Automatic Control

TL;DR: It is shown that the one-way multigrid algorithm improves upon the complexity of its single-grid variant and is, in a certain sense, optimal.

TL;DR: In this article, the boundary value problems of mathematical physics can be solved by the methods of the preceding chapters by solving a variety of specific problems that illustrate the principal types of problems that were formulated in Chapter 7.

...read moreread less

1.1K

Adaptive Markov control processes

Chat with Paper

AI Agents for this Paper

Citations

Discrete-time controlled Markov processes with average cost criterion: a survey

The Capacity of Channels With Feedback

Chapter 8 Markov decision processes

Reinforcement Learning Versus Model Predictive Control: A Comparison on a Power System Problem

An optimal one-way multigrid algorithm for discrete-time stochastic control

References

Semigroups of Linear Operators and Applications to Partial Differential Equations

Regular and Stochastic Motion

Compressible fluid flow and systems of conservation laws in several space variables

Applications of Centre Manifold Theory

Boundary Value Problems of Mathematical Physics

Related Papers (5)

Discrete-time Markov control processes

Stochastic optimal control : the discrete time case

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Discrete-time controlled Markov processes with average cost criterion: a survey

Markov Decision Processes