Open AccessBook
Adaptive Markov control processes
Onésimo Hernández-Lerma
- 01 May 1989
411
TL;DR: In this paper, the authors present an inventory/production system for control of water reservoir management, and a semi-Markov control model for estimating the value of a water reservoir.
read more
Abstract: 1 Controlled Markov Processes.- 1.1 Introduction.- 1.2 Stochastic Control Problems.- Control Models.- Policies.- Performance Criteria.- Control Problems.- 1.3 Examples.- An Inventory/Production System.- Control of Water Reservoirs.- Fisheries Management.- Nonstationary MCM's.- Semi-Markov Control Models.- 1.4 Further Comments.- 2 Discounted Reward Criterion.- 2.1 Introduction.- Summary.- 2.2 Optimality Conditions.- Continuity of ?*.- 2.3 Asymptotic Discount Optimality.- 2.4 Approximation of MCM's.- Nonstationary Value-Iteration.- Finite-State Approximations.- 2.5 Adaptive Control Models.- Preliminaries.- Nonstationary Value-Iteration.- The Principle of Estimation and Control.- Adaptive Policies.- 2.6 Nonparametric Adaptive Control.- The Parametric Approach.- New Setting.- The Empirical Distribution Process.- Nonparametric Adaptive Policies.- 2.7 Comments and References.- 3 Average Reward Criterion.- 3.1 Introduction.- Summary.- 3.2 The Optimality Equation.- 3.3 Ergodicity Conditions.- 3.4 Value Iteration.- Uniform Approximations.- Successive Averagings.- 3.5 Approximating Models.- 3.6 Nonstationary Value Iteration.- Nonstationary Successive Averagings.- Discounted-Like NVI.- 3.7 Adaptive Control Models.- Preliminaries.- The Principle of Estimation and Control (PEC).- Nonstationary Value Iteration (NVI).- 3.8 Comments and References.- 4 Partially Observable Control Models.- 4.1 Introduction.- Summary.- 4.2 PO-CM: Case of Known Parameters.- The PO Control Problem.- 4.3 Transformation into a CO Control Problem.- I-Policies.- The New Control Model.- 4.4 Optimal I-Policies.- 4.5 PO-CM's with Unknown Parameters.- PEC and NVI I-Policies.- 4.6 Comments and References.- 5 Parameter Estimation in MCM's.- 5.1 Introduction.- Summary.- 5.2 Contrast Functions.- 5.3 Minimum Contrast Estimators.- 5.4 Comments and References.- 6 Discretization Procedures.- 6.1 Introduction.- Summary.- 6.2 Preliminaries.- 6.3 The Non-Adaptive Case.- A Non-Recursive Procedure.- A Recursive Procedure.- 6.4 Adaptive Control Problems.- Preliminaries.- Discretization of the PEC Adaptive Policy.- Discretization of the NVI Adaptive Policy.- 6.5 Proofs.- The Non-Adaptive Case.- The Adaptive Case.- 6.6 Comments and References.- Appendix A. Contraction Operators.- Appendix B. Probability Measures.- Total Variation Norm.- Weak Convergence.- Appendix C. Stochastic Kernels.- Appendix D. Multifunctions and Measurable Selectors.- The Hausdorff Metric.- Multifunctions.- References.- Author Index.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Discrete-time controlled Markov processes with average cost criterion: a survey
Aristotle Arapostathis,Vivek S. Borkar,E. Fernandez-Gaucherand,Mrinal K. Ghosh,Steven I. Marcus +4 more
TL;DR: A survey of the average cost control problem for discrete-time Markov processes can be found in this paper, where the authors have attempted to put together a comprehensive account of the considerable research on this problem over the past three decades.
The Capacity of Channels With Feedback
TL;DR: A general feedback channel coding theorem based on Massey's concept of directed information is proved and the average cost optimality equation (ACOE) can be viewed as an implicit single-letter characterization of the capacity.
Chapter 8 Markov decision processes
Martin L. Puterman
- 01 Jan 1990
TL;DR: In this article, the authors present theory, applications, and computational methods for Markov Decision Processes (MDPs) and provide an optimality equation that characterizes the supremal value of the objective function, characterizing the form of an optimal policy, and developing efficient computational procedures for finding policies thatare optimal or close to optimal.
277
Reinforcement Learning Versus Model Predictive Control: A Comparison on a Power System Problem
Damien Ernst,Mevludin Glavic,Florin Capitanescu,Louis Wehenkel +3 more
- 01 Apr 2009
TL;DR: This paper compares reinforcement learning with model predictive control in a unified framework and reports experimental results of their application to the synthesis of a controller for a nonlinear and deterministic electrical power oscillations damping problem.
An optimal one-way multigrid algorithm for discrete-time stochastic control
C.-S. Chow,John N. Tsitsiklis +1 more
TL;DR: It is shown that the one-way multigrid algorithm improves upon the complexity of its single-grid variant and is, in a certain sense, optimal.
209
References
•Book
Semigroups of Linear Operators and Applications to Partial Differential Equations
Amnon Pazy
- 11 Feb 1992
TL;DR: In this article, the authors considered the generation and representation of a generator of C0-Semigroups of Bounded Linear Operators and derived the following properties: 1.1 Generation and Representation.
14.2K
•Book
Regular and Stochastic Motion
Allan J. Lichtenberg,Michael A. Lieberman +1 more
- 21 Jan 2013
3.9K
•Book
Compressible fluid flow and systems of conservation laws in several space variables
Andrew J. Majda
- 01 Jan 1984
TL;DR: In this paper, the authors describe the ecoulement of chocs as compressible and stabilite, and use it to detect fluides and to compressible choc.
1.8K
•Book
Applications of Centre Manifold Theory
Jack Carr
- 28 Dec 2011
TL;DR: In this paper, the authors present an approach for solving the panel flutter problem using a Second Order Equation (SOPE) and a Semigroup Theory. But their approach is limited to the case when the case is 1 < 0 and the case where 0 < 0.
1.6K
Boundary Value Problems of Mathematical Physics
Grant B. Gustafson,Calvin H. Wilcox +1 more
- 01 Jan 1998
TL;DR: In this article, the boundary value problems of mathematical physics can be solved by the methods of the preceding chapters by solving a variety of specific problems that illustrate the principal types of problems that were formulated in Chapter 7.
1.1K
Related Papers (5)
Onésimo Hernández-Lerma,Jean B. Lasserre +1 more
- 22 Jun 1999
Dimitri P. Bertsekas,Steven E. Shreve +1 more
- 01 Feb 2007
P. Whittle,M. L. Puterman +1 more