Dynamic programming in constrained Markov decision processes

Open AccessJournal Article

Dynamic programming in constrained Markov decision processes

Alexey Piunovskiy

- 01 Jan 2006

- Control and Cybernetics

- Vol. 35, Iss: 3, pp 645-660

30

TL;DR: It is shown that the problem can be reformulated as a standard MDP and solved using the Dynamic Programming approach, and an example on arolled queue is presented.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/ACC.2013.6579868

Stochastic optimal control with dynamic, time-consistent risk constraints

Yinlam Chow, +1 more

- 17 Jun 2013

TL;DR: A dynamic programming approach to stochastic optimal control problems with dynamic, time-consistent risk constraints, which allows to compute the optimal costs by value iteration and a procedure to construct optimal policies.

...read moreread less

28

Journal Article•10.1007/S11750-011-0186-8

Convex analytic approach to constrained discounted Markov decision processes with non-constant discount factors

Yi Zhang

- 01 Jul 2013

- Top

TL;DR: In this article, the authors developed the convex analytic approach to a discounted discrete-time Markov decision process (DTMDP) in Borel state and action spaces with N constraints, and proved that every extreme point of the space of occupation measures can be generated by a deterministic stationary policy for the DTMDP.

...read moreread less

17

•Journal Article•10.1016/J.AUTOMATICA.2019.108582

Constrained discounted Markov decision processes with Borel state spaces

Eugene A. Feinberg, +2 more

- 01 Jan 2020

- Automatica

TL;DR: In this article, the authors studied discrete-time discounted constrained Markov decision processes (CMDPs) with Borel state and action spaces and provided general assumptions under which the optimization problems in CMDPs are solvable in the class of randomized stationary policies.

...read moreread less

16

•Posted Content

A Martingale Approach and Time-Consistent Sampling-based Algorithms for Risk Management in Stochastic Optimal Control

Vu Anh Huynh, +2 more

- 29 Dec 2013

- arXiv: Systems and Control

TL;DR: In this article, a martingale approach is proposed to construct time-consistent control policies for stochastic optimal control problems with risk constraints that are expressed as bounded probabilities of failure for particular initial states.

...read moreread less

14

•Posted Content

Stochastic Optimal Control With Dynamic, Time-Consistent Risk Constraints

Yinlam Chow, +1 more

- 22 Nov 2015

- arXiv: Optimization and Control

TL;DR: In this article, a dynamic programing approach to stochastic optimal control problems with dynamic, time-consistent risk constraints is presented, which allows to compute the optimal costs by value iteration.

...read moreread less

12

...

Expand

References

•Book

Constrained Markov Decision Processes

Eitan Altman

- 30 Mar 1999

TL;DR: In this paper, a unified approach for the study of constrained Markov decision processes with a countable state space and unbounded costs is presented, where a single controller has several objectives; it is desirable to design a controller that minimize one of cost objectives, subject to inequality constraints on other cost objectives.

...read moreread less

1.9K

Journal Article•10.1109/TAC.2004.826725

Dynamic programming equations for discounted constrained stochastic control

R. C. Chen, +1 more

- 18 May 2004

- IEEE Transactions on Automatic Control

TL;DR: The application of the dynamic programming approach to constrained stochastic control problems with expected value constraints is demonstrated and optimality equations are obtained for these problems.

...read moreread less

55

Journal Article•10.1016/S0167-6377(00)00039-0

Constrained Markovian decision processes: the dynamic programming approach

Alexey Piunovskiy, +1 more

- 01 Oct 2000

- Operations Research Letters

TL;DR: The main result is the constructive development of optimal strategy with the help of the dynamic programming method in semicontinuous controlled Markov models in discrete time with total expected losses.

...read moreread less

46

•Journal Article•10.1016/0022-247X(91)90037-Z

On discounted dynamic programming with constraints

Kensuke Tanaka

- 01 Feb 1991

- Journal of Mathematical Analysis and App...

TL;DR: This paper introduces the Lagrangian programming problem corresponding to the original one, and proves the existence of an optimal solution for thelagrangian problem.

...read moreread less

22

Journal Article•10.1029/WR016I002P00271

A variance‐constrained reservoir control problem

Moshe Sniedovich

- 01 Apr 1980

- Water Resources Research

TL;DR: In this article, the variance constraint is incorporated as a penalty term in a nonseparable Lagrangian problem which is solved by a two-stage procedure, and the optimal solution to the non-separable problem is found by a simple search algorithm.

...read moreread less

18