Markov Decision Processes.

doi:10.2307/2348465

Journal Article10.2307/2348465

Markov Decision Processes.

Stephen Brooks, +1 more

- 01 Jan 1995

- The Statistician

- Vol. 44, Iss: 2, pp 292

132

About: This article is published in The Statistician. The article was published on 01 Jan 1995. The article focuses on the topics: Markov decision process.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Martin L. Puterman

- 15 Apr 1994

TL;DR: Puterman as discussed by the authors provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models, focusing primarily on infinite horizon discrete time models and models with discrete time spaces while also examining models with arbitrary state spaces, finite horizon models, and continuous time discrete state models.

...read moreread less

12.3K

Journal Article•10.1007/S10107-010-0393-3

Risk-averse dynamic programming for Markov decision processes

Andrzej Ruszczyński

- 01 Oct 2010

- Mathematical Programming

TL;DR: The concept of a Markov risk measure is introduced and it is used to formulate risk-averse control problems for two Markov decision models: a finite horizon model and a discounted infinite horizon model.

...read moreread less

541

•Journal Article•10.1007/S11009-006-9753-0

The Cross-Entropy Method for Continuous Multi-Extremal Optimization

Dirk P. Kroese, +2 more

- 23 Oct 2006

- Methodology and Computing in Applied Pro...

TL;DR: The effectiveness of the cross-entropy method for solving difficult continuous multi-extremal optimization problems, including those with non-linear constraints, is demonstrated.

...read moreread less

292

Journal Article•10.1109/TCOMM.2013.052013.120565

Transmission Policies for Energy Harvesting Sensors with Time-Correlated Energy Supply

Nicolo Michelusi, +2 more

- 03 Jun 2013

- IEEE Transactions on Communications

TL;DR: This paper considers a wireless sensor powered by an energy harvesting device, which reports data of varying importance to its receiver, and derives the performance of the Balanced Policy (BP), which adapts the transmission probability to the harvesting state, such that energy harvesting and consumption are balanced.

...read moreread less

160

Proceedings Article•10.1109/SCC.2015.19

MDP and Machine Learning-Based Cost-Optimization of Dynamic Resource Allocation for Network Function Virtualization

Runyu Shi, +8 more

- 27 Jun 2015

TL;DR: Markov Decision Process (MDP) is applied to the NP-hard problem to dynamically allocate cloud resources for NFV components and Bayesian learning method is applications to monitor the historical resource usage in order to predict future resource reliability.

...read moreread less

91