Dynamic programming and stochastic control processes

doi:10.1016/S0019-9958(58)80003-0

Open AccessJournal Article10.1016/S0019-9958(58)80003-0

Dynamic programming and stochastic control processes

Richard Bellman

- 01 Sep 1958

- Information & Computation

- Vol. 1, Iss: 3, pp 228-239

221

TL;DR: It is shown how the functional equation technique of dynamic programming may be used to obtain a new computational and analytic approach to problems of this genre.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/TAC.1959.1104847

On adaptive control processes

Richard Bellman, +1 more

- 01 Nov 1959

- Ire Transactions on Automatic Control

TL;DR: The purpose of this paper is to show how the functional equation technique of a new mathematical discipline, dynamic programming, can be used in the formulation and solution of a variety of optimization problems concerning the design of adaptive devices.

...read moreread less

343

•Book

Quality-Driven Query Answering for Integrated Information Systems

Felix Naumann

- 27 Feb 2002

TL;DR: This paper presents a meta-modelling framework that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of planning and executing quality-driven queries.

...read moreread less

306

Journal Article•10.1016/J.TRC.2013.12.005

Transit network design based on travel time reliability

Baozhen Yao, +4 more

- 01 Jun 2014

- Transportation Research Part C-emerging ...

TL;DR: In this paper, a robust transit network optimization method, in which travel time reliability on road is considered, is presented, where a robust optimization model, taking into account the stochastic travel time, is formulated to satisfy the demand of passengers and provide reliable transit service.

...read moreread less

208

Journal Article•10.1016/j.eswa.2023.120495

Reinforcement Learning Algorithms: A brief survey

G. Pillai, +1 more

- 01 May 2023

- Expert systems with applications

TL;DR: Reinforcement Learning (RL) is a machine learning technique to learn sequential decision-making in complex problems as mentioned in this paper , which can learn an optimal policy autonomously with knowledge obtained by continuous interaction with a stochastic dynamical environment.

...read moreread less

166

Journal Article•10.1109/TETCI.2017.2669104

Evolutionary Many-Objective Optimization of Hybrid Electric Vehicle Control: From General Optimization to Preference Articulation

Ran Cheng, +4 more

- 14 Feb 2017

TL;DR: A case study of solving a many-objective hybrid electric vehicle controller design problem using three state-of-the-art evolutionary algorithms, namely, a decomposition based evolutionary algorithm (MOEA/D), a non-dominated sorting based genetic algorithm (NSGA-III), and a reference vector guided evolutionary algorithms (RVEA).

...read moreread less

116

...

Expand

References

•Journal Article•10.1090/S0002-9904-1952-09620-8

Some aspects of the sequential design of experiments

Herbert Robbins

- 01 Sep 1952

- Bulletin of the American Mathematical So...

TL;DR: The authors proposed a theory of sequential design of experiments, in which the size and composition of the samples are not fixed in advance but are functions of the observations themselves, which is a major advance.

...read moreread less

2.5K

•Journal Article•10.1090/QAM/78516

On the “bang-bang” control problem

Richard Bellman, +2 more

- 01 Jan 1956

- Quarterly of Applied Mathematics

TL;DR: In this paper, the authors considered the case where all the solutions of Z = Az approach zero as t approaches infinity, and the problem of choosing f so as to reduce z to 0 in minimum time.

...read moreread less

322

•Journal Article•10.1214/AOMS/1177731234

The Elementary Gaussian Processes

J. L. Doob

- 01 Sep 1944

- Annals of Mathematical Statistics

213

Journal Article•10.1007/BF02849266

On some variational problems occurring in the theory of dynamic programming

Richard Bellman, +2 more

- 01 Sep 1954

- Rendiconti Del Circolo Matematico Di Pal...

TL;DR: In this article, the authors investigated a class of interesting and important variational problems involving the control of a physical system over a time interval, including maintenance of a dynamic system in or near a specified state at minimum cost and maximising the output of a system given a limited quantity of resources.

...read moreread less

23

On communication processes involving learning and random duration.

Richard Ernest Bellman, +1 more

- 01 Jan 1958

TL;DR: The fundamental problem of determining the utility of a communication channel in conveying information is viewed as a problem within the framework of multistage decision processes of stochastic type, and as such is treated by the theory of dynamic programming.

...read moreread less

22

Dynamic programming and stochastic control processes

Chat with Paper

AI Agents for this Paper

Citations

On adaptive control processes

Quality-Driven Query Answering for Integrated Information Systems

Transit network design based on travel time reliability

Reinforcement Learning Algorithms: A brief survey

Evolutionary Many-Objective Optimization of Hybrid Electric Vehicle Control: From General Optimization to Preference Articulation

References

Some aspects of the sequential design of experiments

On the “bang-bang” control problem

The Elementary Gaussian Processes

On some variational problems occurring in the theory of dynamic programming

On communication processes involving learning and random duration.

Related Papers (5)

Controlled Markov processes and viscosity solutions

Reinforcement Learning: An Introduction

Dynamic Programming and Optimal Control

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Human-level control through deep reinforcement learning