Value-Based Continuous Control Without Concrete State-Action Value Function.

doi:10.1007/978-3-030-78811-7_34

Book Chapter10.1007/978-3-030-78811-7_34

Value-Based Continuous Control Without Concrete State-Action Value Function.

- 17 Jul 2021

- pp 352-364

TL;DR: In this article, the actor-critic method is proposed to implement value-based continuous control in an effective but compromise way, where actions with higher expected return (state-action value, also as Q) will be selected as the action decision.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

References

•Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

- 01 Jan 1988

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

39.7K

•Book

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Martin L. Puterman

- 15 Apr 1994

TL;DR: Puterman as discussed by the authors provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models, focusing primarily on infinite horizon discrete time models and models with discrete time spaces while also examining models with arbitrary state spaces, finite horizon models, and continuous time discrete state models.

...read moreread less

12.3K

•Journal Article•10.1038/NATURE24270

Mastering the game of Go without human knowledge

David Silver, +16 more

- 19 Oct 2017

- Nature

TL;DR: An algorithm based solely on reinforcement learning is introduced, without human data, guidance or domain knowledge beyond game rules, that achieves superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.

...read moreread less

11.1K