Optimal Elevator Group Control via Deep Asynchronous Actor–Critic Learning

doi:10.1109/TNNLS.2020.2965208

Journal Article•10.1109/TNNLS.2020.2965208

Qinglai Wei, +3 more

- 13 Feb 2020

- IEEE Transactions on Neural Networks and Learning Systems

- Vol. 31, Iss: 12, pp 5245-5256

69

TL;DR: The optimal control law of elevator group control systems (EGCSs) is designed via a new deep reinforcement learning (RL) method, so that the elevator system delivers passengers to their destination floors as quickly as possible and the average waiting time in a complex building environment is reduced.

Citations

Journal Article•10.1109/TSMC.2020.3042876

Adaptive Dynamic Programming for Control: A Survey and Recent Advances

Derong Liu, +4 more

- 01 Jan 2021

- IEEE Transactions on Systems, Man, and C...

TL;DR: In this article, adaptive dynamic programming (ADP) and its applications in control are reviewed, and the use of ADP to solve game problems, mainly nonzero-sum game problems, is elaborated.

500

Journal Article•10.1109/TNNLS.2021.3056444

Data-Driven Performance-Prescribed Reinforcement Learning Control of an Unmanned Surface Vehicle

Ning Wang, +2 more

- 19 Feb 2021

- IEEE Transactions on Neural Networks and Learning Systems

TL;DR: In this paper, a data-driven performance-prescribed reinforcement learning control (DPRLC) scheme is created to pursue control optimality and prescribed tracking accuracy simultaneously. By devising a state transformation with prescribed performance, constrained tracking errors are substantially converted into constraint-free stabilization of tracking errors with unknown dynamics.

186

Journal Article•10.1109/JAS.2020.1003426

Parallel control for optimal tracking via adaptive dynamic programming

Jingwei Lu, +2 more

- 26 Oct 2020

- IEEE/CAA Journal of Automatica Sinica

TL;DR: It is proven that the optimal parallel control with the augmented performance index function can be seen as the suboptimal state feedback control with the traditional performance index function.

127

Journal Article•10.1109/TCYB.2020.2979614

Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems

Qinglai Wei, +3 more

- 15 Apr 2021

- IEEE Transactions on Systems, Man, and C...

TL;DR: A novel distributed policy iteration algorithm is established for infinite-horizon optimal control problems of continuous-time nonlinear systems. It improves the iterative control laws one by one, instead of updating all the control laws in each iteration as traditional policy iteration algorithms do.

107

Journal Article•10.1007/s10514-022-10034-z

Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning

Asad Ali Shahid, +3 more

- 09 Feb 2022

- Autonomous Robots

TL;DR: In this paper, a learning-based method is presented that uses simulation data to learn an object manipulation task with two model-free reinforcement learning (RL) algorithms. Learning performance is compared across an on-policy and an off-policy algorithm: Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC), respectively.

52

References

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

99K

Journal Article•10.1038/NATURE14539

Deep learning

Yann LeCun, +4 more

- 28 May 2015

- Nature

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.

67K

Book Chapter•10.1007/978-3-030-13073-2_10

Sediment Transport and Movable Beds

Oscar Castro-Orgaz, +1 more

- 01 Jan 2019

TL;DR: In this article, sediment is either loaded as bed-load with particles sliding, saltating, and rolling over the river bed, or as a suspended-load, where particles move with the turbulent water flow away from the bed.

43K

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

- 01 Jan 1998

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

39.7K

Automatic differentiation in PyTorch

Adam Paszke, +9 more

- 28 Oct 2017

TL;DR: An automatic differentiation module of PyTorch is described — a library designed to enable rapid research on machine learning models that focuses on differentiation of purely imperative programs, with a focus on extensibility and low overhead.

17.1K