Journal Article10.1109/TNNLS.2020.2965208
Optimal Elevator Group Control via Deep Asynchronous Actor–Critic Learning
69
TL;DR: The optimal control law of EGCSs is designed via a new deep RL method, such that the elevator system sends passengers to the desired destination floors as soon as possible, and the average waiting time in a complex building environment is reduced.
read more
Abstract: In this article, a new deep reinforcement learning (RL) method, called asynchronous advantage actor–critic (A3C) method, is developed to solve the optimal control problem of elevator group control systems (EGCSs). The main contribution of this article is that the optimal control law of EGCSs is designed via a new deep RL method, such that the elevator system sends passengers to the desired destination floors as soon as possible. Deep convolutional and recurrent neural networks, which can update themselves during applications, are designed to dispatch elevators. Then, the structure of the A3C method is developed, and the training phase for the learning optimal law is discussed. Finally, simulation results illustrate that the developed method effectively reduces the average waiting time in a complex building environment. Comparisons with traditional algorithms further verify the effectiveness of the developed method.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Adaptive Dynamic Programming for Control: A Survey and Recent Advances
TL;DR: In this article, the adaptive dynamic programming (ADP) with applications in control is reviewed, and the use of ADP to solve game problems, mainly nonzero-sum game problems is elaborated.
500
Data-Driven Performance-Prescribed Reinforcement Learning Control of an Unmanned Surface Vehicle.
Ning Wang,Ying Gao,Xuefeng Zhang +2 more
TL;DR: In this paper, a data-driven performance-prescribed reinforcement learning control (DPRLC) scheme is created to pursue control optimality and prescribed tracking accuracy simultaneously, by devising state transformation with prescribed performance, constrained tracking errors are substantially converted into constraint free stabilization of tracking errors with unknown dynamics.
186
Parallel control for optimal tracking via adaptive dynamic programming
TL;DR: It is proven that the optimal parallel control with the augmented performance index function can be seen as the suboptimal state Feedback Control with the traditional performance indexfunction.
127
Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems
TL;DR: A novel distributed policy iteration algorithm is established for infinite horizon optimal control problems of continuous-time nonlinear systems to improve the iterative control law one by one, instead of updating all the control laws in each iteration of the traditional policy iteration algorithms.
107
Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning
TL;DR: In this paper , a learning-based method that uses simulation data to learn an object manipulation task using two model-free reinforcement learning (RL) algorithms is presented, where the learning performance is compared across on-policy and off-policy algorithms: Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC).
52
References
Long short-term memory
TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
99K
Deep learning
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
67K
Sediment Transport and Movable Bed s
Oscar Castro-Orgaz,Willi H. Hager +1 more
- 01 Jan 2019
TL;DR: In this article, sediment is either loaded as bed-load with particles sliding, saltating, and rolling over the river bed, or as a suspended-load, where particles move with the turbulent water flow away from the bed.
43K
•Book
Reinforcement Learning: An Introduction
Richard S. Sutton,Andrew G. Barto +1 more
- 01 Jan 1988
TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Automatic differentiation in PyTorch
Adam Paszke,Sam Gross,Soumith Chintala,Gregory Chanan,Edward Z. Yang,Zachary DeVito,Zeming Lin,Alban Desmaison,Luca Antiga,Adam Lerer +9 more
- 28 Oct 2017
TL;DR: An automatic differentiation module of PyTorch is described — a library designed to enable rapid research on machine learning models that focuses on differentiation of purely imperative programs, with a focus on extensibility and low overhead.