An Improved Adaptive Dynamic Programming Algorithm Based on Fuzzy Extended State Observer for Dissolved Oxygen Concentration Control
TL;DR: In this article, an extended state observer (ESO) based on the Takagi-Sugeno (T-S) fuzzy model is designed to estimate the system state and total disturbance.
read more
Abstract: To solve the anti-disturbance control problem of dissolved oxygen concentration in the wastewater treatment plant (WWTP), an anti-disturbance control scheme based on reinforcement learning (RL) is proposed. An extended state observer (ESO) based on the Takagi–Sugeno (T-S) fuzzy model is first designed to estimate the the system state and total disturbance. The anti-disturbance controller compensates for the total disturbance based on the output of the observer in real time, online searches the optimal control policy using a neural-network-based adaptive dynamic programming (ADP) controller. For reducing the computational complexity and avoiding local optimal solutions, the echo state network (ESN) is used to approximate the optimal control policy and optimal value function in the ADP controller. Further analysis demonstrates the observer estimation errors for system state and total disturbance are bounded, and the weights of ESNs in the ADP controller are convergent. Finally, the effectiveness of the proposed ESO-based ADP control scheme is evaluated on a benchmark simulation model of the WWTP.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Reinforcement learning applied to wastewater treatment process control optimization: Approaches, challenges, and path forward
TL;DR: A review of existing RL applications for wastewater treatment control optimization can be found in this paper , where the authors evaluate five key challenges that must be addressed prior to widespread adoption: practical RL implementation, managing data, integrating existing process models, building trust in empirical control strategies, and bridging gaps in professional training.
15
Robust Model Free Adaptive Predictive Control for Wastewater Treatment Process With Packet Dropouts
Honggui Han,Shijia Fu,Haoyuan Sun,Chenyang Wang +3 more
TL;DR: RMFAPC with PDCM effectively controls WWTP despite disturbances and packet dropouts, significantly reducing IAE and demonstrating superior robustness against disturbances compared to MFAPC.
1
References
From PID to Active Disturbance Rejection Control
TL;DR: Active disturbance rejection control is proposed, which is motivated by the ever increasing demands from industry that requires the control technology to move beyond PID, and may very well break the hold of classical PID and enter a new era of innovations.
5.9K
Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication
Herbert Jaeger,Harald Haas +1 more
TL;DR: A method for learning nonlinear systems, echo state networks (ESNs), which employ artificial recurrent neural networks in a way that has recently been proposed independently as a learning mechanism in biological brains is presented.
Reinforcement learning and adaptive dynamic programming for feedback control
Frank L. Lewis,Draguna Vrabie +1 more
TL;DR: This work describes mathematical formulations for reinforcement learning and a practical implementation method known as adaptive dynamic programming that give insight into the design of controllers for man-made engineered systems that both learn and exhibit optimal behavior.
1.5K
Sliding-Mode Control for Systems With Mismatched Uncertainties via a Disturbance Observer
Jun Yang,Shihua Li,Xinghuo Yu +2 more
TL;DR: A DOB-based SMC method is developed in this paper to counteract the mismatched disturbance and exhibits much better control performance than the baseline SMC and the integral SMC (I-SMC) methods, such as reduced chattering and nominal performance recovery.
1.2K
Adaptive critic designs
TL;DR: In this paper, the authors discuss a variety of adaptive critic designs (ACDs) for neuro-control, which are suitable for learning in noisy, nonlinear, and nonstationary environments They have common roots as generalizations of dynamic programming for neural reinforcement learning approaches.