Batch Process Control Based on Twin-actor Deep Deterministic Policy Gradient Algorithm

MA Junwei; XU Chen; TAO Hongfeng; YANG Huizhong

doi:10.13976/j.cnki.xk.2023.2488

MA Junwei, XU Chen, TAO Hongfeng, YANG Huizhong. Batch Process Control Based on Twin-actor Deep Deterministic Policy Gradient Algorithm[J]. INFORMATION AND CONTROL, 2023, 52(6): 773-783, 810. DOI: 10.13976/j.cnki.xk.2023.2488

Citation:

Batch Process Control Based on Twin-actor Deep Deterministic Policy Gradient Algorithm

Graphical Abstract

Graphical Abstract

Abstract

Abstract

We propose a batch process control scheme without a process model by combining reinforcement learning (RL) to solve the problem that conventional model-based control methods have inaccurate models because of their complex nonlinear dynamics when dealing with batch process tasks, which affects control performance. First, the method solves the problem of high estimation of the value function in deep RL algorithms by the structure of twin-actor parallel training to improve the learning efficiency of the algorithm. Second, an independent experience pool is established for each actor to maintain the independence of the twin actors. Furthermore, a novel reward function is established for the RL controller to guide the process back to the predetermined trajectory; we mitigate the temporal difference (TD) error accumulation problem in parameter updating by introducing a delayed policy update method. Finally, the effectiveness of the controller based on the twin-actor deep deterministic policy gradient algorithm for batch process control is demonstrated by simulating the penicillin fermentation process.

FullText(HTML)

References (35)

Cited By

Batch Process Control Based on Twin-actor Deep Deterministic Policy Gradient Algorithm

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content