Average Reward Reinforcement Learning Scheduling of Closed Reentrant Production Systems

LIU Chang-chun; SHEN Zhi-jiang; YU Hai-bin

LIU Chang-chun, SHEN Zhi-jiang, YU Hai-bin. Average Reward Reinforcement Learning Scheduling of Closed Reentrant Production Systems[J]. INFORMATION AND CONTROL, 2004, 33(2): 145-150.

Citation:

LIU Chang-chun, SHEN Zhi-jiang, YU Hai-bin. Average Reward Reinforcement Learning Scheduling of Closed Reentrant Production Systems[J]. INFORMATION AND CONTROL, 2004, 33(2): 145-150.

Citation:

LIU Chang-chun, SHEN Zhi-jiang, YU Hai-bin. Average Reward Reinforcement Learning Scheduling of Closed Reentrant Production Systems[J]. INFORMATION AND CONTROL, 2004, 33(2): 145-150.

Average Reward Reinforcement Learning Scheduling of Closed Reentrant Production Systems

Graphical Abstract

Graphical Abstract

Abstract

Abstract

How to schedule the closed reentrant queueing networks so as to maximize the system mean output is an intractable NP-hard problem. In this paper, a method of average reward reinforcement learning (RL) is applied to automatically find an adaptive scheduling policy by directly optimizing the mean output. Numerical study demonstrates that the RL scheduler consistently outperforms all the known priority policies.

FullText(HTML)

References (7)

Cited By

Average Reward Reinforcement Learning Scheduling of Closed Reentrant Production Systems

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content