张宏达, 李德才, 何玉庆. 基于不完备信息预测的多智能体分布式协同[J]. 信息与控制, 2024, 53(1): 86-97. DOI: 10.13976/j.cnki.xk.2023.2506
引用本文: 张宏达, 李德才, 何玉庆. 基于不完备信息预测的多智能体分布式协同[J]. 信息与控制, 2024, 53(1): 86-97. DOI: 10.13976/j.cnki.xk.2023.2506
ZHANG Hongda, LI Decai, HE Yuqing. Multi-agent Distributed Cooperation Based on Incomplete Information Prediction[J]. INFORMATION AND CONTROL, 2024, 53(1): 86-97. DOI: 10.13976/j.cnki.xk.2023.2506
Citation: ZHANG Hongda, LI Decai, HE Yuqing. Multi-agent Distributed Cooperation Based on Incomplete Information Prediction[J]. INFORMATION AND CONTROL, 2024, 53(1): 86-97. DOI: 10.13976/j.cnki.xk.2023.2506

基于不完备信息预测的多智能体分布式协同

Multi-agent Distributed Cooperation Based on Incomplete Information Prediction

  • 摘要: 为了解决部分可观对抗环境中多智能体协同决策难题,受人大脑皮层通过记忆进行学习和推理功能启发,提出一种新的部分可观对抗环境下基于不完备信息预测的多智能体分布式协同决策框架。该框架可采用支持向量回归等多种预测方法通过历史记忆和当前观察信息对环境中不可见信息进行预测,并将预测信息和观察到的信息融合,作为协同决策的依据;再通过分布式多智能体强化学习进行协同策略学习得到团队中每个智能体的决策模型。使用该框架结合多种预测算法在典型的部分可观对抗环境中进行了多智能体协同决策的验证。结果表明,提出的框架对多种预测算法具有普适性,且在保证对不可见部分高预测精度时能将多智能体协同决策水平提升23.4%。

     

    Abstract: To solve the problem of multi-agent cooperative decision-making in a partially observable adversarial environment inspired by the learning and reasoning functions of the human cerebral cortex through memory, a new multi-agent distributed cooperative decision-making framework based on incomplete information prediction in a partially observable adversarial environment is proposed. The framework can use support vector regression and other prediction methods to predict invisible information in the environment through historical memory and current observed information and fuse the predicted information and the observed information as a basis of cooperative decision-making; Then, cooperative strategy learning is performed through distributed multi-agent reinforcement learning to obtain the decision-making model of each agent in the team. Multi-agent cooperative decision-making is verified in a typical partially observable adversarial environment using this framework and various prediction algorithms. The results show that the proposed framework is universal to various prediction algorithms and can improve the level of multi-agent cooperative decision-making by 23.4% while ensuring high prediction accuracy for invisible parts.

     

/

返回文章
返回