面向工业互联网的容器级网算资源实时调度

Real Time Scheduling of Container Level Network-computing Resources for Industrial Internet

  • 摘要: 针对现有容器编排技术受限于Pod架构,难以实现对网络与计算资源的细粒度、统一调度,从而无法满足工业物联网高需求的问题。本文首先提出了一种容器级的网络资源管控机制,实现了Kubernetes集群系统细粒度的网络资源管控,并设计了基于动态规划的多时隙任务聚合算法,通过数据包容量和延迟约束,可以求得最优的数据包聚合方案,从而最小化数据包数量,进一步提升网络资源利用效率。随后,为了降低任务在Kubernetes集群系统的平均延迟并提高任务成功率,本文将任务调度与网算资源分配问题转化为马尔可夫决策过程,并在此基础上提出一种基于深度强化学习(Deep Reinforcement Learning,DRL)的网算资源协同调度算法。该算法融合了深度确定性策略梯度(Deep Deterministic Policy Gradient,DDPG)和动态规划(Dynamic Programming,DP)算法,能够有效处理高维连续的动作空间和状态空间。仿真结果表明,所提出的DDPG+DP算法能够有效地降低系统中的平均时延成本,相较于现有的任务调度资源分配方法具有更高的任务完成率。

     

    Abstract: Addressing the issue that existing container orchestration technologies are constrained by the Pod architecture, which makes it difficult to achieve fine-grained and unified scheduling of network and computing resources, thus failing to meet the high requirements of Industrial Internet of Things. This paper first proposes a container level network resource management mechanism, which achieves fine-grained network resource management of Kubernetes cluster systems, and designs a dynamic programming based multi-time slot task aggregation algorithm. By packet capacity and delay constraints, the optimal packet aggregation scheme can be obtained, thereby minimizing the number of packets and further improving network resource utilization efficiency. Subsequently, in order to reduce the average latency of tasks in Kubernetes cluster systems and improve task success rates, this paper transforms the task scheduling and network-computing resource allocation problem into a Markov decision process, and proposes a deep reinforcement learning based network-computing resource collaborative scheduling algorithm based on this. The algorithm integrates Deep Deterministic Policy Gradient (DDPG) and Dynamic Programming (DP), enabling effective handling of high-dimensional continuous action and state spaces. Simulation results demonstrate that the proposed DDPG+DP algorithm can significantly reduce the average delay cost within the system and achieves a higher task completion rate compared to existing task scheduling and resource allocation methods.

     

/

返回文章
返回