基于TD3-PID-VMC动态混合控制双足轮腿机器人运动优化方法

Motion Optimization Method for Bipedal Wheel-legged Robots Based on TD3-PID-VMC Dynamic Hybrid Control

  • 摘要: 为实现双足轮腿机器人在复杂地形下的高精度、强鲁棒运动控制,本文提出一种基于TD3-PID-VMC动态混合控制方法。首先,基于对称式四连杆闭环结构设计并构建双足轮腿机器人运动学模型,实现轮式滚动与腿式支撑的协同控制;其次,引入设定值加权策略改进串级比例-微分-积分(PID)控制器以提升轮毂电机的快速响应与稳态转速调节能力,并利用虚拟模型控制(VMC)优化系统模型参数,从而增强运动性能与环境适应性;最后,设计双延迟深度确定性策略梯度(TD3)驱动的混合权重动态分配模块,通过强化学习离线训练获得最优融合策略,并在实际控制中在线部署,实现PID与VMC权重的动态自适应调整,以适应不同地形和运动状态的需求。仿真与实物实验结果表明,在多模式轨迹跟踪任务中,本文方法的均方根误差(RMSE)较固定权重PID-VMC方法和LQR-VMC方法分别降低约32.9%和28.7%,在突发冲击扰动下恢复时间小于3.2s。本文方法有效提升了双足轮腿机器人在静态平衡、抗扰恢复与轨迹跟踪任务中的稳定性与精度。

     

    Abstract: To achieve high-precision and robust motion control for bipedal wheel-legged robots on complex terrains, we propose a TD3-PID-VMC dynamic hybrid control method. First, based on a symmetric four-link closed-loop structure, the kinematic model of the bipedal wheel-legged robot is designed and constructed to achieve coordinated control of wheel rolling and leg support. Then, a setpoint weighting strategy is introduced to improve the cascade Proportional-Integral-Derivative (PID) controller, enhancing the fast response and steady-state speed regulation capability of the hub motors, while Virtual Model Control (VMC) is utilized to optimize system model parameters, thereby improving motion performance and environmental adaptability. Finally, a hybrid weight dynamic allocation module driven by the twin delayed deep deterministic policy gradient (TD3) algorithm is designed. The optimal fusion strategy is obtained through offline reinforcement learning training and deployed online in real-world control, enabling dynamic and adaptive adjustment of the PID and VMC weights to accommodate diverse terrains and motion states. Simulation and hardware experimental results demonstrate that in multi-mode trajectory tracking tasks, the proposed method reduces the root mean square error (RMSE) by approximately 32.9% and 28.7% compared to the fixed-weight PID-VMC method and the LQR-VMC method, respectively, with a recovery time of less than 3.2 s under sudden impact disturbances. The proposed method effectively enhances the stability and precision of the bipedal wheel-legged robot in static balance, disturbance recovery, and trajectory tracking tasks.

     

/

返回文章
返回