强化学习下机械臂运动规划指标优化方法综述

胡江瑜; 马珺杰; 黄德青; 李亚楠; 李展; 杨勇

doi:10.13976/j.cnki.xk.2025.1922

强化学习下机械臂运动规划指标优化方法综述

Review on Optimization Methods for Manipulator Motion Planning Metrics under Reinforcement Learning

摘要

摘要: 随着关节型机械臂在工业制造和运维检修中的广泛应用，运动规划作为其核心功能备受关注。相较于人工势场法、采样法、启发式法等经典运动规划方法带来的高计算量、低收敛率问题，强化学习（RL）凭借其高效性、泛化性和处理复杂问题的能力，成为运动规划领域的研究热点。尽管基于RL的运动规划算法已大量涌现，但仍需认真思考如何分类现有成果并精确评估运动规划指标优化策略。首先，提出机械臂运动规划中所涉及到的技术检索策略与方法；然后，综述结构化环境和非结构化环境下，RL及融合算法解决规划中的高维空间解算、安全性能权衡、模拟到现实等性能指标方法；最后，面向非结构化环境下的新需求，评述运动规划领域的最新成果和发展方向。

Abstract: With the widespread adoption of articulated manipulators in industrial manufacturing, maintenance, and repair operations, motion planning has garnered significant attention as an essential functionality. Compared to classical motion planning techniques, such as artificial potential fields, sampling-based methods, and heuristic algorithms, which often suffer from high computational demands and low convergence rates, reinforcement learning (RL) offers compelling advantages. Its inherent efficiency, generalizability, and capability in handling complex problems have positioned RL as a prominent research focus within the motion planning domain. While numerous RL-based motion planning algorithms have emerged, a systematic classification of existing contributions and a precise evaluation of optimization strategies for planning metrics remain crucial. Firstly, we introduce the review methodology and search strategy and methods for relevant technologies. Subsequently, we comprehensively analyze how RL and fusion algorithms address key performance metrics, including high-dimensional space computation, safety-performance trade-offs, and sim-to-real transfer, across both structured and unstructured environments. Finally, we discuss the latest advancements and future research directions that will specifically address emerging needs in unstructured environments.

HTML全文

参考文献(128)

施引文献

资源附件(0)