WANG Cong, ZHANG Ziyang, CHEN Yanzhuang, ZHANG Qifeng, LI Shuo, WANG Xiaohui, WANG Sen. Deep Reinforcement Learning and Multi-Parameter Domain Randomization Based Underwater Adaptive Grasping Research for Underwater Manipulator[J]. INFORMATION AND CONTROL, 2022, 51(6): 651-661. DOI: 10.13976/j.cnki.xk.2022.1443
Citation: WANG Cong, ZHANG Ziyang, CHEN Yanzhuang, ZHANG Qifeng, LI Shuo, WANG Xiaohui, WANG Sen. Deep Reinforcement Learning and Multi-Parameter Domain Randomization Based Underwater Adaptive Grasping Research for Underwater Manipulator[J]. INFORMATION AND CONTROL, 2022, 51(6): 651-661. DOI: 10.13976/j.cnki.xk.2022.1443

Deep Reinforcement Learning and Multi-Parameter Domain Randomization Based Underwater Adaptive Grasping Research for Underwater Manipulator

  • This study proposes a general control system for underwater manipulation, which combines deep reinforcement learning and domain randomization for autonomous underwater manipulation of underwater manipulators. First, a reinforcement learning-based robot control system is established. Subsequently, multi-parameter domain randomization is used to improve the policy robustness and transferring effectiveness, including parameters of manipulator dynamics, hydrodynamic parameters, and noise and delay of state and action spaces. Finally, the trained policy is deployed on a new simulation environment and real underwater arm. The experimental results verify the validity of the proposed method and lay a foundation for autonomous manipulation in the real deep-sea environment in the future.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return