A path following controller for deep-sea mining vehicles considering slip control and random resistance based on improved deep deterministic policy gradient

OCEAN ENGINEERING(2023)

引用 1|浏览6
暂无评分
摘要
This study aimed to develop a deep-sea mining vehicle (DSMV) path-following controller that could better reflect the actual deep-sea mining conditions. First, the dynamic model of the DSMV was improved. By introducing a nonlinear slip-control model and random environmental noise resistance, the controlled plant was developed to be closer to the actual mining operation condition. Second, an improved deep deterministic policy gradient (IDDPG) algorithm was proposed. Compared to the standard DDPG algorithm, the improved algorithm reduces the overestimation of the Q value and enhances the ability of an agent to explore the global optimum. A warm-up stage was introduced to improve stability at the beginning of training and accelerate the convergence speed of training. Third, a general reward function was designed for this type of problem. Combined with the uncertainty of the improved model, the generalization ability and adaptability to the unknown environment of the controller could be improved. Finally, through a random one-point-following training test in the simulation environment and different path-following comparison tests, the path-following control ability of the controller was verified.
更多
查看译文
关键词
Deep-sea mining vehicle,Path following,Improved deep deterministic policy gradient,Slip control,Deep reinforcement learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要