UAV Agile Navigation Method for Unknown Environment via Deep Reinforcement Learning

IECON 2023- 49th Annual Conference of the IEEE Industrial Electronics Society(2023)

引用 0|浏览6
暂无评分
摘要
This paper mainly considers the navigation problem of unmanned aerial vehicle (UAV) in an unknown environment. Traditional path planning method relies on accurate model parameters and environment maps, which has poor adaptability. Therefore, this paper adopts the deep reinforcement learning algorithm to accomplish the navigation task. The classical proximal policy optimization (PPO) algorithm lacks the perception of the correlation between UAV action and state makes difficult for the UAV to choose the optimal path, thus affecting the success rate and speed of navigation. To solve this problem, this paper adds a long short-term memory (LSTM) network to the policy and evaluation network of the PPO algorithm so that the UAV can refer to the preceding status and action information during path planning. The method is extended to three-dimensional motion space. Simulation results demonstrate that the LSTM-PPO algorithm designed in this paper can complete navigation tasks in unknown environments, and show stability in continuous state space and continuous action space. Meanwhile, compared with the PPO algorithm, the success rate of navigation and average arrival time is significantly improved.
更多
查看译文
关键词
deep reinforcement learning,proximal policy optimization,long short-term memory,UAV navigation,path planning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要