Learning-based joint UAV trajectory and power allocation optimization for secure IoT networks

Digital Communications and Networks(2022)

引用 12|浏览2
暂无评分
摘要
Non-Orthogonal Multiplex Access (NOMA) can be deployed in Unmanned Aerial Vehicle (UAV) networks to improve spectrum efficiency. Due to the broadcasting feature of NOMA-UAV networks, it is essential to focus on the security of the wireless system. This paper focuses on maximizing the secrecy sum rate under the constraint of the achievable rate of the legitimate channels. To tackle the non-convexity optimization problem, a reinforcement learning-based alternative optimization algorithm is proposed. Firstly, with the help of successive convex approximations, the optimal power allocation scheme with a given UAV trajectory is obtained by using convex optimization tools. Afterwards, through plenty of explorations of the wireless environment, the Q-learning networks approach the optimal location transition strategy of the UAV, even without the wireless channel state information.
更多
查看译文
关键词
Unmanned aerial vehicle (UAV),NOMA,Reinforcement learning,Secure communications,Deep Q-learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要