Joint Time-Frequency Anti-Jamming Communications: A Reinforcement Learning Approach

2019 11th International Conference on Wireless Communications and Signal Processing (WCSP), 2019

Abstract
This paper investigates the channel selection and transmission duration scheduling problem in a jamming environment. Although a stable communication frequency and a long transmission time reduce switching overhead and can achieve higher throughput, such transmissions are more likely to be disrupted by a jammer. Our goal is to find the optimal transmission channel and duration strategy in a jamming environment so as to maximize the long-term cumulative throughput of the system. We formulate the decision-making problem as a Markov decision process (MDP). We then propose a reinforcement learning (Q-learning) based channel selection and transmission duration scheduling algorithm. Simulation results show that the proposed algorithm significantly improves system utility compared with a reinforcement learning based algorithm that uses a fixed transmission duration.
Keywords
Anti-jamming, MDP, reinforcement learning, transmission duration
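
To make the idea in the abstract concrete, the following is a minimal sketch of tabular Q-learning over joint (channel, duration) actions. It is not the paper's model or algorithm: the sweep jammer, the state definition, the constants `NUM_CHANNELS`, `DURATIONS`, `SWITCH_COST` and `OVERHEAD`, and the per-slot discount `gamma ** duration` are all assumptions chosen only for illustration.

```python
import random
from itertools import product

NUM_CHANNELS = 4          # assumed number of channels
DURATIONS = (1, 2, 4)     # assumed candidate transmission durations, in slots
SWITCH_COST = 0.5         # assumed penalty for changing channel
OVERHEAD = 0.3            # assumed fixed cost of starting any transmission

# Joint action space: every (channel, duration) pair.
ACTIONS = list(product(range(NUM_CHANNELS), DURATIONS))


def step(state, action):
    """Toy environment: a sweep jammer that advances one channel per slot.

    state  = (channel used last, channel the jammer occupies now)
    action = (channel to use, number of slots to transmit)
    Returns (reward, next_state).
    """
    prev_channel, jammer = state
    channel, duration = action
    # A slot succeeds when the jammer is on a different channel in that slot.
    good_slots = sum(
        1 for k in range(duration) if (jammer + k) % NUM_CHANNELS != channel
    )
    reward = good_slots - OVERHEAD
    if channel != prev_channel:
        reward -= SWITCH_COST
    next_state = (channel, (jammer + duration) % NUM_CHANNELS)
    return reward, next_state


def train(steps=50_000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning; returns the Q-table and the greedy policy."""
    rng = random.Random(seed)
    states = list(product(range(NUM_CHANNELS), range(NUM_CHANNELS)))
    q = {(s, a): 0.0 for s in states for a in ACTIONS}
    state = (0, 0)
    for _ in range(steps):
        # Epsilon-greedy exploration.
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[(state, a)])
        reward, next_state = step(state, action)
        # Discount by elapsed slots so long and short actions are comparable
        # (an assumed, semi-MDP style choice).
        discount = gamma ** action[1]
        best_next = max(q[(next_state, a)] for a in ACTIONS)
        td_error = reward + discount * best_next - q[(state, action)]
        q[(state, action)] += alpha * td_error
        state = next_state
    policy = {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in states}
    return q, policy
```

Calling `q, policy = train()` and then inspecting `policy[(0, 0)]` gives the learned (channel, duration) choice when the transmitter last used channel 0 and the jammer currently sits on channel 0. In this toy setting, a longer duration amortizes the assumed `OVERHEAD` and `SWITCH_COST` but exposes more slots to the sweeping jammer, which mirrors the trade-off described in the abstract.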