Cognitive radar mode control: a comparison of different reinforcement learning algorithms

S. A. Ford,M. Ritchie

International Conference on Radar Systems (RADAR 2022)(2022)

引用 0|浏览0
暂无评分
摘要
This paper describes the use of deep reinforcement learning (RL) to apply the concept of cognition in sensing systems to the choice of operational radio frequency (RF) mode (active, bistatic receive, electronic surveillance (ES), electronic protection measure (EPM)) for a multi-function RF system (MFRFS). This is investigated in a simulated air-to-air combat scenario, with the RL on a blue fast jet rewarded for successfully guiding a missile to the opposition, a red fast jet, and penalised if the red jet is successful. Three RL algorithms (deep Q-network (DQN), advantage actor-critic (A2C), and proximal policy optimisation (PPO)) are compared with baselines that include the 4 static modes and a set of fixed rulesets, and it is shown that - after hyperparameter tuning - the algorithms perform comparably to these baselines. It is suggested that PPO might be the optimal algorithm in this context.
更多
查看译文
关键词
4 static modes,advantage actor-critic,blue fast jet,cognitive radar mode control,deep reinforcement learning,different reinforcement learning algorithms,electronic protection measure,multifunction RF system,optimal algorithm,red fast jet,red jet,RL algorithms,simulated air-to-air combat scenario
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要