Sequential Decision Making with Limited Observation Capability: Application to Wireless Networks

IEEE Transactions on Cognitive Communications and Networking (2019)

Abstract
This paper studies a generalized class of restless multi-armed bandits that have hidden states and allow cumulative feedback, as opposed to the conventional instantaneous feedback. We call them lazy restless bandits (LRBs) because decision-making events are sparser than state-transition events. Hence, feedback after each decision event is the cumulative effect of the following state transition...
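The cumulative-feedback idea can be illustrated with a minimal sketch. It assumes a single arm whose hidden state follows a two-state (good/bad) Markov chain and earns a unit reward for each step spent in the good state. The function name, its parameters, and the reward model are illustrative assumptions, not the paper's formulation.

```python
import random


def cumulative_feedback(state, p_stay_good, p_stay_bad, k, rng):
    """Hypothetical helper: simulate one decision event on one arm.

    The hidden state (1 = good, 0 = bad) makes k transitions before the
    next decision event. The decision maker would observe only the summed
    reward, not the individual states visited.
    Returns (total_reward, final_state).
    """
    total = 0
    for _ in range(k):
        # Probability of remaining in the current state (assumed model).
        stay = p_stay_good if state == 1 else p_stay_bad
        if rng.random() >= stay:
            state = 1 - state  # hidden transition to the other state
        total += state  # unit reward for each step spent in the good state
    return total, state


if __name__ == "__main__":
    rng = random.Random(0)
    reward, _hidden = cumulative_feedback(1, 0.9, 0.7, k=5, rng=rng)
    print("cumulative feedback over 5 transitions:", reward)
```

With k = 1 the sketch reduces to the conventional instantaneous-feedback setting, in which each decision is followed by exactly one state transition.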
Keywords
Indexes, Relays, Decision making, Markov processes, Fading channels, Optimization, Productivity