Noise Free Multi-armed Bandit Game.

Lecture Notes in Computer Science (2016)

Abstract
We study the loss version of the adversarial multi-armed bandit problem in which one arm is lossless. We exhibit an adversary strategy that forces any player to suffer a loss of K - 1 - O(1/T), where K is the number of arms and T is the number of rounds.
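
To make the setting concrete, here is a minimal sketch of a bandit game with one lossless arm. It is not the construction from the paper, which the abstract does not describe. It assumes binary losses (0 or 1) and an adaptive adversary that decides which arm is lossless as late as possible. The names `play_game` and `explore_then_commit` are hypothetical and chosen for illustration only.

```python
def play_game(num_arms, num_rounds, choose_arm):
    """Simulate a loss-version bandit game against a simple adaptive adversary.

    Assumptions (not taken from the paper): losses are 0 or 1, and the
    adversary may fix the identity of the lossless arm lazily, as long as
    its answers stay consistent with exactly one lossless arm.
    """
    touched = set()   # arms the player has pulled at least once
    lossless = None   # decided only when a single untouched arm remains
    total_loss = 0
    history = []      # (arm, loss) pairs observed by the player so far
    for t in range(num_rounds):
        arm = choose_arm(t, history)
        # The last arm to be pulled for the first time becomes the lossless one.
        if lossless is None and arm not in touched and len(touched) == num_arms - 1:
            lossless = arm
        touched.add(arm)
        loss = 0 if arm == lossless else 1
        total_loss += loss
        history.append((arm, loss))
    return total_loss


def explore_then_commit(t, history):
    """Hypothetical player: try arms 0, 1, 2, ... until one has zero loss."""
    for arm, loss in history:
        if loss == 0:
            return arm
    # Every pull so far cost 1, so arms 0..len(history)-1 are known to be lossy.
    return len(history)


if __name__ == "__main__":
    print(play_game(num_arms=5, num_rounds=100, choose_arm=explore_then_commit))
```

Under these assumptions, every round played before the player has pulled K - 1 distinct arms costs 1, so any player loses at least min(T, K - 1) in this sketch. The K - 1 - O(1/T) bound stated in the abstract refers to the paper's own setting and adversary, which may differ from the assumptions made here.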
Keywords
Algorithmic learning, Online learning, Bandit problem