25th Annual Conference on Learning Theory
Analysis of Thompson Sampling for the Multi-armed Bandit Problem
Shipra Agrawal, Navin Goyal, Shie Mannor, Nathan Srebro, Robert C Williamson
(2013)
Cited by 22 | Views 10
Keywords: multi-armed bandit, Thompson sampling