(Nearly) Optimal Differentially Private Stochastic Multi-Arm Bandits

Uncertainty in Artificial Intelligence(2015)

引用 6|浏览71
暂无评分
摘要
We study the problem of private stochastic multi-arm bandits. Our notion of privacy is the same as some of the earlier works in the general area of private online learning [13, 17, 24]. We design algorithms that are i) differentially private, and ii) have regret guarantees that (almost) match the regret guarantees for the best non-private algorithms (e.g., upper confidence bound sampling and Thompson sampling). Moreover, through our experiments, we empirically show the effectiveness of our algorithms.
更多
查看译文
关键词
multi-arm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要