Best arm identification in generalized linear bandits

OPERATIONS RESEARCH LETTERS(2021)

引用 3|浏览0
暂无评分
摘要
We consider the best-arm identification problem in generalized linear bandits: each arm has a vector of covariates, there is an unknown vector of parameters that is common across the arms, and a generalized linear model captures the dependence of rewards on the covariate and parameter vectors. The goal is to identify a near-optimal arm with high probability while minimizing the number of arm pulls (i.e., the sampling budget). We propose the first algorithm for this problem and provide theoretical guarantees on its accuracy and sampling efficiency.
更多
查看译文
关键词
Best arm identification,Generalized linear bandits,Sequential clinical trial
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要