Near Minimax Optimal Players For The Finite-Time 3-Expert Prediction Problem

Yasin Abbasi-Yadkori,Peter L. Bartlett,Victor Gabillon

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017)（2017）

引用 23|浏览34

暂无评分

摘要

We study minimax strategies for the online prediction problem with expert advice. It has been conjectured that a simple adversary strategy, called COMB, is near optimal in this game for any number of experts. Our results and new insights make progress in this direction by showing that, up to a small additive term, COMB is minimax optimal in the finite-time three expert problem. In addition, we provide for this setting a new near minimax optimal COMB-based learner. Prior to this work, in this problem, learners obtaining the optimal multiplicative constant in their regret rate were known only when K = 2 or K -> infinity We characterize, when K = 3, the regret of the game scaling as root 8/(9 pi)T +/- log(T)(2) which gives for the first time the optimal constant in the leading (root T) term of the regret.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要