On the Minimax Regret for Linear Bandits in a wide variety of Action Spaces

arXiv (2023)

Abstract

As noted by \cite{lattimore2020bandit}, characterizing the minimax regret of linear bandits in a wide variety of action spaces is an open problem. In this article we present an optimal regret lower bound for a wide class of convex action spaces.

Keywords

linear bandits, minimax regret, action
