Online bipartite matching with partially bandit feedback

Proceedings of NIPS workshop on Discrete optimization in Machine Learning. Granada, Spain（2011）

引用 4|浏览1

暂无评分

摘要

Inspired by the increasing popularity of online dating websites, we consider the problem of online bipartite matching. Previous work on this topic either assumes an unrealistic full-information feedback model or makes due with bandit feedback but requires us to solve a difficult computational problem on each online round. We take the middle-ground between these two extremes and propose a partially-bandit feedback model that is both realistic and allows us to design an efficient regret-minimizing algorithm. We present the ExpMatch online learning algorithm for bipartite matchings with partially-bandit feedback, analyze its computation complexity and its regret, and compare it to previous algorithms.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要