Online bipartite matching with partially bandit feedback

Proceedings of NIPS workshop on Discrete optimization in Machine Learning. Granada, Spain(2011)

引用 4|浏览1
暂无评分
摘要
Inspired by the increasing popularity of online dating websites, we consider the problem of online bipartite matching. Previous work on this topic either assumes an unrealistic full-information feedback model or makes due with bandit feedback but requires us to solve a difficult computational problem on each online round. We take the middle-ground between these two extremes and propose a partially-bandit feedback model that is both realistic and allows us to design an efficient regret-minimizing algorithm. We present the ExpMatch online learning algorithm for bipartite matchings with partially-bandit feedback, analyze its computation complexity and its regret, and compare it to previous algorithms.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要