Online bipartite matching with partially bandit feedback
Proceedings of NIPS workshop on Discrete optimization in Machine Learning. Granada, Spain(2011)
摘要
Inspired by the increasing popularity of online dating websites, we consider the problem of online bipartite matching. Previous work on this topic either assumes an unrealistic full-information feedback model or makes due with bandit feedback but requires us to solve a difficult computational problem on each online round. We take the middle-ground between these two extremes and propose a partially-bandit feedback model that is both realistic and allows us to design an efficient regret-minimizing algorithm. We present the ExpMatch online learning algorithm for bipartite matchings with partially-bandit feedback, analyze its computation complexity and its regret, and compare it to previous algorithms.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要