On Dual-Based PI Controllers for Online Allocation Problems

arxiv(2022)

引用 0|浏览1
暂无评分
摘要
Dual-based proportional-integral (PI) controllers are often employed in practice to solve online allocation problems with global constraints, such as budget pacing in online advertising. However, controllers are used in a heuristic fashion and come with no provable guarantees on their performance. This paper provides the first regret bounds on the performance of dual-based PI controllers for online allocation problems. We do so by first establishing a fundamental connection between dual-based PI controllers and a new first-order algorithm for online convex optimization, which, in a special case, recovers online mirror descent with momentum. We prove the proposed first-order algorithm attains low regret for general online convex optimization problems with adversarial inputs. We leverage this new result to give the first regret bound for dual-based PI controllers for online allocation problems. As a byproduct of our proofs, we provide the first regret bound for online mirror descent for non-smooth convex optimization, which might be of independent interest.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要