Bandits with Movement Costs and Adaptive Pricing
COLT, pp. 1242-1268, 2017.
We extend the model of Multi-armed Bandit with unit switching cost to incorporate a metric between the actions. We consider the case where the metric over the actions can be modeled by a complete binary tree, and the distance between two leaves is the size of the subtree of their least common ancestor, which abstracts the case that the ac...More
PPT (Upload PPT)