Stochastic Online Learning with Probabilistic Graph Feedback
national conference on artificial intelligence, 2020.
Abstract:
We consider a problem of stochastic online learning with general probabilistic graph feedback. Two cases are covered. (a) The one-step case where for each edge $(i,j)$ with probability $p_{ij}$ in the probabilistic feedback graph. After playing arm $i$ the learner observes a sample reward feedback of arm $j$ with independent probability...More
Code:
Data:
Tags
Comments