"Pseudo-reward Algorithms for Contextual Bandits with Linear Payoff Functions."

Ku-Chun Chou et al. (2014)
a service of  Schloss Dagstuhl - Leibniz Center for Informatics