Stochastic Convex Optimization with Bandit Feedback

Alekh Agarwal, Dean P. Foster, Daniel Hsu, Sham M. Kakade, Alexander Rakhlin. Stochastic Convex Optimization with Bandit Feedback. SIAM Journal on Optimization, 23(1):213-240, 2013. [doi]

Abstract

Abstract is missing.