The following publications are possibly variants of this publication:
- Stochastic convex optimization with bandit feedbackAlekh Agarwal, Dean P. Foster, Daniel Hsu, Sham M. Kakade, Alexander Rakhlin. nips 2011: 1035-1043 [doi]
- Stochastic Convex Optimization with Bandit FeedbackAlekh Agarwal, Dean P. Foster, Daniel Hsu, Sham M. Kakade, Alexander Rakhlin. siamjo, 23(1):213-240, 2013. [doi]
- High-Probability Regret Bounds for Bandit Online Linear OptimizationPeter L. Bartlett, Varsha Dani, Thomas P. Hayes, Sham Kakade, Alexander Rakhlin, Ambuj Tewari. colt 2008: 335-342 [doi]