Towards Minimax Policies for Online Linear Optimization with Bandit Feedback - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Sébastien Bubeck, Nicolò Cesa-Bianchi, Sham M. Kakade. Towards Minimax Policies for Online Linear Optimization with Bandit Feedback. Journal of Machine Learning Research, 23, 2012. [doi]

This author has not been identified. Look up 'Sébastien Bubeck' in GoogleThis author has not been identified. Look up 'Nicolò Cesa-Bianchi' in GoogleThis author has not been identified. Look up 'Sham M. Kakade' in Google

runs on WebDSL