Towards Minimax Policies for Online Linear Optimization with Bandit Feedback

Sébastien Bubeck, Nicolò Cesa-Bianchi, Sham M. Kakade. Towards Minimax Policies for Online Linear Optimization with Bandit Feedback. Journal of Machine Learning Research, 23, 2012.
