Why is Posterior Sampling Better than Optimism for Reinforcement Learning? - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Ian Osband, Benjamin Van Roy. Why is Posterior Sampling Better than Optimism for Reinforcement Learning?. In Doina Precup, Yee Whye Teh, editors, Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017. Volume 70 of JMLR Workshop and Conference Proceedings, pages 2701-2710, JMLR.org, 2017. [doi]

This author has not been identified. Look up 'Ian Osband' in GoogleThis author has not been identified. Look up 'Benjamin Van Roy' in Google

runs on WebDSL