Thompson Sampling for Learning Parameterized Markov Decision Processes

Aditya Gopalan, Shie Mannor. Thompson Sampling for Learning Parameterized Markov Decision Processes. In Peter Grünwald, Elad Hazan, Satyen Kale, editors, Proceedings of The 28th Conference on Learning Theory, COLT 2015, Paris, France, July 3-6, 2015. Volume 40 of JMLR Proceedings, pages 861-898, JMLR.org, 2015. [doi]

Abstract

Abstract is missing.