Minimax Regret Bounds for Reinforcement Learning

Mohammad Gheshlaghi Azar, Ian Osband, RĂ©mi Munos. Minimax Regret Bounds for Reinforcement Learning. In Doina Precup, Yee Whye Teh, editors, Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017. Volume 70 of JMLR Workshop and Conference Proceedings, pages 263-272, JMLR.org, 2017. [doi]

Abstract

Abstract is missing.