Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

Andrew J. Wagenmaker, Max Simchowitz, Kevin Jamieson 0001. Beyond No Regret: Instance-Dependent PAC Reinforcement Learning. In Po-Ling Loh, Maxim Raginsky, editors, Conference on Learning Theory, 2-5 July 2022, London, UK. Volume 178 of Proceedings of Machine Learning Research, pages 358-418, PMLR, 2022. [doi]

Abstract

Abstract is missing.