Large Scale Markov Decision Processes with Changing Rewards

Adrian Rivera Cardoso, He Wang, Huan Xu. Large Scale Markov Decision Processes with Changing Rewards. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Edward A. Fox, Roman Garnett, editors, Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada. pages 2337-2347, 2019. [doi]

Abstract

Abstract is missing.