Grigory Neustroev, Mathijs de Weerdt, Remco A. Verzijlbergh. Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards. In J. Benton, Nir Lipovetzky, Eva Onaindia, David E. Smith, Siddharth Srivastava, editors, Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, ICAPS 2019, Berkeley, CA, USA, July 11-15, 2019, pages 292-300. AAAI Press, 2019.