Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions

Yasin Abbasi-Yadkori, Peter L. Bartlett, Varun Kanade, Yevgeny Seldin, Csaba Szepesvári. Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions. In Christopher J. C. Burges, Léon Bottou, Zoubin Ghahramani, Kilian Q. Weinberger, editors, Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013, Lake Tahoe, Nevada, United States. pages 2508-2516, 2013.
