No-regret learning with high-probability in adversarial Markov decision processes

Mahsa Ghasemi, Abolfazl Hashemi, Haris Vikalo, Ufuk Topcu. No-regret learning with high-probability in adversarial Markov decision processes. In Cassio P. de Campos, Marloes H. Maathuis, Erik Quaeghebeur, editors, Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, UAI 2021, Virtual Event, 27-30 July 2021. Volume 161 of Proceedings of Machine Learning Research, pages 992-1001, AUAI Press, 2021.
