Diverse Policies Converge in Reward-Free Markov Decision Processes

Fanqi Lin, Shiyu Huang, Wei-Wei Tu. Diverse Policies Converge in Reward-Free Markov Decision Processes. In Fenrong Liu, Arun Anand Sadanandan, Duc Nghia Pham, Petrus Mursanto, Dickson Lukose, editors, PRICAI 2023: Trends in Artificial Intelligence - 20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023, Jakarta, Indonesia, November 15-19, 2023, Proceedings, Part I. Volume 14325 of Lecture Notes in Computer Science, pages 125-136, Springer, 2022.