Improved Exploration in Factored Average-Reward MDPs

Mohammad Sadegh Talebi, Anders Jonsson, Odalric Maillard. Improved Exploration in Factored Average-Reward MDPs. In Arindam Banerjee, Kenji Fukumizu, editors, The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021, April 13-15, 2021, Virtual Event. Volume 130 of Proceedings of Machine Learning Research, pages 3988-3996, PMLR, 2021.

Authors

Mohammad Sadegh Talebi


Anders Jonsson


Odalric Maillard
