Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions

Zihao Deng, Siddartha Devic, Brendan Juba. Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions. In Gustau Camps-Valls, Francisco J. R. Ruiz, Isabel Valera, editors, International Conference on Artificial Intelligence and Statistics, AISTATS 2022, 28-30 March 2022, Virtual Event. Volume 151 of Proceedings of Machine Learning Research, pages 11280-11304, PMLR, 2022. [doi]

Abstract

Abstract is missing.