Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning

Ming Yin, Yu Bai, Yu-Xiang Wang. Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning. In Arindam Banerjee 0001, Kenji Fukumizu, editors, The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021, April 13-15, 2021, Virtual Event. Volume 130 of Proceedings of Machine Learning Research, pages 1567-1575, PMLR, 2021. [doi]

Authors

Ming Yin

This author has not been identified. Look up 'Ming Yin' in Google

Yu Bai

This author has not been identified. Look up 'Yu Bai' in Google

Yu-Xiang Wang

This author has not been identified. Look up 'Yu-Xiang Wang' in Google