Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Ming Yin, Yu Bai, Yu-Xiang Wang. Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning. In Arindam Banerjee 0001, Kenji Fukumizu, editors, The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021, April 13-15, 2021, Virtual Event. Volume 130 of Proceedings of Machine Learning Research, pages 1567-1575, PMLR, 2021. [doi]

This author has not been identified. Look up 'Ming Yin' in GoogleThis author has not been identified. Look up 'Yu Bai' in GoogleThis author has not been identified. Look up 'Yu-Xiang Wang' in Google

runs on WebDSL