Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping

Dongruo Zhou, Jiafan He, Quanquan Gu. Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping. In Marina Meila, Tong Zhang, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 12793-12802, PMLR, 2021.

Authors

Dongruo Zhou

Jiafan He

Quanquan Gu
