Qi Cai, Zhuoran Yang, Chi Jin, Zhaoran Wang. Provably Efficient Exploration in Policy Optimization. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event. Volume 119 of Proceedings of Machine Learning Research, pages 1283-1294, PMLR, 2020. [doi]
Abstract is missing.