On the Optimality of Batch Policy Optimization Algorithms

Chenjun Xiao, Yifan Wu, Jincheng Mei, Bo Dai, Tor Lattimore, Lihong Li 0001, Csaba Szepesvári, Dale Schuurmans. On the Optimality of Batch Policy Optimization Algorithms. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 11362-11371, PMLR, 2021. [doi]

Abstract

Abstract is missing.