Improved Regret Bound and Experience Replay in Regularized Policy Iteration

Nevena Lazic, Dong Yin, Yasin Abbasi-Yadkori, Csaba Szepesvári. Improved Regret Bound and Experience Replay in Regularized Policy Iteration. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 6032-6042, PMLR, 2021. [doi]

Authors

Nevena Lazic

This author has not been identified. Look up 'Nevena Lazic' in Google

Dong Yin

This author has not been identified. Look up 'Dong Yin' in Google

Yasin Abbasi-Yadkori

This author has not been identified. Look up 'Yasin Abbasi-Yadkori' in Google

Csaba Szepesvári

This author has not been identified. Look up 'Csaba Szepesvári' in Google