Offline Reinforcement Learning with Closed-Form Policy Improvement Operators

Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang. Offline Reinforcement Learning with Closed-Form Policy Improvement Operators. In Andreas Krause, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 20485-20528, PMLR, 2023.

Authors

Jiachen Li

Edwin Zhang

Ming Yin

Qinxun Bai

Yu-Xiang Wang

William Yang Wang
