Offline Reinforcement Learning with Closed-Form Policy Improvement Operators - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang. Offline Reinforcement Learning with Closed-Form Policy Improvement Operators. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 20485-20528, PMLR, 2023. [doi]

This author has not been identified. Look up 'Jiachen Li' in GoogleThis author has not been identified. Look up 'Edwin Zhang' in GoogleThis author has not been identified. Look up 'Ming Yin' in GoogleThis author has not been identified. Look up 'Qinxun Bai' in GoogleThis author has not been identified. Look up 'Yu-Xiang Wang' in GoogleThis author has not been identified. Look up 'William Yang Wang' in Google

runs on WebDSL