OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

Jongmin Lee 0004, Wonseok Jeon, Byung-Jun Lee 0001, Joelle Pineau, Kee-Eung Kim. OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 6120-6130, PMLR, 2021. [doi]

Authors

Jongmin Lee 0004

This author has not been identified. Look up 'Jongmin Lee 0004' in Google

Wonseok Jeon

This author has not been identified. Look up 'Wonseok Jeon' in Google

Byung-Jun Lee 0001

This author has not been identified. Look up 'Byung-Jun Lee 0001' in Google

Joelle Pineau

This author has not been identified. Look up 'Joelle Pineau' in Google

Kee-Eung Kim

This author has not been identified. Look up 'Kee-Eung Kim' in Google