RORL: Robust Offline Reinforcement Learning via Conservative Smoothing

Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han. RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. 2022.

Authors

Rui Yang

Chenjia Bai

Xiaoteng Ma

Zhaoran Wang

Chongjie Zhang

Lei Han
