Offline Reinforcement Learning with Value-based Episodic Memory

Xiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang 0028, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu. Offline Reinforcement Learning with Value-based Episodic Memory. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

Authors

Xiaoteng Ma

This author has not been identified. Look up 'Xiaoteng Ma' in Google

Yiqin Yang

This author has not been identified. Look up 'Yiqin Yang' in Google

Hao Hu

This author has not been identified. Look up 'Hao Hu' in Google

Jun Yang 0028

This author has not been identified. Look up 'Jun Yang 0028' in Google

Chongjie Zhang

This author has not been identified. Look up 'Chongjie Zhang' in Google

Qianchuan Zhao

This author has not been identified. Look up 'Qianchuan Zhao' in Google

Bin Liang

This author has not been identified. Look up 'Bin Liang' in Google

Qihan Liu

This author has not been identified. Look up 'Qihan Liu' in Google