Behavior Proximal Policy Optimization

Zifeng Zhuang, Kun Lei, Jinxin Liu, Donglin Wang, Yilang Guo. Behavior Proximal Policy Optimization. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: