Behavior Proximal Policy Optimization

Zifeng Zhuang, Kun Lei, Jinxin Liu, Donglin Wang, Yilang Guo. Behavior Proximal Policy Optimization. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

The following publications are possibly variants of this publication:

Truly Proximal Policy Optimization. Yuhui Wang, Hao He, Xiaoyang Tan. UAI 2018: 21 [doi]
