Proximal Policy Optimization With Policy Feedback

Yang Gu, Yuhu Cheng, C. L. Philip Chen, Xuesong Wang 0001. Proximal Policy Optimization With Policy Feedback. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 52(7):4600-4610, 2022. [doi]

Abstract

Abstract is missing.