Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective

Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Yong Feng, Zhaowei Shang, Mingliang Zhou. Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective. Eng. Appl. of AI, 151:110676, 2025.

Abstract

Abstract is missing.