Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Yong Feng, Zhaowei Shang, Mingliang Zhou. Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective. Eng. Appl. of AI, 151:110676, 2025.