Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective

Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Yong Feng 0002, Zhaowei Shang, Mingliang Zhou. Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective. Eng. Appl. of AI, 151:110676, 2025. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.