A novel multi-step reinforcement learning method for solving reward hacking

Yinlong Yuan, Zhu Liang Yu, Zhenghui Gu, Xiaoyan Deng, Yuanqing Li. A novel multi-step reinforcement learning method for solving reward hacking. Appl. Intell., 49(8):2874-2888, 2019. [doi]

Abstract

Abstract is missing.