A novel multi-step reinforcement learning method for solving reward hacking - researchr publication

researchr

You are not signed in
Sign in
Sign up

Yinlong Yuan, Zhu Liang Yu, Zhenghui Gu, Xiaoyan Deng, Yuanqing Li. A novel multi-step reinforcement learning method for solving reward hacking. Appl. Intell., 49(8):2874-2888, 2019. [doi]

Abstract is missing.

runs on WebDSL