Hindsight Balanced Reward Shaping

Mengxuan Shao, Feng Jiang, Shaohui Liu, Kun Han, Debin Zhao. Hindsight Balanced Reward Shaping. In Mohammad Tanveer 0001, Sonali Agarwal, Seiichi Ozawa, Asif Ekbal, Adam Jatowt, editors, Neural Information Processing - 29th International Conference, ICONIP 2022, Virtual Event, November 22-26, 2022, Proceedings, Part V. Volume 1792 of Communications in Computer and Information Science, pages 492-503, Springer, 2022. [doi]

Abstract

Abstract is missing.