Mengxuan Shao, Feng Jiang, Shaohui Liu, Kun Han, Debin Zhao. Hindsight Balanced Reward Shaping. In Mohammad Tanveer 0001, Sonali Agarwal, Seiichi Ozawa, Asif Ekbal, Adam Jatowt, editors, Neural Information Processing - 29th International Conference, ICONIP 2022, Virtual Event, November 22-26, 2022, Proceedings, Part V. Volume 1792 of Communications in Computer and Information Science, pages 492-503, Springer, 2022. [doi]
No references recorded for this publication.
No citations of this publication recorded.