Chen Gong, Yunpeng Bai, Xinwen Hou, Xiaohui Ji. Stable Training of Bellman Error in Reinforcement Learning. In Haiqin Yang, Kitsuchart Pasupa, Andrew Chi-Sing Leung, James T. Kwok, Jonathan H. Chan, Irwin King, editors, Neural Information Processing - 27th International Conference, ICONIP 2020, Bangkok, Thailand, November 18-22, 2020, Proceedings, Part V. Volume 1333 of Communications in Computer and Information Science, pages 439-448, Springer, 2020.