The Mirage of Action-Dependent Baselines in Reinforcement Learning

George Tucker, Surya Bhupatiraju, Shixiang Gu, Richard E. Turner, Zoubin Ghahramani, Sergey Levine. The Mirage of Action-Dependent Baselines in Reinforcement Learning. In Jennifer G. Dy and Andreas Krause, editors, Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018. Volume 80 of JMLR Workshop and Conference Proceedings, pages 5022-5031, JMLR.org, 2018.

Abstract

Abstract is missing.