Action-dependent Control Variates for Policy Optimization via Stein Identity

Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu. Action-dependent Control Variates for Policy Optimization via Stein Identity. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018.
