Diverse Exploration via Conjugate Policies for Policy Gradient Methods

Andrew Cohen, Xingye Qiao, Lei Yu 0001, Elliot Way, Xiangrong Tong. Diverse Exploration via Conjugate Policies for Policy Gradient Methods. In The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019. pages 3404-3411, AAAI Press, 2019. [doi]

Abstract

Abstract is missing.