Dueling Posterior Sampling for Preference-Based Reinforcement Learning

Ellen R. Novoseller, Yibing Wei, Yanan Sui, Yisong Yue, Joel Burdick. Dueling Posterior Sampling for Preference-Based Reinforcement Learning. In Ryan P. Adams, Vibhav Gogate, editors, Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, UAI 2020, virtual online, August 3-6, 2020. pages 424, AUAI Press, 2020. [doi]

Abstract

Abstract is missing.