Generalized Optimistic Q-Learning with Provable Efficiency

Grigory Neustroev, Mathijs Michiel de Weerdt. Generalized Optimistic Q-Learning with Provable Efficiency. In Amal El Fallah-Seghrouchni, Gita Sukthankar, Bo An, Neil Yorke-Smith, editors, Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020, pages 913-921. International Foundation for Autonomous Agents and Multiagent Systems, 2020.
