UCB Momentum Q-learning: Correcting the bias without forgetting

Pierre Ménard, Omar Darwiche Domingues, Xuedong Shang, Michal Valko. UCB Momentum Q-learning: Correcting the bias without forgetting. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 7609-7618, PMLR, 2021. [doi]

Authors

Pierre Ménard

This author has not been identified. Look up 'Pierre Ménard' in Google

Omar Darwiche Domingues

This author has not been identified. Look up 'Omar Darwiche Domingues' in Google

Xuedong Shang

This author has not been identified. Look up 'Xuedong Shang' in Google

Michal Valko

This author has not been identified. Look up 'Michal Valko' in Google