From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

Daniil Tiapkin, Denis Belomestny, Eric Moulines, Alexey Naumov, Sergey Samsonov, Yunhao Tang, Michal Valko, Pierre Ménard. From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 21380-21431, PMLR, 2022.

Authors

Daniil Tiapkin

Denis Belomestny

Eric Moulines

Alexey Naumov

Sergey Samsonov

Yunhao Tang

Michal Valko

Pierre Ménard
