From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

researchr

You are not signed in
Sign in
Sign up

Daniil Tiapkin, Denis Belomestny, Eric Moulines, Alexey Naumov, Sergey Samsonov, Yunhao Tang, Michal Valko, Pierre Ménard. From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu 0001, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 21380-21431, PMLR, 2022. [doi]

@inproceedings{TiapkinBMNSTVM22,
  title = {From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses},
  author = {Daniil Tiapkin and Denis Belomestny and Eric Moulines and Alexey Naumov and Sergey Samsonov and Yunhao Tang and Michal Valko and Pierre Ménard},
  year = {2022},
  url = {https://proceedings.mlr.press/v162/tiapkin22a.html},
  researchr = {https://researchr.org/publication/TiapkinBMNSTVM22},
  cites = {0},
  citedby = {0},
  pages = {21380-21431},
  booktitle = {International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA},
  editor = {Kamalika Chaudhuri and Stefanie Jegelka and Le Song and Csaba Szepesvári and Gang Niu 0001 and Sivan Sabato},
  volume = {162},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}

External Links

Cite Key

Statistics

PDF

Researchr

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses