Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári, Wataru Kumagai, Yutaka Matsuo. Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice. In Andreas Krause, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 17135-17175, PMLR, 2023.

Authors

Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
Wenhao Yang
Jincheng Mei
Pierre Ménard
Mohammad Gheshlaghi Azar
Rémi Munos
Olivier Pietquin
Matthieu Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo