Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs

Junkai Zhang, Weitong Zhang, Quanquan Gu. Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs. In Andreas Krause 0001, Emma Brunskill, KyungHyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA. Volume 202 of Proceedings of Machine Learning Research, pages 41902-41930, PMLR, 2023. [doi]

Authors

Junkai Zhang

This author has not been identified. Look up 'Junkai Zhang' in Google

Weitong Zhang

This author has not been identified. Look up 'Weitong Zhang' in Google

Quanquan Gu

This author has not been identified. Look up 'Quanquan Gu' in Google