Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning

Gen Li 0005, Laixi Shi, Yuxin Chen 0002, Yuantao Gu, Yuejie Chi. Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 17762-17776, 2021. [doi]

Authors

Gen Li 0005

This author has not been identified. Look up 'Gen Li 0005' in Google

Laixi Shi

This author has not been identified. Look up 'Laixi Shi' in Google

Yuxin Chen 0002

This author has not been identified. Look up 'Yuxin Chen 0002' in Google

Yuantao Gu

This author has not been identified. Look up 'Yuantao Gu' in Google

Yuejie Chi

This author has not been identified. Look up 'Yuejie Chi' in Google