Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity

Laixi Shi, Gen Li, Yuting Wei, Yuxin Chen, Yuejie Chi. Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 19967-20025. PMLR, 2022.

Authors

Laixi Shi

Gen Li

Yuting Wei

Yuxin Chen

Yuejie Chi
