Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency

Qi Cai, Zhuoran Yang, Zhaoran Wang. Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 2485-2522, PMLR, 2022.

Authors

Qi Cai

Zhuoran Yang

Zhaoran Wang
