Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model

Jialian Li, Tongzheng Ren, Dong Yan, Hang Su, Jun Zhu. Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022. pages 7417-7425, AAAI Press, 2022. [doi]

Authors

Jialian Li

This author has not been identified. Look up 'Jialian Li' in Google

Tongzheng Ren

This author has not been identified. Look up 'Tongzheng Ren' in Google

Dong Yan

This author has not been identified. Look up 'Dong Yan' in Google

Hang Su

This author has not been identified. Look up 'Hang Su' in Google

Jun Zhu

This author has not been identified. Look up 'Jun Zhu' in Google