Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes

Florent Delgrange, Ann Nowé, Guillermo A. Pérez. Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelfth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022, Virtual Event, February 22 - March 1, 2022. Pages 6497-6505, AAAI Press, 2022.

Authors

Florent Delgrange

Ann Nowé

Guillermo A. Pérez