Learning reward machines: A study in partially observable reinforcement learning

Rodrigo Toro Icarte, Toryn Q. Klassen, Richard Valenzano, Margarita P. Castro, Ethan Waldie, Sheila A. McIlraith. Learning reward machines: A study in partially observable reinforcement learning. Artificial Intelligence, 323:103989, October 2023.
