Inferring Probabilistic Reward Machines from Non-Markovian Reward Signals for Reinforcement Learning

Taylor Dohmen, Noah Topper, George K. Atia, Andre Beckus, Ashutosh Trivedi 0001, Alvaro Velasquez. Inferring Probabilistic Reward Machines from Non-Markovian Reward Signals for Reinforcement Learning. In Akshat Kumar, Sylvie Thiébaux, Pradeep Varakantham, William Yeoh 0001, editors, Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling, ICAPS 2022, Singapore (virtual), June 13-24, 2022. pages 574-582, AAAI Press, 2022. [doi]

Authors

Taylor Dohmen

This author has not been identified. Look up 'Taylor Dohmen' in Google

Noah Topper

This author has not been identified. Look up 'Noah Topper' in Google

George K. Atia

This author has not been identified. Look up 'George K. Atia' in Google

Andre Beckus

This author has not been identified. Look up 'Andre Beckus' in Google

Ashutosh Trivedi 0001

This author has not been identified. Look up 'Ashutosh Trivedi 0001' in Google

Alvaro Velasquez

This author has not been identified. Look up 'Alvaro Velasquez' in Google