Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning

Richard S. Sutton, Doina Precup, Satinder P. Singh. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Artificial Intelligence, 112(1-2):181-211, 1999.

Authors

Richard S. Sutton

Doina Precup

Satinder P. Singh
