The following publications are possibly variants of this publication:
- Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Richard S. Sutton, Doina Precup, Satinder P. Singh. Artificial Intelligence, 112(1-2):181-211, 1999.
- Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation. Mohammad Salimibeni, Arash Mohammadi, Parvin Malekzadeh, Konstantinos N. Plataniotis. Sensors, 22(4):1393, 2022.
- A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation. Scott Fujimoto, David Meger, Doina Precup. ICML 2021: 3518-3529.