Takeshi Shibuya, Seiji Yasunobu. Reinforcement learning with nonstationary reward depending on the episode. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Anchorage, Alaska, USA, October 9-12, 2011. pages 2145-2150, IEEE, 2011. [doi]
Abstract is missing.