Reinforcement learning with nonstationary reward depending on the episode

Takeshi Shibuya, Seiji Yasunobu. Reinforcement learning with nonstationary reward depending on the episode. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Anchorage, Alaska, USA, October 9-12, 2011. pages 2145-2150, IEEE, 2011. [doi]

Authors

Takeshi Shibuya

This author has not been identified. Look up 'Takeshi Shibuya' in Google

Seiji Yasunobu

This author has not been identified. Look up 'Seiji Yasunobu' in Google