Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems

Makoto Sato, Shigenobu Kobayashi. Average-Reward Reinforcement Learning for Variance Penalized Markov Decision Problems. In Carla E. Brodley, Andrea Pohoreckyj Danyluk, editors, Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28 - July 1, 2001. pages 473-480, Morgan Kaufmann, 2001.

Abstract

Abstract is missing.