Optimizing Average Reward Using Discounted Rewards

Sham Kakade. Optimizing Average Reward Using Discounted Rewards. In David P. Helmbold, Bob Williamson, editors, Computational Learning Theory, 14th Annual Conference on Computational Learning Theory, COLT 2001 and 5th European Conference on Computational Learning Theory, EuroCOLT 2001, Amsterdam, The Netherlands, July 16-19, 2001, Proceedings. Volume 2111 of Lecture Notes in Computer Science, pages 605-615, Springer, 2001. [doi]

Abstract

Abstract is missing.