Optimizing Average Reward Using Discounted Rewards

Sham Kakade. Optimizing Average Reward Using Discounted Rewards. In David P. Helmbold, Bob Williamson, editors, Computational Learning Theory, 14th Annual Conference on Computational Learning Theory, COLT 2001 and 5th European Conference on Computational Learning Theory, EuroCOLT 2001, Amsterdam, The Netherlands, July 16-19, 2001, Proceedings. Volume 2111 of Lecture Notes in Computer Science, pages 605-615, Springer, 2001.

