Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

Sridhar Mahadevan. Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning. In ICML, pages 328-336, 1996.

Authors

Sridhar Mahadevan
