Convergent Learning Algorithms for Unknown Reward Games

Archie C. Chapman, David S. Leslie, Alex Rogers, Nicholas R. Jennings. Convergent Learning Algorithms for Unknown Reward Games. SIAM J. Control and Optimization, 51(4):3154-3180, 2013.
