Improved upper bounds on the expected error in constant step-size Q-learning

Carolyn L. Beck, R. Srikant. Improved upper bounds on the expected error in constant step-size Q-learning. In American Control Conference, ACC 2013, Washington, DC, USA, June 17-19, 2013. pages 1926-1931, IEEE, 2013. [doi]

Abstract

Abstract is missing.