Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

André da Motta Salles Barreto, Charles W. Anderson. Restricted gradient-descent algorithm for value-function approximation in reinforcement learning. Artificial Intelligence, 172(4-5):454-482, 2008. [doi]

Abstract

Abstract is missing.