Diffusion gradient temporal difference for cooperative reinforcement learning with linear function approximation

Sergio Valcarcel Macua, Pavle Belanovic, Santiago Zazo. Diffusion gradient temporal difference for cooperative reinforcement learning with linear function approximation. In 3rd International Workshop on Cognitive Information Processing, CIP 2012, Baiona, Spain, May 28-30, 2012. Pages 1-6, IEEE, 2012.
