Sergio Valcarcel Macua, Pavle Belanovic, Santiago Zazo. Diffusion gradient temporal difference for cooperative reinforcement learning with linear function approximation. In 3rd International Workshop on Cognitive Information Processing, CIP 2012, Baiona, Spain, May 28-30, 2012. pages 1-6, IEEE, 2012. [doi]
Abstract is missing.