Diffusion gradient temporal difference for cooperative reinforcement learning with linear function approximation

Sergio Valcarcel Macua, Pavle Belanovic, Santiago Zazo. Diffusion gradient temporal difference for cooperative reinforcement learning with linear function approximation. In 3rd International Workshop on Cognitive Information Processing, CIP 2012, Baiona, Spain, May 28-30, 2012. pages 1-6, IEEE, 2012. [doi]

@inproceedings{MacuaBZ12,
  title = {Diffusion gradient temporal difference for cooperative reinforcement learning with linear function approximation},
  author = {Sergio Valcarcel Macua and Pavle Belanovic and Santiago Zazo},
  year = {2012},
  doi = {10.1109/CIP.2012.6232901},
  url = {http://dx.doi.org/10.1109/CIP.2012.6232901},
  researchr = {https://researchr.org/publication/MacuaBZ12},
  cites = {0},
  citedby = {0},
  pages = {1-6},
  booktitle = {3rd International Workshop on Cognitive Information Processing, CIP 2012, Baiona, Spain, May 28-30, 2012},
  publisher = {IEEE},
  isbn = {978-1-4673-1877-8},
}