Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir. Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation. In Daphne Koller, Dale Schuurmans, Yoshua Bengio, Léon Bottou, editors, Advances in Neural Information Processing Systems 21, Proceedings of the Twenty-Second Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 8-11, 2008. pages 385-392, MIT Press, 2008. [doi]

@inproceedings{CastroVM08,
  title = {Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation},
  author = {Dotan Di Castro and Dmitry Volkinshtein and Ron Meir},
  year = {2008},
  url = {http://books.nips.cc/papers/files/nips21/NIPS2008_0437.pdf},
  tags = {rule-based},
  researchr = {https://researchr.org/publication/CastroVM08},
  cites = {0},
  citedby = {0},
  pages = {385-392},
  booktitle = {Advances in Neural Information Processing Systems 21, Proceedings of the Twenty-Second Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 8-11, 2008},
  editor = {Daphne Koller and Dale Schuurmans and Yoshua Bengio and Léon Bottou},
  publisher = {MIT Press},
}