Bayesian Policy Gradient Algorithms

Mohammad Ghavamzadeh, Yaakov Engel. Bayesian Policy Gradient Algorithms. In Bernhard Schölkopf, John C. Platt, Thomas Hofmann, editors, Advances in Neural Information Processing Systems 19, Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 4-7, 2006, pages 457-464. MIT Press, 2006.

@inproceedings{GhavamzadehE06,
  title = {Bayesian Policy Gradient Algorithms},
  author = {Mohammad Ghavamzadeh and Yaakov Engel},
  year = {2006},
  url = {http://books.nips.cc/papers/files/nips19/NIPS2006_0865.pdf},
  researchr = {https://researchr.org/publication/GhavamzadehE06},
  cites = {0},
  citedby = {0},
  pages = {457--464},
  booktitle = {Advances in Neural Information Processing Systems 19, Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 4-7, 2006},
  editor = {Bernhard Schölkopf and John C. Platt and Thomas Hofmann},
  publisher = {MIT Press},
  isbn = {0-262-19568-2},
}