Toward Off-Policy Learning Control with Function Approximation

Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton. Toward Off-Policy Learning Control with Function Approximation. In Johannes Fürnkranz, Thorsten Joachims, editors, Proceedings of the 27th International Conference on Machine Learning (ICML-10), June 21-24, 2010, Haifa, Israel. pages 719-726, Omnipress, 2010. [doi]

@inproceedings{MaeiSBS10,
  title = {Toward Off-Policy Learning Control with Function Approximation},
  author = {Hamid Reza Maei and Csaba Szepesvári and Shalabh Bhatnagar and Richard S. Sutton},
  year = {2010},
  url = {http://www.icml2010.org/papers/627.pdf},
  researchr = {https://researchr.org/publication/MaeiSBS10},
  cites = {0},
  citedby = {0},
  pages = {719-726},
  booktitle = {Proceedings of the 27th International Conference on Machine Learning (ICML-10), June 21-24, 2010, Haifa, Israel},
  editor = {Johannes Fürnkranz and Thorsten Joachims},
  publisher = {Omnipress},
}