Toward Off-Policy Learning Control with Function Approximation

Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton. Toward Off-Policy Learning Control with Function Approximation. In Johannes Fürnkranz and Thorsten Joachims, editors, Proceedings of the 27th International Conference on Machine Learning (ICML-10), June 21-24, 2010, Haifa, Israel, pages 719-726. Omnipress, 2010.

Authors

Hamid Reza Maei

Csaba Szepesvári

Shalabh Bhatnagar

Richard S. Sutton
