Should one compute the Temporal Difference fix point or minimize the Bellman Residual? The unified oblique projection view

Bruno Scherrer. Should one compute the Temporal Difference fix point or minimize the Bellman Residual? The unified oblique projection view. In Johannes Fürnkranz and Thorsten Joachims, editors, Proceedings of the 27th International Conference on Machine Learning (ICML-10), June 21-24, 2010, Haifa, Israel, pages 959-966. Omnipress, 2010.

Authors

Bruno Scherrer
