Trust Region Policy Optimization

John Schulman, Sergey Levine, Pieter Abbeel, Michael I. Jordan, Philipp Moritz. Trust Region Policy Optimization. In Francis R. Bach, David M. Blei, editors, Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, Lille, France, 6-11 July 2015. Volume 37 of JMLR Proceedings, pages 1889-1897, JMLR.org, 2015. [doi]

Abstract

Abstract is missing.