Stochastic policy gradient reinforcement learning on a simple 3D biped

Russ Tedrake, Teresa Weirui Zhang, H. Sebastian Seung. Stochastic policy gradient reinforcement learning on a simple 3D biped. In 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28 - October 2, 2004. pages 2849-2854, IEEE, 2004. [doi]

Abstract

Abstract is missing.