Stochastic policy gradient reinforcement learning on a simple 3D biped

Russ Tedrake, Teresa Weirui Zhang, H. Sebastian Seung. Stochastic policy gradient reinforcement learning on a simple 3D biped. In 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28 - October 2, 2004. pages 2849-2854, IEEE, 2004. [doi]

Authors

Russ Tedrake

This author has not been identified. Look up 'Russ Tedrake' in Google

Teresa Weirui Zhang

This author has not been identified. Look up 'Teresa Weirui Zhang' in Google

H. Sebastian Seung

This author has not been identified. Look up 'H. Sebastian Seung' in Google