Student-t policy in reinforcement learning to acquire global optimum of robot control

Taisuke Kobayashi. Student-t policy in reinforcement learning to acquire global optimum of robot control. Appl. Intell., 49(12):4335-4347, 2019. [doi]

Authors

Taisuke Kobayashi

This author has not been identified. Look up 'Taisuke Kobayashi' in Google