Student-t policy in reinforcement learning to acquire global optimum of robot control

Taisuke Kobayashi. Student-t policy in reinforcement learning to acquire global optimum of robot control. Appl. Intell., 49(12):4335-4347, 2019.
