Student-t policy in reinforcement learning to acquire global optimum of robot control

Taisuke Kobayashi. Student-t policy in reinforcement learning to acquire global optimum of robot control. Appl. Intell., 49(12):4335-4347, 2019.
