Reinforcement-learning agents with different temperature parameters explain the variety of human action-selection behavior in a Markov decision process task

Fumihiko Ishida, Takahiro Sasaki, Yutaka Sakaguchi, Hiroyuki Shimai. Reinforcement-learning agents with different temperature parameters explain the variety of human action-selection behavior in a Markov decision process task. Neurocomputing, 72(7-9):1979-1984, 2009.

Abstract

Abstract is missing.