Adaptive and multiple time-scale eligibility traces for online deep reinforcement learning

Taisuke Kobayashi. Adaptive and multiple time-scale eligibility traces for online deep reinforcement learning. Robotics and Autonomous Systems, 151:104019, 2022. [doi]

Abstract

Abstract is missing.