Alleviating the estimation bias of deep deterministic policy gradient via co-regularization

Yao Li, Yuhui Wang, Yaozhong Gan, Xiaoyang Tan. Alleviating the estimation bias of deep deterministic policy gradient via co-regularization. Pattern Recognition, 131:108872, 2022. [doi]

Abstract

Abstract is missing.