Alleviating the estimation bias of deep deterministic policy gradient via co-regularization

Yao Li, Yuhui Wang, Yaozhong Gan, Xiaoyang Tan. Alleviating the estimation bias of deep deterministic policy gradient via co-regularization. Pattern Recognition, 131:108872, 2022.

Authors

Yao Li

Yuhui Wang

Yaozhong Gan

Xiaoyang Tan
