Alleviating the estimation bias of deep deterministic policy gradient via co-regularization

Yao Li, Yuhui Wang, Yaozhong Gan, Xiaoyang Tan. Alleviating the estimation bias of deep deterministic policy gradient via co-regularization. Pattern Recognition, 131:108872, 2022. doi:10.1016/j.patcog.2022.108872

@article{LiWGT22,
  title = {Alleviating the estimation bias of deep deterministic policy gradient via co-regularization},
  author = {Yao Li and Yuhui Wang and Yaozhong Gan and Xiaoyang Tan},
  year = {2022},
  doi = {10.1016/j.patcog.2022.108872},
  url = {https://doi.org/10.1016/j.patcog.2022.108872},
  researchr = {https://researchr.org/publication/LiWGT22},
  cites = {0},
  citedby = {0},
  journal = {Pattern Recognition},
  volume = {131},
  pages = {108872},
}