Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay

Dogan C. Cicek, Enes Duran, Baturay Saglam, Furkan B. Mutlu, Suleyman S. Kozat. Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay. In 33rd IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2021, Washington, DC, USA, November 1-3, 2021. Pages 1255-1262, IEEE, 2021. doi: 10.1109/ICTAI52525.2021.00199

@inproceedings{CicekDSMK21,
  title = {Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay},
  author = {Dogan C. Cicek and Enes Duran and Baturay Saglam and Furkan B. Mutlu and Suleyman S. Kozat},
  year = {2021},
  doi = {10.1109/ICTAI52525.2021.00199},
  url = {https://doi.org/10.1109/ICTAI52525.2021.00199},
  researchr = {https://researchr.org/publication/CicekDSMK21},
  cites = {0},
  citedby = {0},
  pages = {1255--1262},
  booktitle = {33rd IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2021, Washington, DC, USA, November 1-3, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-0898-1},
}