Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay

Dogan C. Cicek, Enes Duran, Baturay Saglam, Furkan B. Mutlu, Suleyman S. Kozat. Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay. In 33rd IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2021, Washington, DC, USA, November 1-3, 2021. Pages 1255-1262, IEEE, 2021.

Authors

Dogan C. Cicek


Enes Duran


Baturay Saglam


Furkan B. Mutlu


Suleyman S. Kozat
