Prioritized Experience Replay based on Multi-armed Bandit

Ximing Liu, Tianqing Zhu, Cuiqing Jiang, Dayong Ye, Fuqing Zhao. Prioritized Experience Replay based on Multi-armed Bandit. Expert Syst. Appl., 189:116023, 2022. [doi]

Abstract

Abstract is missing.