Prioritized Experience Replay based on Multi-armed Bandit

Ximing Liu, Tianqing Zhu, Cuiqing Jiang, Dayong Ye, Fuqing Zhao. Prioritized Experience Replay based on Multi-armed Bandit. Expert Syst. Appl., 189:116023, 2022. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: