Prioritized Experience Replay based on Multi-armed Bandit

Ximing Liu, Tianqing Zhu, Cuiqing Jiang, Dayong Ye, Fuqing Zhao. Prioritized Experience Replay based on Multi-armed Bandit. Expert Syst. Appl., 189:116023, 2022. [doi]

Authors

Ximing Liu

This author has not been identified. Look up 'Ximing Liu' in Google

Tianqing Zhu

This author has not been identified. Look up 'Tianqing Zhu' in Google

Cuiqing Jiang

This author has not been identified. Look up 'Cuiqing Jiang' in Google

Dayong Ye

This author has not been identified. Look up 'Dayong Ye' in Google

Fuqing Zhao

This author has not been identified. Look up 'Fuqing Zhao' in Google