Reward shaping via expectation maximization method

Zelin Deng, Xing Liu, Yunlong Dong. Reward shaping via expectation maximization method. Neurocomputing, 609:128471, 2024.

Abstract

Abstract is missing.