Kun Dong, Yongle Luo, Yuxin Wang, Yu Liu, Chengeng Qu, Qiang Zhang, Erkang Cheng, Zhiyong Sun, Bo Song. Dyna-style Model-based reinforcement learning with Model-Free Policy Optimization. Knowl.-Based Syst., 287:111428, 2024. [doi]
Abstract is missing.