Deterministic policy optimization with clipped value expansion and long-horizon planning

ShiQing Gao, Haibo Shi, Fang Wang, Zijian Wang, Siyu Zhang, Yunxia Li, Yaoru Sun. Deterministic policy optimization with clipped value expansion and long-horizon planning. Neurocomputing, 483:299-310, 2022.

Abstract

Abstract is missing.