Deterministic policy optimization with clipped value expansion and long-horizon planning

ShiQing Gao, Haibo Shi, Fang Wang, Zijian Wang, Siyu Zhang, Yunxia Li, Yaoru Sun. Deterministic policy optimization with clipped value expansion and long-horizon planning. Neurocomputing, 483:299-310, 2022. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.