Deterministic policy optimization with clipped value expansion and long-horizon planning

ShiQing Gao, Haibo Shi, Fang Wang, Zijian Wang, Siyu Zhang, Yunxia Li, Yaoru Sun. Deterministic policy optimization with clipped value expansion and long-horizon planning. Neurocomputing, 483:299-310, 2022. [doi]

Authors

ShiQing Gao

This author has not been identified. Look up 'ShiQing Gao' in Google

Haibo Shi

This author has not been identified. Look up 'Haibo Shi' in Google

Fang Wang

This author has not been identified. Look up 'Fang Wang' in Google

Zijian Wang

This author has not been identified. Look up 'Zijian Wang' in Google

Siyu Zhang

This author has not been identified. Look up 'Siyu Zhang' in Google

Yunxia Li

This author has not been identified. Look up 'Yunxia Li' in Google

Yaoru Sun

This author has not been identified. Look up 'Yaoru Sun' in Google