A CNN-based policy for optimizing continuous action control by learning state sequences

Tianyi Huang, Min Li, Xiaolong Qin, William Zhu. A CNN-based policy for optimizing continuous action control by learning state sequences. Neurocomputing, 468:286-295, 2022. [doi]

Abstract

Abstract is missing.