M-A3C: A Mean-Asynchronous Advantage Actor-Critic Reinforcement Learning Method for Real-Time Gait Planning of Biped Robot

Jie Leng, Suozhong Fan, Jun Tang, Haiming Mou, Junxiao Xue, Qingdu Li. M-A3C: A Mean-Asynchronous Advantage Actor-Critic Reinforcement Learning Method for Real-Time Gait Planning of Biped Robot. IEEE Access, 10:76523-76536, 2022.
