Sébastien Gros, Mario Zanon. Reinforcement Learning based on MPC and the Stochastic Policy Gradient Method. In 2021 American Control Conference, ACC 2021, New Orleans, LA, USA, May 25-28, 2021. pages 1947-1952, IEEE, 2021. [doi]
No references recorded for this publication.
No citations of this publication recorded.