Reinforcement Learning based on MPC and the Stochastic Policy Gradient Method

Sébastien Gros, Mario Zanon. Reinforcement Learning based on MPC and the Stochastic Policy Gradient Method. In 2021 American Control Conference, ACC 2021, New Orleans, LA, USA, May 25-28, 2021. pages 1947-1952, IEEE, 2021. [doi]

Abstract

Abstract is missing.