Reinforcement Learning based on MPC and the Stochastic Policy Gradient Method

Sébastien Gros, Mario Zanon. Reinforcement Learning based on MPC and the Stochastic Policy Gradient Method. In 2021 American Control Conference, ACC 2021, New Orleans, LA, USA, May 25-28, 2021. Pages 1947-1952, IEEE, 2021.

Authors

Sébastien Gros

Mario Zanon