Bias Correction in Reinforcement Learning via the Deterministic Policy Gradient Method for MPC-Based Policies

Sébastien Gros, Mario Zanon. Bias Correction in Reinforcement Learning via the Deterministic Policy Gradient Method for MPC-Based Policies. In 2021 American Control Conference, ACC 2021, New Orleans, LA, USA, May 25-28, 2021. pages 2543-2548, IEEE, 2021. [doi]

Abstract

Abstract is missing.