Convex Programs and Lyapunov Functions for Reinforcement Learning: A Unified Perspective on the Analysis of Value-Based Methods

Xingang Guo, Bin Hu. Convex Programs and Lyapunov Functions for Reinforcement Learning: A Unified Perspective on the Analysis of Value-Based Methods. In American Control Conference, ACC 2022, Atlanta, GA, USA, June 8-10, 2022. pages 3317-3322, IEEE, 2022. [doi]

Abstract

Abstract is missing.