Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms

Vaneet Aggarwal, Washim Uddin Mondal, Qinbo Bai. Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms. Foundations and Trends in Optimization, 6(4):193-298, 2024. [doi]

Abstract

Abstract is missing.