ORAD: a new framework of offline Reinforcement Learning with Q-value regularization

Longfei Zhang, Yulong Zhang, Shixuan Liu, Li Chen 0015, Xingxing Liang, Guangquan Cheng, Zhong Liu. ORAD: a new framework of offline Reinforcement Learning with Q-value regularization. Evolutionary Intelligence, 17(1):339-347, 2024. [doi]

Abstract

Abstract is missing.