ORAD: a new framework of offline Reinforcement Learning with Q-value regularization

Longfei Zhang, Yulong Zhang, Shixuan Liu, Li Chen, Xingxing Liang, Guangquan Cheng, Zhong Liu. ORAD: a new framework of offline Reinforcement Learning with Q-value regularization. Evolutionary Intelligence, 17(1):339-347, 2024.

Abstract is missing.
