Offline Reinforcement Learning With Behavior Value Regularization

Longyang Huang, Botao Dong, Wei Xie, Weidong Zhang. Offline Reinforcement Learning With Behavior Value Regularization. IEEE T. Cybernetics, 54(6):3692-3704, June 2024. [doi]

Abstract

Abstract is missing.