A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning

Zhiyou Yang, Hong Qu, Mingsheng Fu, Wang Hu, Yongze Zhao. A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning. IEEE T. Cybernetics, 53(3):1499-1510, March 2023. [doi]

Abstract

Abstract is missing.