Approximate Policy Iteration With Deep Minimax Average Bellman Error Minimization

Lican Kang, Yuhui Liu, Yuan Luo, Jerry Zhijian Yang, Han Yuan, Chang Zhu. Approximate Policy Iteration With Deep Minimax Average Bellman Error Minimization. IEEE Transactions on Neural Networks, 36(2):2288-2299, February 2025. [doi]

Abstract

Abstract is missing.