Off-policy inverse Q-learning for discrete-time antagonistic unknown systems

Bosen Lian, Wenqian Xue, Yijing Xie, Frank L. Lewis, Ali Davoudi. Off-policy inverse Q-learning for discrete-time antagonistic unknown systems. Automatica, 155:111171, September 2023.
