Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability

Ali Devran Kara, Serdar Yüksel. Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability. Math. Oper. Res., 48(4):2066-2093, 2023. [doi]

Abstract

Abstract is missing.