Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems

Victor G. Lopez, Mohammad Alsalti, Matthias A. Müller 0001. Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems. IEEE Trans. Automat. Contr., 68(5):2922-2933, May 2023. [doi]

Abstract

Abstract is missing.