Logarithmic Regret for Episodic Continuous-Time Linear-Quadratic Reinforcement Learning over a Finite-Time Horizon

Matteo Basei, Xin Guo, Anran Hu, Yufei Zhang. Logarithmic Regret for Episodic Continuous-Time Linear-Quadratic Reinforcement Learning over a Finite-Time Horizon. Journal of Machine Learning Research, 23, 2022. [doi]

Abstract

Abstract is missing.