Finite-Time Analysis for the Knowledge-Gradient Policy

Yingfei Wang, Warren B. Powell. Finite-Time Analysis for the Knowledge-Gradient Policy. SIAM J. Control and Optimization, 56(2):1105-1129, 2018. [doi]

Abstract

Abstract is missing.