Provable generalization of clipped double Q-learning for variance reduction and sample efficiency

Jangwon Kim, Jiseok Jeong, Soohee Han. Provable generalization of clipped double Q-learning for variance reduction and sample efficiency. Neurocomputing, 673:132772, 2026.
