A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies

Phalguni Nanda, Zaiwei Chen. A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies. In Abstracts of the 2026 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS Abstracts 2026, Ann Arbor, MI, USA, June 8-12, 2026. pages 213-215, ACM, 2026. [doi]

Abstract

Abstract is missing.