Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning

Alex Beeson, Giovanni Montana. Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning. Machine Learning, 113(1):443-488, January 2024. [doi]

Abstract

Abstract is missing.