Rethinking exploration-exploitation trade-off in reinforcement learning via cognitive consistency

Da Wang, Wei Wei, Lin Li, Xin Wang, Jiye Liang. Rethinking exploration-exploitation trade-off in reinforcement learning via cognitive consistency. Neural Networks, 187:107342, 2025.
