Cournot Policy Model: Rethinking centralized training in multi-agent reinforcement learning

Jingchen Li, Yusen Yang, Ziming He, Huarui Wu, Haobin Shi, Wenbai Chen. Cournot Policy Model: Rethinking centralized training in multi-agent reinforcement learning. Inf. Sci., 677:120983, 2024.

Abstract

Abstract is missing.