Learning to attend and reorder: Scalable policy optimization in large-scale multi-agent systems

Zhaohan Feng, Wei Xiao, Jian Sun, Jie Chen, Gang Wang. Learning to attend and reorder: Scalable policy optimization in large-scale multi-agent systems. Neurocomputing, 671:132646, 2026.
