TSPPO: transformer-based sequential proximal policy optimization for multi-agent systems

Tao Yang, Yuxiao Gao, Cheng Xu 0005, Hongzhe Liu 0001. TSPPO: transformer-based sequential proximal policy optimization for multi-agent systems. Multimedia Syst., 32(2):118, April 2026. [doi]

Abstract

Abstract is missing.