QuadQ: Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning

Siying Wang 0002, Ruoning Zhang, Yang Zhou, Jinliang Shao, Yuhua Cheng 0001. QuadQ: Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning. IEEE CAA J. Autom. Sinica, 13(3):728-730, March 2026. [doi]

Abstract

Abstract is missing.