Multi-agent off-policy actor-critic algorithm for distributed multi-task reinforcement learning

Milos S. Stankovic, Marko Beko, Nemanja Ilic, Srdjan S. Stankovic. Multi-agent off-policy actor-critic algorithm for distributed multi-task reinforcement learning. Eur. J. Control, 74:100853, November 2023. [doi]

Abstract

Abstract is missing.