Optimization for Reinforcement Learning: From a single agent to cooperative agents

Donghwan Lee 0002, Niao He, Parameswaran Kamalaruban, Volkan Cevher. Optimization for Reinforcement Learning: From a single agent to cooperative agents. IEEE Signal Process. Mag., 37(3):123-135, 2020. [doi]

Abstract

Abstract is missing.