Learning in Two-Player Matrix Games by Policy Gradient Lagging Anchor

Shiyao Ding, Toshimitsu Ushio. Learning in Two-Player Matrix Games by Policy Gradient Lagging Anchor. IEICE Transactions, 102-A(4):708-711, 2019.
