Learning in Two-Player Matrix Games by Policy Gradient Lagging Anchor

Shiyao Ding, Toshimitsu Ushio. Learning in Two-Player Matrix Games by Policy Gradient Lagging Anchor. IEICE Transactions, 102-A(4):708-711, 2019. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.