An adaptive policy gradient in learning Nash equilibria

Huaxiang Zhang, Ying Fan. An adaptive policy gradient in learning Nash equilibria. Neurocomputing, 72(1-3):533-538, 2008. [doi]

Abstract

Abstract is missing.