A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information

T. E. S. Raghavan, Zamir Syed. A policy-improvement type algorithm for solving zero-sum two-person stochastic games of perfect information. Math. Program., 95(3):513-532, 2003.