A policy gradient method for semi-Markov decision processes with application to call admission control

Sumeetpal S. Singh, Vladislav B. Tadic, Arnaud Doucet. A policy gradient method for semi-Markov decision processes with application to call admission control. European Journal of Operational Research, 178(3):808-818, 2007. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.