Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion

Rolando Cavazos-Cadena, Raúl Montes-De-Oca, Karel Sladký. Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion. J. Applied Probability, 52(2):419-440, 2015. [doi]

Authors

Rolando Cavazos-Cadena

This author has not been identified. Look up 'Rolando Cavazos-Cadena' in Google

Raúl Montes-De-Oca

This author has not been identified. Look up 'Raúl Montes-De-Oca' in Google

Karel Sladký

This author has not been identified. Look up 'Karel Sladký' in Google