Uniqueness of optimal policies as a generic property of discounted Markov decision processes: Ekeland's variational principle approach

R. Israel Ortega-Gutiérrez, Raúl Montes-De-Oca, Enrique Lemus-Rodríguez. Uniqueness of optimal policies as a generic property of discounted Markov decision processes: Ekeland's variational principle approach. Kybernetika, 52(1):66-75, 2016. [doi]

Abstract

Abstract is missing.