Advantage based value iteration for Markov decision processes with unknown rewards - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Pegah Alizadeh, Yann Chevaleyre, François Lévy. Advantage based value iteration for Markov decision processes with unknown rewards. In 2016 International Joint Conference on Neural Networks, IJCNN 2016, Vancouver, BC, Canada, July 24-29, 2016. pages 3837-3844, IEEE, 2016. [doi]

This author has not been identified. Look up 'Pegah Alizadeh' in GoogleThis author has not been identified. Look up 'Yann Chevaleyre' in GoogleThis author has not been identified. Look up 'François Lévy' in Google

runs on WebDSL