Deterministic policies based on maximum regrets in MDPs with imprecise rewards

Pegah Alizadeh, Emiliano Traversi, Aomar Osmani. Deterministic policies based on maximum regrets in MDPs with imprecise rewards. AI Commun., 34(4):229-244, 2021.

Authors

Pegah Alizadeh

Emiliano Traversi

Aomar Osmani
