Deterministic policies based on maximum regrets in MDPs with imprecise rewards

Pegah Alizadeh, Emiliano Traversi, Aomar Osmani. Deterministic policies based on maximum regrets in MDPs with imprecise rewards. AI Commun., 34(4):229-244, 2021.

Abstract

Abstract is missing.