Deterministic policies based on maximum regrets in MDPs with imprecise rewards

Pegah Alizadeh, Emiliano Traversi, Aomar Osmani. Deterministic policies based on maximum regrets in MDPs with imprecise rewards. AI Commun., 34(4):229-244, 2021. [doi]

No reviews for this publication, yet.