Deterministic policies based on maximum regrets in MDPs with imprecise rewards

Pegah Alizadeh, Emiliano Traversi, Aomar Osmani. Deterministic policies based on maximum regrets in MDPs with imprecise rewards. AI Commun., 34(4):229-244, 2021.

@article{AlizadehTO21,
  title = {Deterministic policies based on maximum regrets in MDPs with imprecise rewards},
  author = {Pegah Alizadeh and Emiliano Traversi and Aomar Osmani},
  year = {2021},
  doi = {10.3233/AIC-190632},
  url = {https://doi.org/10.3233/AIC-190632},
  researchr = {https://researchr.org/publication/AlizadehTO21},
  journal = {AI Commun.},
  volume = {34},
  number = {4},
  pages = {229--244},
}