Single-shot policy explanation to improve task performance via semantic reward coaching

Aaquib Tabrez, Ryan Leonard, Bradley Hayes. Single-shot policy explanation to improve task performance via semantic reward coaching. Neural Computing and Applications, 37(26):22315-22337, September 2025.
