Single-shot policy explanation to improve task performance via semantic reward coaching - researchr publication

researchr

You are not signed in
Sign in
Sign up

Aaquib Tabrez, Ryan Leonard, Bradley Hayes. Single-shot policy explanation to improve task performance via semantic reward coaching. Neural Computing and Applications, 37(26):22315-22337, September 2025. [doi]

Abstract is missing.

runs on WebDSL