HARM: Learning Hate-Aware Reward Model for Evaluating Natural Language Explanations of Offensive Content

Lorenzo Puppi Vecchi, Alceu de Souza Britto Jr., Emerson Cabrera Paraiso, Rafael M. O. Cruz. HARM: Learning Hate-Aware Reward Model for Evaluating Natural Language Explanations of Offensive Content. In Vera Demberg, Kentaro Inui, Lluís Marquez, editors, Findings of the Association for Computational Linguistics: EACL 2026, Rabat, Morocco, March 24-29, 2026. pages 4393-4431, Association for Computational Linguistics, 2026.
