Align before Attend: Aligning Visual and Textual Features for Multimodal Hateful Content Detection

Eftekhar Hossain, Omar Sharif, Mohammed Moshiul Hoque, Sarah Masud Preum. Align before Attend: Aligning Visual and Textual Features for Multimodal Hateful Content Detection. In Neele Falk, Sara Papi, Mike Zhang, editors, Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2024: Student Research Workshop, St. Julian's, Malta, March 21-22, 2024. pages 162-174, Association for Computational Linguistics, 2024.
