Meme Trojan: Backdoor Attacks Against Hateful Meme Detection via Cross-Modal Triggers

Ruofei Wang, Hongzhan Lin, Ziyuan Luo, Ka-Chun Cheung, Simon See, Jing Ma, Renjie Wan. Meme Trojan: Backdoor Attacks Against Hateful Meme Detection via Cross-Modal Triggers. In Toby Walsh, Julie Shah, Zico Kolter, editors, AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA. Pages 7844-7852, AAAI Press, 2025.
