HateVersarial: Adversarial Attack Against Hate Speech Detection Algorithms on Twitter

Edita Grolman, Hodaya Binyamini, Asaf Shabtai, Yuval Elovici, Ikuya Morikawa, Toshiya Shimizu. HateVersarial: Adversarial Attack Against Hate Speech Detection Algorithms on Twitter. In UMAP '22: 30th ACM Conference on User Modeling, Adaptation and Personalization, Barcelona, Spain, July 4-7, 2022, pages 143-152. ACM, 2022.