DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition?

Urja Khurana, Eric T. Nalisnick, Antske Fokkens. DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition?. In Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa 0001, Barbara Di Eugenio, Steven Schockaert, editors, Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025, Abu Dhabi, UAE, January 19-24, 2025. pages 4341-4358, Association for Computational Linguistics, 2025. [doi]

@inproceedings{KhuranaNF25,
  title = {DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition?},
  author = {Urja Khurana and Eric T. Nalisnick and Antske Fokkens},
  year = {2025},
  url = {https://aclanthology.org/2025.coling-main.293/},
  researchr = {https://researchr.org/publication/KhuranaNF25},
  cites = {0},
  citedby = {0},
  pages = {4341-4358},
  booktitle = {Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025, Abu Dhabi, UAE, January 19-24, 2025},
  editor = {Owen Rambow and Leo Wanner and Marianna Apidianaki and Hend Al-Khalifa 0001 and Barbara Di Eugenio and Steven Schockaert},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-196-4},
}