Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering

Helena Bonaldi, Greta Damo, Nicolas Ocampo, Elena Cabrio, Serena Villata, Marco Guerini. Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024, pages 3446-3463. Association for Computational Linguistics, 2024.

@inproceedings{BonaldiDOCVG24,
  title = {Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering},
  author = {Helena Bonaldi and Greta Damo and Nicolas Ocampo and Elena Cabrio and Serena Villata and Marco Guerini},
  year = {2024},
  url = {https://aclanthology.org/2024.emnlp-main.201},
  researchr = {https://researchr.org/publication/BonaldiDOCVG24},
  cites = {0},
  citedby = {0},
  pages = {3446--3463},
  booktitle = {Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024},
  editor = {Yaser Al-Onaizan and Mohit Bansal and Yun-Nung Chen},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-164-3},
}