The Unintended Trade-off of AI Alignment: Balancing Hallucination Mitigation and Safety in LLMs

Omar Mahmoud, Ali Khalil, Thommen George Karimpanal, Buddhika Laknath Semage, Santu Rana. The Unintended Trade-off of AI Alignment: Balancing Hallucination Mitigation and Safety in LLMs. In Vera Demberg, Kentaro Inui, LluĂ­s Marquez, editors, Findings of the Association for Computational Linguistics: EACL 2026, Rabat, Morocco, March 24-29, 2026. pages 1017-1037, Association for Computational Linguistics, 2026. [doi]

Abstract

Abstract is missing.