Dynamic Evaluation for Oversensitivity in LLMs

Sophia Xiao Pu, Sitao Cheng, Xin Eric Wang, William Yang Wang. Dynamic Evaluation for Oversensitivity in LLMs. In Christos Christodoulopoulos 0001, Tanmoy Chakraborty 0002, Carolyn Rose, Violet Peng, editors, Findings of the Association for Computational Linguistics: EMNLP 2025, Suzhou, China, November 4-9, 2025. pages 2337-2344, Association for Computational Linguistics, 2025. [doi]

Abstract

Abstract is missing.