DELPHI: Data for Evaluating LLMs' Performance in Handling Controversial Issues

David Q. Sun, Artem Abzaliev, Hadas Kotek, Christopher Klein, Zidi Xiu, Jason D. Williams. DELPHI: Data for Evaluating LLMs' Performance in Handling Controversial Issues. In Mingxuan Wang, Imed Zitouni, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023 - Industry Track, Singapore, December 6-10, 2023, pages 820-827. Association for Computational Linguistics, 2023.

Authors

David Q. Sun

Artem Abzaliev

Hadas Kotek

Christopher Klein

Zidi Xiu

Jason D. Williams
