Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation

A. Seza Dogruöz, Sunayana Sitaram, Zheng Xin Yong. Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 5751-5767, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.