Koala: An Index for Quantifying Overlaps with Pre-training Corpora

Thuy-Trang Vu, Xuanli He, Gholamreza Haffari, Ehsan Shareghi. Koala: An Index for Quantifying Overlaps with Pre-training Corpora. In Yansong Feng, Els Lefever, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023 - System Demonstrations, Singapore, December 6-10, 2023. pages 90-98, Association for Computational Linguistics, 2023.
