Optimizing the Size of Subword Vocabularies in Dialect Classification

Vani Kanjirangat, Tanja Samardzic, Ljiljana Dolamic, Fabio Rinaldi 0001. Optimizing the Size of Subword Vocabularies in Dialect Classification. In Yves Scherrer, Tommi Jauhiainen, Nikola Ljubesic, Preslav Nakov, Jörg Tiedemann, Marcos Zampieri, editors, Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@EACL 2023, Dubrovnik, Croatia, May 5, 2023. pages 14-30, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.