Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland

Tommi Jauhiainen, Jussi Piitulainen, Erik Axelson, Krister Lindén. Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland. In Karl Berglund, Matti La Mela, Inge Zwart, editors, Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Uppsala, Sweden, March 15-18, 2022. Volume 3232 of CEUR Workshop Proceedings, pages 251-259, CEUR-WS.org, 2022. [doi]

Abstract

Abstract is missing.