Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland

Tommi Jauhiainen, Jussi Piitulainen, Erik Axelson, Krister Lindén. Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland. In Karl Berglund, Matti La Mela, Inge Zwart, editors, Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Uppsala, Sweden, March 15-18, 2022. Volume 3232 of CEUR Workshop Proceedings, pages 251-259, CEUR-WS.org, 2022. [doi]

Authors

Tommi Jauhiainen

This author has not been identified. Look up 'Tommi Jauhiainen' in Google

Jussi Piitulainen

This author has not been identified. Look up 'Jussi Piitulainen' in Google

Erik Axelson

This author has not been identified. Look up 'Erik Axelson' in Google

Krister Lindén

This author has not been identified. Look up 'Krister Lindén' in Google