Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland

Tommi Jauhiainen, Jussi Piitulainen, Erik Axelson, Krister Lindén. Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland. In Karl Berglund, Matti La Mela, Inge Zwart, editors, Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Uppsala, Sweden, March 15-18, 2022. Volume 3232 of CEUR Workshop Proceedings, pages 251-259, CEUR-WS.org, 2022.

@inproceedings{JauhiainenPAL22,
  title = {Language Identification as part of the Text Corpus Creation Pipeline at the Language Bank of Finland},
  author = {Tommi Jauhiainen and Jussi Piitulainen and Erik Axelson and Krister Lindén},
  year = {2022},
  url = {http://ceur-ws.org/Vol-3232/paper23.pdf},
  researchr = {https://researchr.org/publication/JauhiainenPAL22},
  pages = {251--259},
  booktitle = {Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Uppsala, Sweden, March 15-18, 2022},
  editor = {Karl Berglund and Matti La Mela and Inge Zwart},
  volume = {3232},
  series = {CEUR Workshop Proceedings},
  publisher = {CEUR-WS.org},
}