The Norwegian Colossal Corpus: A Text Corpus for Training Large Norwegian Language Models

Per Egil Kummervold, Freddy Wetjen, Javier de la Rosa. The Norwegian Colossal Corpus: A Text Corpus for Training Large Norwegian Language Models. In Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis, editors, Proceedings of the Thirteenth Language Resources and Evaluation Conference, LREC 2022, Marseille, France, 20-25 June 2022. pages 3852-3860, European Language Resources Association, 2022. [doi]

Authors

Per Egil Kummervold

This author has not been identified. Look up 'Per Egil Kummervold' in Google

Freddy Wetjen

This author has not been identified. Look up 'Freddy Wetjen' in Google

Javier de la Rosa

This author has not been identified. Look up 'Javier de la Rosa' in Google