N-gram Counts and Language Models from the Common Crawl

Christian Buck, Kenneth Heafield, Bas van Ooyen. N-gram Counts and Language Models from the Common Crawl. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, AsunciĆ³n Moreno, Jan Odijk, Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014), Reykjavik, Iceland, May 26-31, 2014. pages 3579-3584, European Language Resources Association (ELRA), 2014. [doi]

Authors

Christian Buck

This author has not been identified. Look up 'Christian Buck' in Google

Kenneth Heafield

This author has not been identified. Look up 'Kenneth Heafield' in Google

Bas van Ooyen

This author has not been identified. Look up 'Bas van Ooyen' in Google