N-gram Counts and Language Models from the Common Crawl

Christian Buck, Kenneth Heafield, Bas van Ooyen. N-gram Counts and Language Models from the Common Crawl. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, AsunciĆ³n Moreno, Jan Odijk, Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014), Reykjavik, Iceland, May 26-31, 2014. pages 3579-3584, European Language Resources Association (ELRA), 2014. [doi]

Abstract

Abstract is missing.