Language Identification of Short Text Segments with N-gram Models

Tommi Vatanen, Jaakko J. Väyrynen, Sami Virpioja. Language Identification of Short Text Segments with N-gram Models. In Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias, editors, Proceedings of the International Conference on Language Resources and Evaluation, LREC 2010, 17-23 May 2010, Valletta, Malta. European Language Resources Association, 2010. [doi]

Abstract

Abstract is missing.