Web Text Data Mining for Building Large Scale Language Modelling Corpus

Jan Svec, Jan Hoidekr, Daniel Soutner, Jan Vavruska. Web Text Data Mining for Building Large Scale Language Modelling Corpus. In Ivan Habernal, Václav Matousek, editors, Text, Speech and Dialogue - 14th International Conference, TSD 2011, Pilsen, Czech Republic, September 1-5, 2011. Proceedings. Volume 6836 of Lecture Notes in Computer Science, pages 356-363, Springer, 2011.

Abstract

Abstract is missing.