Jan Svec, Jan Hoidekr, Daniel Soutner, Jan Vavruska. Web Text Data Mining for Building Large Scale Language Modelling Corpus. In Ivan Habernal, Václav Matousek, editors, Text, Speech and Dialogue - 14th International Conference, TSD 2011, Pilsen, Czech Republic, September 1-5, 2011. Proceedings. Volume 6836 of Lecture Notes in Computer Science, pages 356-363, Springer, 2011.