Improving Document Clustering by Removing Unnatural Language

Myungha Jang, Jinho D. Choi, James Allan. Improving Document Clustering by Removing Unnatural Language. In Leon Derczynski, Wei Xu, Alan Ritter, Tim Baldwin, editors, Proceedings of the 3rd Workshop on Noisy User-generated Text, NUT@EMNLP 2017, Copenhagen, Denmark, September 7, 2017. Pages 122-130, Association for Computational Linguistics, 2017.

Abstract

Abstract is missing.