uniblock: Scoring and Filtering Corpus with Unicode Block Information

Yingbo Gao, Weiyue Wang, Hermann Ney. uniblock: Scoring and Filtering Corpus with Unicode Block Information. In Kentaro Inui, Jing Jiang, Vincent Ng, Xiaojun Wan 0001, editors, Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, November 3-7, 2019. pages 1324-1329, Association for Computational Linguistics, 2019. [doi]

Authors

Yingbo Gao

This author has not been identified. Look up 'Yingbo Gao' in Google

Weiyue Wang

This author has not been identified. Look up 'Weiyue Wang' in Google

Hermann Ney

This author has not been identified. Look up 'Hermann Ney' in Google