An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks

Kyubyong Park, Joohong Lee, Seongbo Jang, Dawoon Jung. An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks. In Kam-Fai Wong, Kevin Knight, Hua Wu, editors, Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, AACL/IJCNLP 2020, Suzhou, China, December 4-7, 2020, pages 133-142. Association for Computational Linguistics, 2020.
