A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning

Md Mofijul Islam, Gustavo Aguilar, Pragaash Ponnusamy, Clint Solomon Mathialagan, Chengyuan Ma, Chenlei Guo. A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning. In Spandana Gella, He He 0001, Bodhisattwa Prasad Majumder, Burcu Can, Eleonora Giunchiglia, Samuel Cahyawijaya, Sewon Min, Maximilian Mozes, Xiang Lorraine Li, Isabelle Augenstein, Anna Rogers, KyungHyun Cho, Edward Grefenstette, Laura Rimell, Chris Dyer, editors, Proceedings of the 7th Workshop on Representation Learning for NLP, RepL4NLP@ACL 2022, Dublin, Ireland, May 26, 2022. pages 91-99, Association for Computational Linguistics, 2022. [doi]

Abstract

Abstract is missing.