A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning

Md Mofijul Islam, Gustavo Aguilar, Pragaash Ponnusamy, Clint Solomon Mathialagan, Chengyuan Ma, Chenlei Guo. A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning. In Spandana Gella, He He 0001, Bodhisattwa Prasad Majumder, Burcu Can, Eleonora Giunchiglia, Samuel Cahyawijaya, Sewon Min, Maximilian Mozes, Xiang Lorraine Li, Isabelle Augenstein, Anna Rogers, KyungHyun Cho, Edward Grefenstette, Laura Rimell, Chris Dyer, editors, Proceedings of the 7th Workshop on Representation Learning for NLP, RepL4NLP@ACL 2022, Dublin, Ireland, May 26, 2022. pages 91-99, Association for Computational Linguistics, 2022. [doi]

@inproceedings{IslamAPMMG22,
  title = {A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning},
  author = {Md Mofijul Islam and Gustavo Aguilar and Pragaash Ponnusamy and Clint Solomon Mathialagan and Chengyuan Ma and Chenlei Guo},
  year = {2022},
  url = {https://aclanthology.org/2022.repl4nlp-1.10},
  researchr = {https://researchr.org/publication/IslamAPMMG22},
  cites = {0},
  citedby = {0},
  pages = {91-99},
  booktitle = {Proceedings of the 7th Workshop on Representation Learning for NLP, RepL4NLP@ACL 2022, Dublin, Ireland, May 26, 2022},
  editor = {Spandana Gella and He He 0001 and Bodhisattwa Prasad Majumder and Burcu Can and Eleonora Giunchiglia and Samuel Cahyawijaya and Sewon Min and Maximilian Mozes and Xiang Lorraine Li and Isabelle Augenstein and Anna Rogers and KyungHyun Cho and Edward Grefenstette and Laura Rimell and Chris Dyer},
  publisher = {Association for Computational Linguistics},
  isbn = {978-1-955917-48-3},
}