Analyzing the Impact of Tokenization on Multilingual Epidemic Surveillance in Low-Resource Languages

Stephen Mutuvi, Emanuela Boros, Antoine Doucet, Gaël Lejeune, Adam Jatowt, Moses Odeo. Analyzing the Impact of Tokenization on Multilingual Epidemic Surveillance in Low-Resource Languages. In Gernot A. Fink, Rajiv Jain, Koichi Kise, Richard Zanibbi, editors, Document Analysis and Recognition - ICDAR 2023 - 17th International Conference, San José, CA, USA, August 21-26, 2023, Proceedings, Part III. Volume 14189 of Lecture Notes in Computer Science, pages 17-32, Springer, 2023. [doi]

Authors

Stephen Mutuvi

This author has not been identified. Look up 'Stephen Mutuvi' in Google

Emanuela Boros

This author has not been identified. Look up 'Emanuela Boros' in Google

Antoine Doucet

This author has not been identified. Look up 'Antoine Doucet' in Google

Gaël Lejeune

This author has not been identified. Look up 'Gaël Lejeune' in Google

Adam Jatowt

This author has not been identified. Look up 'Adam Jatowt' in Google

Moses Odeo

This author has not been identified. Look up 'Moses Odeo' in Google