IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization

Fajri Koto, Jey Han Lau, Timothy Baldwin. IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih, editors, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021. pages 10660-10668, Association for Computational Linguistics, 2021. [doi]

Authors

Fajri Koto

This author has not been identified. Look up 'Fajri Koto' in Google

Jey Han Lau

This author has not been identified. Look up 'Jey Han Lau' in Google

Timothy Baldwin

This author has not been identified. Look up 'Timothy Baldwin' in Google