TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at Twitter

Xinyang Zhang 0002, Yury Malkov, Omar Florez, Serim Park, Brian McWilliams, Jiawei Han 0001, Ahmed El-Kishky. TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at Twitter. In Ambuj Singh, Yizhou Sun, Leman Akoglu, Dimitrios Gunopulos, Xifeng Yan, Ravi Kumar 0001, Fatma Ozcan, Jieping Ye, editors, Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, Long Beach, CA, USA, August 6-10, 2023. pages 5597-5607, ACM, 2023. [doi]

Authors

Xinyang Zhang 0002

This author has not been identified. Look up 'Xinyang Zhang 0002' in Google

Yury Malkov

This author has not been identified. Look up 'Yury Malkov' in Google

Omar Florez

This author has not been identified. Look up 'Omar Florez' in Google

Serim Park

This author has not been identified. Look up 'Serim Park' in Google

Brian McWilliams

This author has not been identified. Look up 'Brian McWilliams' in Google

Jiawei Han 0001

This author has not been identified. Look up 'Jiawei Han 0001' in Google

Ahmed El-Kishky

This author has not been identified. Look up 'Ahmed El-Kishky' in Google