Tafsir Dataset: A Novel Multi-Task Benchmark for Named Entity Recognition and Topic Modeling in Classical Arabic Literature

Sajawel Ahmed, Rob van der Goot, Misbahur Rehman, Carl Kruse, Ömer Özsoy, Alexander Mehler, Gemma Roig. Tafsir Dataset: A Novel Multi-Task Benchmark for Named Entity Recognition and Topic Modeling in Classical Arabic Literature. In Nicoletta Calzolari, Chu-Ren Huang, Hansaem Kim, James Pustejovsky, Leo Wanner, Key-Sun Choi, Pum-Mo Ryu, Hsin-Hsi Chen, Lucia Donatelli, Heng Ji, Sadao Kurohashi, Patrizia Paggio, Nianwen Xue, Seokhwan Kim, YoungGyun Hahm, Zhong He, Tony Kyungil Lee, Enrico Santus, Francis Bond, Seung-Hoon Na, editors, Proceedings of the 29th International Conference on Computational Linguistics, COLING 2022, Gyeongju, Republic of Korea, October 12-17, 2022, pages 3753-3768. International Committee on Computational Linguistics, 2022.

Authors

Sajawel Ahmed

Rob van der Goot

Misbahur Rehman

Carl Kruse

Ömer Özsoy

Alexander Mehler

Gemma Roig
