Rakuten's Participation in WAT 2022: Parallel Dataset Filtering by Leveraging Vocabulary Heterogeneity

Alberto Poncelas, Johanes Effendi, Ohnmar Htun, Sunil Yadav, Dongzhe Wang, Saurabh Jain. Rakuten's Participation in WAT 2022: Parallel Dataset Filtering by Leveraging Vocabulary Heterogeneity. In Proceedings of the 9th Workshop on Asian Translation, WAT@COLING 2022, Gyeongju, Republic of Korea, October 17, 2022. pages 68-72, International Conference on Computational Linguistics, 2022. [doi]

Authors

Alberto Poncelas

This author has not been identified. Look up 'Alberto Poncelas' in Google

Johanes Effendi

This author has not been identified. Look up 'Johanes Effendi' in Google

Ohnmar Htun

This author has not been identified. Look up 'Ohnmar Htun' in Google

Sunil Yadav

This author has not been identified. Look up 'Sunil Yadav' in Google

Dongzhe Wang

This author has not been identified. Look up 'Dongzhe Wang' in Google

Saurabh Jain

This author has not been identified. Look up 'Saurabh Jain' in Google