DocHPLT: A Massively Multilingual Document-Level Translation Dataset

Dayyán O'Brien, Bhavitvya Malik, Ona De Gibert Bonet, Pinzhen Chen, Barry Haddow, Jörg Tiedemann. DocHPLT: A Massively Multilingual Document-Level Translation Dataset. In Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz, editors, Proceedings of the Tenth Conference on Machine Translation, WMT 2025, Suzhou, China, November 8-9, 2025, pages 286-300. Association for Computational Linguistics, 2025.

Authors

Dayyán O'Brien

Bhavitvya Malik

Ona De Gibert Bonet

Pinzhen Chen

Barry Haddow

Jörg Tiedemann
