Filtering and rescoring the CCMatrix corpus for Neural Machine Translation training

Antoni Oliver González, Sergi Alvarez. Filtering and rescoring the CCMatrix corpus for Neural Machine Translation training. In Mary Nurminen, Judith Brenner, Maarit Koponen, Sirkku Latomaa, Mikhail Mikhailov, Frederike Schierl, Tharindu Ranasinghe, Eva Vanmassenhove, Sergi Alvarez Vidal, Nora Aranberri, Mara Nunziatini, Carla Parra Escartín, Mikel L. Forcada, Maja Popovic, Carolina Scarton, Helena Moniz, editors, Proceedings of the 24th Annual Conference of the European Association for Machine Translation, EAMT 2023, Tampere, Finland, 12-15 June 2023, pages 39-45. European Association for Machine Translation, 2023.
