A hybrid pipeline of rules and machine learning to filter web-crawled parallel corpora

Eduard Barbu, Verginica Barbu Mititelu. A hybrid pipeline of rules and machine learning to filter web-crawled parallel corpora. In Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Matt Post, Lucia Specia, Marco Turchi, Karin Verspoor, editors, Proceedings of the Third Conference on Machine Translation: Shared Task Papers, WMT 2018, Belgium, Brussels, October 31 - November 1, 2018. pages 867-871, Association for Computational Linguistics, 2018. [doi]

Authors

Eduard Barbu

This author has not been identified. Look up 'Eduard Barbu' in Google

Verginica Barbu Mititelu

This author has not been identified. Look up 'Verginica Barbu Mititelu' in Google