Combining Content-Based and URL-Based Heuristics to Harvest Aligned Bitexts from Multilingual Sites with Bitextor

Miquel Esplà-Gomis, Mikel L. Forcada. Combining Content-Based and URL-Based Heuristics to Harvest Aligned Bitexts from Multilingual Sites with Bitextor. Prague Bull. Math. Linguistics, 93:77-86, 2010.

Abstract

Abstract is missing.