Parag Mulendra Joshi, Sam Liu. Web document text and images extraction using DOM analysis and natural language processing. In Uwe M. Borghoff, Boris Chidlovskii, editors, Proceedings of the 2009 ACM Symposium on Document Engineering, Munich, Germany, September 16-18, 2009. Pages 218-221, ACM, 2009.