Fixing the Threshold for Effective Detection of Near Duplicate Web Documents in Web Crawling

V. A. Narayana, P. Premchand, A. Govardhan. Fixing the Threshold for Effective Detection of Near Duplicate Web Documents in Web Crawling. In Longbing Cao, Yong Feng, Jiang Zhong, editors, Advanced Data Mining and Applications - 6th International Conference, ADMA 2010, Chongqing, China, November 19-21, 2010, Proceedings, Part I. Volume 6440 of Lecture Notes in Computer Science, pages 169-180, Springer, 2010. [doi]

Abstract

Abstract is missing.