Detecting near-duplicates for web crawling

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma. Detecting near-duplicates for web crawling. In Carey L. Williamson, Mary Ellen Zurko, Peter F. Patel-Schneider, Prashant J. Shenoy, editors, Proceedings of the 16th International Conference on World Wide Web, WWW 2007, Banff, Alberta, Canada, May 8-12, 2007, pages 141-150. ACM, 2007.

Authors

Gurmeet Singh Manku

Arvind Jain

Anish Das Sarma
