SizeSpotSigs: An Effective Deduplicate Algorithm Considering the Size of Page Content

Xianling Mao, Xiaobing Liu, Nan Di, Xiaoming Li, Hongfei Yan. SizeSpotSigs: An Effective Deduplicate Algorithm Considering the Size of Page Content. In Joshua Zhexue Huang, Longbing Cao, Jaideep Srivastava, editors, Advances in Knowledge Discovery and Data Mining - 15th Pacific-Asia Conference, PAKDD 2011, Shenzhen, China, May 24-27, 2011, Proceedings, Part I. Volume 6634 of Lecture Notes in Computer Science, pages 537-548, Springer, 2011. [doi]

Abstract

Abstract is missing.