SizeSpotSigs: An Effective Deduplicate Algorithm Considering the Size of Page Content

Xianling Mao, Xiaobing Liu, Nan Di, Xiaoming Li, Hongfei Yan. SizeSpotSigs: An Effective Deduplicate Algorithm Considering the Size of Page Content. In Joshua Zhexue Huang, Longbing Cao, Jaideep Srivastava, editors, Advances in Knowledge Discovery and Data Mining - 15th Pacific-Asia Conference, PAKDD 2011, Shenzhen, China, May 24-27, 2011, Proceedings, Part I. Volume 6634 of Lecture Notes in Computer Science, pages 537-548, Springer, 2011. [doi]

@inproceedings{MaoLDLY11,
  title = {SizeSpotSigs: An Effective Deduplicate Algorithm Considering the Size of Page Content},
  author = {Xianling Mao and Xiaobing Liu and Nan Di and Xiaoming Li and Hongfei Yan},
  year = {2011},
  doi = {10.1007/978-3-642-20841-6_44},
  url = {http://dx.doi.org/10.1007/978-3-642-20841-6_44},
  researchr = {https://researchr.org/publication/MaoLDLY11},
  cites = {0},
  citedby = {0},
  pages = {537-548},
  booktitle = {Advances in Knowledge Discovery and Data Mining - 15th Pacific-Asia Conference, PAKDD 2011, Shenzhen, China, May 24-27, 2011, Proceedings, Part I},
  editor = {Joshua Zhexue Huang and Longbing Cao and Jaideep Srivastava},
  volume = {6634},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-642-20840-9},
}