MapDupReducer: detecting near duplicates over massive datasets

Chaokun Wang, Jianmin Wang, Xuemin Lin, Wei Wang, Haixun Wang, Hongsong Li, Wanpeng Tian, Jun Xu, Rui Li. MapDupReducer: detecting near duplicates over massive datasets. In Ahmed K. Elmagarmid, Divyakant Agrawal, editors, Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2010, Indianapolis, Indiana, USA, June 6-10, 2010, pages 1119-1122. ACM, 2010.

@inproceedings{WangWLWWLTXL10,
  title = {{MapDupReducer}: detecting near duplicates over massive datasets},
  author = {Chaokun Wang and Jianmin Wang and Xuemin Lin and Wei Wang and Haixun Wang and Hongsong Li and Wanpeng Tian and Jun Xu and Rui Li},
  year = {2010},
  doi = {10.1145/1807167.1807296},
  url = {http://doi.acm.org/10.1145/1807167.1807296},
  researchr = {https://researchr.org/publication/WangWLWWLTXL10},
  pages = {1119--1122},
  booktitle = {Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2010, Indianapolis, Indiana, USA, June 6-10, 2010},
  editor = {Ahmed K. Elmagarmid and Divyakant Agrawal},
  publisher = {ACM},
  isbn = {978-1-4503-0032-2},
}