MapDupReducer: detecting near duplicates over massive datasets

Chaokun Wang, Jianmin Wang, Xuemin Lin, Wei Wang, Haixun Wang, Hongsong Li, Wanpeng Tian, Jun Xu, Rui Li. MapDupReducer: detecting near duplicates over massive datasets. In Ahmed K. Elmagarmid, Divyakant Agrawal, editors, Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2010, Indianapolis, Indiana, USA, June 6-10, 2010. pages 1119-1122, ACM, 2010. [doi]

Authors

Chaokun Wang

This author has not been identified. Look up 'Chaokun Wang' in Google

Jianmin Wang

This author has not been identified. Look up 'Jianmin Wang' in Google

Xuemin Lin

This author has not been identified. Look up 'Xuemin Lin' in Google

Wei Wang

This author has not been identified. Look up 'Wei Wang' in Google

Haixun Wang

This author has not been identified. Look up 'Haixun Wang' in Google

Hongsong Li

This author has not been identified. Look up 'Hongsong Li' in Google

Wanpeng Tian

This author has not been identified. Look up 'Wanpeng Tian' in Google

Jun Xu

This author has not been identified. Look up 'Jun Xu' in Google

Rui Li

This author has not been identified. Look up 'Rui Li' in Google