Leveraging hadoop framework to develop duplication detector and analysis using Mapreduce, Hive and Pig

Priyanka Sethi, Prakash Kumar. Leveraging hadoop framework to develop duplication detector and analysis using Mapreduce, Hive and Pig. In Manish Parashar, Umesh Bellur, S. D. Madhu Kumar, Priya Chandran, Murali Krishnan, Kamesh Madduri, Sushil K. Prasad, C. Chandra Sekhar, Nanjangud C. Narendra, Carlos Valera, Sanjay Chaudhary, Kavi Arya, Xiaolin Li 0001, editors, Seventh International Conference on Contemporary Computing, IC3 2014, Noida, India, August 7-9, 2014. pages 454-460, IEEE, 2014. [doi]

Abstract

Abstract is missing.