SparkDQ: Efficient generic big data quality management on distributed data-parallel computation

Rong Gu, Yang Qi, Tongyu Wu, Zhaokang Wang, Xiaolong Xu, Chunfeng Yuan, Yihua Huang. SparkDQ: Efficient generic big data quality management on distributed data-parallel computation. J. Parallel Distrib. Comput., 156:132-147, 2021.
