SparkDQ: Efficient generic big data quality management on distributed data-parallel computation

Rong Gu, Yang Qi, Tongyu Wu, Zhaokang Wang, Xiaolong Xu, Chunfeng Yuan, Yihua Huang. SparkDQ: Efficient generic big data quality management on distributed data-parallel computation. J. Parallel Distrib. Comput., 156:132-147, 2021. [doi]

@article{GuQWWXYH21,
  title = {SparkDQ: Efficient generic big data quality management on distributed data-parallel computation},
  author = {Rong Gu and Yang Qi and Tongyu Wu and Zhaokang Wang and Xiaolong Xu and Chunfeng Yuan and Yihua Huang},
  year = {2021},
  doi = {10.1016/j.jpdc.2021.05.012},
  url = {https://doi.org/10.1016/j.jpdc.2021.05.012},
  researchr = {https://researchr.org/publication/GuQWWXYH21},
  cites = {0},
  citedby = {0},
  journal = {J. Parallel Distrib. Comput.},
  volume = {156},
  pages = {132-147},
}