Using Apache Spark for Ensuring Data Quality in Modern Data Lake Pipeline Architectures

Martina Sestak, Timi Vovk. Using Apache Spark for Ensuring Data Quality in Modern Data Lake Pipeline Architectures. In Zoran Budimac, Valentino Vranic, Ján Lang, editors, Proceedings of the Tenth Workshop on Software Quality Analysis, Monitoring, Improvement, and Applications, Bratislava, Slovakia, September 10-13, 2023. Volume 3588 of CEUR Workshop Proceedings, CEUR-WS.org, 2023. [doi]

Abstract

Abstract is missing.