On Tuning the Sorted Neighborhood Method for Record Comparisons in a Data Deduplication Pipeline - Industrial Experience Report

Pawel Boinski, Witold Andrzejewski, Bartosz Bebel, Robert Wrembel. On Tuning the Sorted Neighborhood Method for Record Comparisons in a Data Deduplication Pipeline - Industrial Experience Report. In Christine Strauss, Toshiyuki Amagasa, Gabriele Kotsis, A Min Tjoa, Ismail Khalil, editors, Database and Expert Systems Applications - 34th International Conference, DEXA 2023, Penang, Malaysia, August 28-30, 2023, Proceedings, Part I. Volume 14146 of Lecture Notes in Computer Science, pages 164-178, Springer, 2023. [doi]

Abstract

Abstract is missing.