On Tuning the Sorted Neighborhood Method for Record Comparisons in a Data Deduplication Pipeline - Industrial Experience Report

Pawel Boinski, Witold Andrzejewski, Bartosz Bebel, Robert Wrembel. On Tuning the Sorted Neighborhood Method for Record Comparisons in a Data Deduplication Pipeline - Industrial Experience Report. In Christine Strauss, Toshiyuki Amagasa, Gabriele Kotsis, A Min Tjoa, Ismail Khalil, editors, Database and Expert Systems Applications - 34th International Conference, DEXA 2023, Penang, Malaysia, August 28-30, 2023, Proceedings, Part I. Volume 14146 of Lecture Notes in Computer Science, pages 164-178, Springer, 2023. doi:10.1007/978-3-031-39847-6_11

@inproceedings{BoinskiABW23,
  title = {On Tuning the Sorted Neighborhood Method for Record Comparisons in a Data Deduplication Pipeline - Industrial Experience Report},
  author = {Pawel Boinski and Witold Andrzejewski and Bartosz Bebel and Robert Wrembel},
  year = {2023},
  doi = {10.1007/978-3-031-39847-6_11},
  url = {https://doi.org/10.1007/978-3-031-39847-6_11},
  researchr = {https://researchr.org/publication/BoinskiABW23},
  pages = {164--178},
  booktitle = {Database and Expert Systems Applications - 34th International Conference, DEXA 2023, Penang, Malaysia, August 28-30, 2023, Proceedings, Part I},
  editor = {Christine Strauss and Toshiyuki Amagasa and Gabriele Kotsis and A Min Tjoa and Ismail Khalil},
  volume = {14146},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-031-39847-6},
}