Text Similarity Measures in a Data Deduplication Pipeline for Customers Records

Witold Andrzejewski, Bartosz Bebel, Pawel Boinski, Mariusz Sienkiewicz, Robert Wrembel. Text Similarity Measures in a Data Deduplication Pipeline for Customers Records. In Enrico Gallinucci, Lukasz Golab, editors, Proceedings of the 25th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP) co-located with the 26th International Conference on Extending Database Technology and the 26th International Conference on Database Theory (EDBT/ICDT 2023), Ioannina, Greece, March 28, 2023. Volume 3369 of CEUR Workshop Proceedings, pages 33-42, CEUR-WS.org, 2023. [doi]

@inproceedings{AndrzejewskiBBS23,
  title = {Text Similarity Measures in a Data Deduplication Pipeline for Customers Records},
  author = {Witold Andrzejewski and Bartosz Bebel and Pawel Boinski and Mariusz Sienkiewicz and Robert Wrembel},
  year = {2023},
  url = {https://ceur-ws.org/Vol-3369/paper3.pdf},
  researchr = {https://researchr.org/publication/AndrzejewskiBBS23},
  cites = {0},
  citedby = {0},
  pages = {33-42},
  booktitle = {Proceedings of the 25th International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP) co-located with the 26th International Conference on Extending Database Technology and the 26th International Conference on Database Theory (EDBT/ICDT 2023), Ioannina, Greece, March 28, 2023},
  editor = {Enrico Gallinucci and Lukasz Golab},
  volume = {3369},
  series = {CEUR Workshop Proceedings},
  publisher = {CEUR-WS.org},
}