Deidentifying a Corpus of 100 Million Clinical Text Documents for Information Extraction: Lessons Learned

Lakshmi Radhakrishnan, Gundolf Schenk, Kathlene Muenzen, Boris Oskotsky, Sharat Israni, Atul J. Butte. Deidentifying a Corpus of 100 Million Clinical Text Documents for Information Extraction: Lessons Learned. In AMIA 2022, American Medical Informatics Association Annual Symposium, Washington, DC, USA, November 5-9, 2022. AMIA, 2022.

@inproceedings{RadhakrishnanSM22,
  title = {Deidentifying a Corpus of 100 Million Clinical Text Documents for Information Extraction: Lessons Learned},
  author = {Lakshmi Radhakrishnan and Gundolf Schenk and Kathlene Muenzen and Boris Oskotsky and Sharat Israni and Atul J. Butte},
  year = {2022},
  url = {https://knowledge.amia.org/76677-amia-1.4637602/f007-1.4641746/f007-1.4641747/570-1.4641898/810-1.4641895},
  researchr = {https://researchr.org/publication/RadhakrishnanSM22},
  cites = {0},
  citedby = {0},
  booktitle = {AMIA 2022, American Medical Informatics Association Annual Symposium, Washington, DC, USA, November 5-9, 2022},
  publisher = {AMIA},
}