The Newspaper Navigator Dataset: Extracting Headlines and Visual Content from 16 Million Historic Newspaper Pages in Chronicling America

Benjamin Charles Germain Lee, Jaime Mears, Eileen Jakeway, Meghan Ferriter, Chris Adams, Nathan Yarasavage, Deborah Thomas, Kate Zwaard, Daniel S. Weld. The Newspaper Navigator Dataset: Extracting Headlines and Visual Content from 16 Million Historic Newspaper Pages in Chronicling America. In Mathieu d'Aquin, Stefan Dietze, Claudia Hauff, Edward Curry, Philippe Cudré-Mauroux, editors, CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, Virtual Event, Ireland, October 19-23, 2020. pages 3055-3062, ACM, 2020.

@inproceedings{LeeMJFAYTZW20,
  title = {The Newspaper Navigator Dataset: Extracting Headlines and Visual Content from 16 Million Historic Newspaper Pages in Chronicling America},
  author = {Benjamin Charles Germain Lee and Jaime Mears and Eileen Jakeway and Meghan Ferriter and Chris Adams and Nathan Yarasavage and Deborah Thomas and Kate Zwaard and Daniel S. Weld},
  year = {2020},
  doi = {10.1145/3340531.3412767},
  url = {https://doi.org/10.1145/3340531.3412767},
  researchr = {https://researchr.org/publication/LeeMJFAYTZW20},
  pages = {3055--3062},
  booktitle = {CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, Virtual Event, Ireland, October 19-23, 2020},
  editor = {Mathieu d'Aquin and Stefan Dietze and Claudia Hauff and Edward Curry and Philippe Cudré-Mauroux},
  publisher = {ACM},
  isbn = {978-1-4503-6859-9},
}