Scraping Scientific Web Repositories: Challenges and Solutions for Automated Content Extraction

Philipp Meschenmoser, Norman Meuschke, Manuel Hotz, Bela Gipp. Scraping Scientific Web Repositories: Challenges and Solutions for Automated Content Extraction. D-Lib Magazine, 22(9/10), 2016. doi:10.1045/september2016-meschenmoser

@article{MeschenmoserMHG16,
  title = {Scraping Scientific Web Repositories: Challenges and Solutions for Automated Content Extraction},
  author = {Philipp Meschenmoser and Norman Meuschke and Manuel Hotz and Bela Gipp},
  year = {2016},
  doi = {10.1045/september2016-meschenmoser},
  url = {http://dx.doi.org/10.1045/september2016-meschenmoser},
  researchr = {https://researchr.org/publication/MeschenmoserMHG16},
  journal = {D-Lib Magazine},
  volume = {22},
  number = {9/10},
}