VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups

Zejiang Shen, Kyle Lo, Lucy Lu Wang, Bailey Kuehl, Daniel S. Weld, Doug Downey. VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups. TACL, 10:376-392, 2022. doi:10.1162/tacl_a_00466

@article{ShenLWKWD22,
  title = {{VILA}: Improving Structured Content Extraction from Scientific {PDFs} Using Visual Layout Groups},
  author = {Zejiang Shen and Kyle Lo and Lucy Lu Wang and Bailey Kuehl and Daniel S. Weld and Doug Downey},
  year = {2022},
  doi = {10.1162/tacl_a_00466},
  url = {https://doi.org/10.1162/tacl_a_00466},
  researchr = {https://researchr.org/publication/ShenLWKWD22},
  journal = {Transactions of the Association for Computational Linguistics},
  volume = {10},
  pages = {376--392},
}