PLAtE: A Large-scale Dataset for List Page Web Extraction

Aidan San, Yuan Zhuang, Jan Bakus, Colin Lockard, David M. Ciemiewicz, Sandeep Atluri, Kevin Small, Yangfeng Ji, Heba Elfardy. PLAtE: A Large-scale Dataset for List Page Web Extraction. In Sunayana Sitaram, Beata Beigman Klebanov, Jason D. Williams, editors, Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, ACL 2023, Toronto, Canada, July 9-14, 2023. pages 284-294, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.