AdminSet and AdminBERT: a Dataset and a Pre-trained Language Model to Explore the Unstructured Maze of French Administrative Documents

Thomas Sebbag, Solen Quiniou, Nicolas Stucky, Emmanuel Morin. AdminSet and AdminBERT: a Dataset and a Pre-trained Language Model to Explore the Unstructured Maze of French Administrative Documents. In Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert, editors, Proceedings of the 31st International Conference on Computational Linguistics, COLING 2025, Abu Dhabi, UAE, January 19-24, 2025, pages 392-406. Association for Computational Linguistics, 2025.

Authors

Thomas Sebbag

Solen Quiniou

Nicolas Stucky

Emmanuel Morin
