Proceedings of the 12th Web as Corpus Workshop, WAC@LREC 2020, Marseille, France, May 2020

Adrien Barbaresi, Felix Bildhauer, Roland Schäfer, Egon Stemle, editors, Proceedings of the 12th Web as Corpus Workshop, WAC@LREC 2020, Marseille, France, May 2020. European Language Resources Association, 2020.

Conference: aclwac2020

Current Challenges in Web Corpus Building. Milos Jakubícek, Vojtech Kovár, Pavel Rychlý, Vit Suchomel. 1-4

Out-of-the-Box and into the Ditch? Multilingual Evaluation of Generic Text Extraction Tools. Adrien Barbaresi, Gaël Lejeune. 5-13

From Web Crawl to Clean Register-Annotated Corpora. Veronika Laippala, Samuel Rönnqvist, Saara Hellström, Juhani Luotolahti, Liina Repo, Anna Salmela, Valtteri Skantsi, Sampo Pyysalo. 14-22

Building Web Corpora for Minority Languages. Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén. 23-32

The ELTE.DH Pilot Corpus - Creating a Handcrafted Gigaword Web Corpus with Metadata. Balázs Indig, Árpád Knap, Zsófia Sárközi-Lindner, Mária Timári, Gábor Palkó. 33-41

Hypernym-LIBre: A Free Web-based Corpus for Hypernym Detection. Shaurya Rawat, Mariano Rico, Óscar Corcho. 42-49

A Cross-Genre Ensemble Approach to Robust Reddit Part of Speech Tagging. Shabnam Behzad, Amir Zeldes. 50-56

Streaming Language-Specific Twitter Data with Optimal Keywords. Tim Kreutz, Walter Daelemans. 57-64
