Abstract is missing.
- Current Challenges in Web Corpus BuildingMilos Jakubícek, Vojtech Kovár, Pavel Rychlý, Vit Suchomel. 1-4 [doi]
- Out-of-the-Box and into the Ditch? Multilingual Evaluation of Generic Text Extraction ToolsAdrien Barbaresi, Gaël Lejeune. 5-13 [doi]
- From Web Crawl to Clean Register-Annotated CorporaVeronika Laippala, Samuel Rönnqvist, Saara Hellström, Juhani Luotolahti, Liina Repo, Anna Salmela, Valtteri Skantsi, Sampo Pyysalo. 14-22 [doi]
- Building Web Corpora for Minority LanguagesHeidi Jauhiainen, Tommi Jauhiainen, Krister Lindén. 23-32 [doi]
- The ELTE.DH Pilot Corpus - Creating a Handcrafted Gigaword Web Corpus with MetadataBalázs Indig, Árpád Knap, Zsófia Sárközi-Lindner, Mária Timári, Gábor Palkó. 33-41 [doi]
- Hypernym-LIBre: A Free Web-based Corpus for Hypernym DetectionShaurya Rawat, Mariano Rico, Óscar Corcho. 42-49 [doi]
- A Cross-Genre Ensemble Approach to Robust Reddit Part of Speech TaggingShabnam Behzad, Amir Zeldes. 50-56 [doi]
- Streaming Language-Specific Twitter Data with Optimal KeywordsTim Kreutz, Walter Daelemans. 57-64 [doi]