Abstract is missing.
- Automatic Classification by Topic Domain for Meta Data Generation, Web Corpus Evaluation, and Corpus ComparisonRoland Schäfer, Felix Bildhauer. 1-6 [doi]
- Efficient construction of metadata-enhanced web corporaAdrien Barbaresi. 7-16 [doi]
- Topically-focused Blog Corpora for Multiple LanguagesAndrew Salway, Dag Elgesem, Knut Hofland, Øystein Reigem, L'ubos Steskal. 17-26 [doi]
- The Challenges and Joys of Analysing Ongoing Language Change in Web-based Corpora: a Case StudyAnne Krause. 27-34 [doi]
- Using the Web and Social Media as Corpora for Monitoring the Spread of Neologisms. The case of 'rapefugee', 'rapeugee', and 'rapugee'Quirin Würschinger, Mohammad Fazleh Elahi, Desislava Zhekova, Hans-Jörg Schmid. 35-43 [doi]
- EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web CorporaMichael Beißwenger, Sabine Bartsch, Stefan Evert, Kay-Michael Würzner. 44-56 [doi]
- SoMaJo: State-of-the-art tokenization for German web and social media textsThomas Proisl, Peter Uhrig. 57-62 [doi]
- UdS-(retrain|distributional|surface): Improving POS Tagging for OOV Words in German CMC and Web DataJakob Prange, Andrea Horbach, Stefan Thater. 63-71 [doi]
- Babler - Data Collection from the Web to Support Speech Recognition and Keyword SearchGideon Mendels, Erica Cooper, Julia Hirschberg. 72-81 [doi]
- A Global Analysis of Emoji UsageNikola Ljubesic, Darja Fiser. 82-89 [doi]
- Genre classification for a corpus of academic webpagesErika Dalan, Serge Sharoff. 90-98 [doi]
- On Bias-free Crawling and Representative Web CorporaRoland Schäfer. 99-105 [doi]
- EmpiriST: AIPHES - Robust Tokenization and POS-Tagging for Different GenresSteffen Remus, Gerold Hintz, Chris Biemann, Christian M. Meyer, Darina Benikova, Judith Eckle-Kohler, Margot Mieskes, Thomas Arnold. 106-114 [doi]
- bot.zen $@$ EmpiriST 2015 - A minimally-deep learning PoS-tagger (trained for German CMC and Web data)Egon Stemle. 115-119 [doi]
- LTL-UDE $@$ EmpiriST 2015: Tokenization and PoS Tagging of Social Media TextTobias Horsmann, Torsten Zesch. 120-126 [doi]