- Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources. Adrien Barbaresi. 1-8
- Focused Web Corpus Crawling. Roland Schäfer, Adrien Barbaresi, Felix Bildhauer. 9-15
- Less Destructive Cleaning of Web Documents by Using Standoff Annotation. Maik Stührenberg. 16-21
- Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese. Magali Sanches Duran, Lucas Avanço, Sandra M. Aluísio, Thiago A. S. Pardo, Maria das Graças Volpe Nunes. 22-28
- {bs, hr, sr}WaC - Web Corpora of Bosnian, Croatian and Serbian. Nikola Ljubesic, Filip Klubicka. 29-35
- The PAISÀ Corpus of Italian Web Texts. Verena Lyding, Egon Stemle, Claudia Borghetti, Marco Brunello, Sara Castagnoli, Felice dell'Orletta, Henrik Dittmann, Alessandro Lenci, Vito Pirrelli. 36-43