Abstract is missing.
- A Report on the VarDial Evaluation Campaign 2020Mihaela Gaman, Dirk Hovy, Radu-Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubesic, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri. 1-14 [doi]
- ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss GermanIuliia Nigmatulina, Tannon Kew, Tanja Samardzic. 15-24 [doi]
- LSDC - A comprehensive dataset for Low Saxon Dialect ClassificationJanine Siewert, Yves Scherrer, Martijn Wieling, Jörg Tiedemann. 25-35 [doi]
- Machine-oriented NMT Adaptation for Zero-shot NLP tasks: Comparing the Usefulness of Close and Distant LanguagesAmirhossein Tebbifakhr, Matteo Negri, Marco Turchi. 36-46 [doi]
- Character Alignment in Morphologically Complex Translation Sets for Related LanguagesMichael Gasser, Binyam Ephrem Seyoum, Nazareth Amlesom Kifle. 47-56 [doi]
- Bilingual Lexicon Induction across Orthographically-distinct Under-Resourced Dravidian LanguagesBharathi Raja Chakravarthi, Navaneethan Rajasekaran, Mihael Arcan, Kevin McGuinness, Noel E. O'Connor, John P. McCrae. 57-69 [doi]
- Building a Corpus for the Zaza-Gorani Language FamilySina Ahmadi. 70-78 [doi]
- Dealing with dialectal variation in the construction of the Basque historical corpusAinara Estarrona, Izaskun Etxeberria, Ricardo Etxepare, Manuel Padilla-Moyano, Ander Soraluze. 79-89 [doi]
- Recycling and Comparing Morphological Annotation Models for Armenian Diachronic-Variational Corpus ProcessingChahan Vidal-Gorène, Victoria Khurshudyan, Anaïd Donabédian-Demopoulos. 90-101 [doi]
- Neural Machine Translation for translating into Croatian and SerbianMaja Popovic, Alberto Poncelas, Marija Brkic, Andy Way. 102-113 [doi]
- A Tokenization System for the Kurdish LanguageSina Ahmadi. 114-127 [doi]
- Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language IdentificationBadr M. Abdullah, Jacek Kudera, Tania Avgustinova, Bernd Möbius, Dietrich Klakow. 128-139 [doi]
- A Four-Dialect Treebank for Occitan: Building Process and Parsing ExperimentsAleksandra Miletic, Myriam Bras, Marianne Vergez-Couret, Louise Esher, Clamença Poujade, Jean Sibille. 140-149 [doi]
- Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian LanguageAndrea Zugarini, Matteo Tiezzi, Marco Maggini. 150-159 [doi]
- Towards Augmenting Lexical Resources for Slang and African American EnglishAlyssa Hwang, William R. Frey, Kathleen R. McKeown. 160-172 [doi]
- Uralic Language Identification (ULI) 2020 shared task dataset and the Wanca 2017 corporaTommi Jauhiainen, Heidi Jauhiainen, Niko Partanen, Krister Lindén. 173-185 [doi]
- Dialect Identification under Domain Shift: Experiments with Discriminating Romanian and MoldavianÇagri Çöltekin. 186-192 [doi]
- Applying Multilingual and Monolingual Transformer-Based Models for Dialect IdentificationCristian Popa, Vlad Stefanescu. 193-201 [doi]
- HeLju@VarDial 2020: Social Media Variety Geolocation with BERT ModelsYves Scherrer, Nikola Ljubesic. 202-211 [doi]
- A dual-encoding system for dialect classificationPetru Rebeja, Dan Cristea. 212-219 [doi]
- Experiments in Language Variety Geolocation and Dialect IdentificationTommi Jauhiainen, Heidi Jauhiainen, Krister Lindén. 220-231 [doi]
- Exploring the Power of Romanian BERT for Dialect IdentificationGeorge-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea. 232-241 [doi]
- Combining Deep Learning and String Kernels for the Localization of Swiss German TweetsMihaela Gaman, Radu-Tudor Ionescu. 242-253 [doi]
- ZHAW-InIT - Social Media Geolocation at VarDial 2020Fernando Benites, Manuela Hürlimann, Pius von Däniken, Mark Cieliebak. 254-264 [doi]
- Discriminating between standard Romanian and Moldavian tweets using filtered character ngramsAndrea Ceolin, Hong Zhang. 265-272 [doi]
- Challenges in Neural Language Identification: NRC at VarDial 2020Gabriel Bernier-Colborne, Cyril Goutte. 273-282 [doi]
- Geolocation of Tweets with a BiLSTM Regression ModelPiyush Mishra. 283-289 [doi]