Abstract is missing.
- Giving Lexical Resources a Second Life: Démonette, a Multi-sourced Morpho-semantic Network for FrenchNabil Hathout, Fiammetta Namer. [doi]
- Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and OpportunitiesGeorg Rehm, Jan Hajic, Josef van Genabith, Andrejs Vasiljevs. [doi]
- Analysing Constraint Grammars with a SAT-solverInari Listenmaa, Koen Claessen. [doi]
- A Framework for Cross-lingual/Node-wise Alignment of Lexical-Semantic ResourcesYoshihiko Hayashi. [doi]
- Applying Core Scientific Concepts to Context-Based Citation RecommendationDaniel Duma, Maria Liakata, Amanda Clare, James Ravenscroft, Ewan Klein. [doi]
- Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and EvaluationBorja Navarro-Colorado, María Ribes-Lafoz, Noelia Sánchez. [doi]
- Use of Domain-Specific Language Resources in Machine TranslationSanja Stajner, Andreia Querido, Nuno Rendeiro, João António Rodrigues, António Branco. [doi]
- Context-enhanced Adaptive Entity LinkingFilip Ilievski, Giuseppe Rizzo 0002, Marieke van Erp, Julien Plu, Raphaël Troncy. [doi]
- VerbLexPor: a lexical resource with semantic roles for PortugueseLeonardo Zilio, Maria José Bocorny Finatto, Aline Villavicencio. [doi]
- Deep Learning of Audio and Language Features for Humor PredictionDario Bertero, Pascale Fung. [doi]
- Syntactic Analysis of Phrasal Compounds in Corpora: a Challenge for NLP ToolsCarola Trips. [doi]
- VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for CroatianIvan Sekulic, Jan Snajder. [doi]
- Towards a Language Service Infrastructure for Mobile EnvironmentsNguyen Cao Hong Ngoc, Donghui Lin, Takao Nakaguchi, Toru Ishida. [doi]
- Port4NooJ v3.0: Integrated Linguistic Resources for Portuguese NLPCristina Mota, Paula Carvalho, Anabela Barreiro. [doi]
- Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert KnowledgeElena Arsevska, Mathieu Roche, Sylvain Falala, Renaud Lancelot, David Chavernac, Pascal Hendrikx, Barbara Dufour. [doi]
- Phonetic Inventory for an Arabic Speech CorpusNawar Halabi, Mike Wald. [doi]
- A Computational Perspective on the Romanian DialectsAlina Maria Ciobanu, Liviu P. Dinu. [doi]
- Speech Corpus Spoken by Young-old, Old-old and Oldest-old JapaneseYurie Iribe, Norihide Kitaoka, Shuhei Segawa. [doi]
- Towards Building Semantic Role Labeler for Indian LanguagesMaaz Anwar, Dipti Misra Sharma. [doi]
- JATE 2.0: Java Automatic Term Extraction with Apache SolrZiqi Zhang, Jie Gao, Fabio Ciravegna. [doi]
- Introducing the Asian Language Treebank (ALT)Ye Kyaw Thu, Win Pa Pa, Masao Utiyama, Andrew M. Finch, Eiichiro Sumita. [doi]
- How to Address Smart Homes with a Social Robot? A Multi-modal Corpus of User Interactions with an Intelligent EnvironmentPatrick Holthaus, Christian Leichsenring, Jasmin Bernotat, Viktor Richter, Marian Pohling, Birte Carlmeyer, Norman Köster, Sebastian Meyer Zu Borgsen, René Zorn, Birte Schiffhauer, Kai Frederic Engelmann, Florian Lier, Simon Schulz, Philipp Cimiano, Friederike Eyssel, Thomas Hermann, Franz Kummert, David Schlangen, Sven Wachsmuth, Petra Wagner, Britta Wrede, Sebastian Wrede. [doi]
- Speech Synthesis of Code-Mixed TextSunayana Sitaram, Alan W. Black. [doi]
- Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual SubtitlesAitor Álvarez, Marina Balenciaga, Arantza del Pozo, Haritz Arzelus, Anna Matamala, Carlos D. Martínez-Hinarejos. [doi]
- If You Even Don't Have a Bit of Bible: Learning Delexicalized POS TaggersZhiwei Yu, David Marecek, Zdenek Zabokrtský, Daniel Zeman. [doi]
- Polish Rhythmic Database ― New Resources for Speech Timing and Rhythm AnalysisAgnieszka Wagner, Katarzyna Klessa, Jolanta Bachan. [doi]
- Ambiguity Diagnosis for Terms in Digital HumanitiesBéatrice Daille, Évelyne Jacquey, Gaël Lejeune, Luis Felipe Melo Mora, Yannick Toussaint. [doi]
- A Web Tool for Building Parallel Corpora of Spoken and Sign LanguagesAlex Becker, Fabio Kepler, Sara Candeias. [doi]
- Encoding Adjective Scales for Fine-grained ResourcesCédric Lopez, Frédérique Segond, Christiane Fellbaum. [doi]
- Multiword Expressions in Child LanguageRodrigo Wilkens, Marco Idiart, Aline Villavicencio. [doi]
- Example-based Acquisition of Fine-grained Collocation ResourcesSara Rodríguez-Fernández, Roberto Carlini, Luis Espinosa Anke, Leo Wanner. [doi]
- NorGramBank: A 'Deep' Treebank for NorwegianHelge Dyvik, Paul Meurer, Victoria Rosén, Koenraad De Smedt, Petter Haugereid, Gyri Smørdal Losnegaard, Gunn Inger Lyse, Martha Thunes. [doi]
- Web Chat Conversations from Contact Centers: a Descriptive StudyGéraldine Damnati, Aleksandra Guerraz, Delphine Charlet. [doi]
- CATaLog Online: Porting a Post-editing Tool to the WebSantanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Tapas Nayak, Mihaela Vela, Josef van Genabith. [doi]
- A Comparison of Domain-based Word Polarity Estimation using different Word EmbeddingsAitor García Pablos, Montse Cuadros, German Rigau. [doi]
- ASPEC: Asian Scientific Paper Excerpt CorpusToshiaki Nakazawa, Manabu Yaguchi, Kiyotaka Uchimoto, Masao Utiyama, Eiichiro Sumita, Sadao Kurohashi, Hitoshi Isahara. [doi]
- BAS Speech Science Web Services - an Update of Current DevelopmentsThomas Kisler, Uwe D. Reichel, Florian Schiel, Christoph Draxler, Bernhard Jackl, Nina Pörner. [doi]
- Data Formats and Management Strategies from the Perspective of Language Resource Producers ― Personal Diachronic and Social Synchronic Data Sharing ―Kazushi Ohya. [doi]
- Using Data Mining Techniques for Sentiment Shifter IdentificationSamira Noferesti, Mehrnoush Shamsfard. [doi]
- TEITOK: Text-Faithful Annotated CorporaMaarten Janssen. [doi]
- Sense-annotating a Lexical Substitution Data Set with UbylineTristan Miller, Mohamed Khemakhem, Richard Eckart de Castilho, Iryna Gurevych. [doi]
- Yes, We Care! Results of the Ethics and Natural Language Processing SurveysKarën Fort, Alain Couillault. [doi]
- A Language Resource of German Errors Written by Children with DyslexiaMaria Rauschenberger, Luz Rello, Silke Füchsel, Jörg Thomaschewski. [doi]
- Persian Proposition BankAzadeh Mirzaei, Amirsaeid Moloodi. [doi]
- Semantic Links for PortugueseFabricio Chalub, Livy Real, Alexandre Rademaker, Valeria de Paiva. [doi]
- WIKIPARQ: A Tabulated Wikipedia Resource Using the Parquet FormatMarcus Klang, Pierre Nugues. [doi]
- Acquiring Opposition Relations among Italian Verb Senses using CrowdsourcingAnna Feltracco, Simone Magnolini, Elisabetta Jezek, Bernardo Magnini. [doi]
- Towards producing bilingual lexica from monolingual corporaJingyi Han, Núria Bel. [doi]
- FREME: Multilingual Semantic Enrichment with Linked Data and Language TechnologiesMilan Dojchinovski, Felix Sasaki, Tatjana Gornostaja, Sebastian Hellmann, Erik Mannens, Frank Salliau, Michele Osella, Phil Ritchie, Giannis Stoitsis, Kevin Koidl, Markus Ackermann, Nilesh Chakraborty. [doi]
- A Dataset for Open Event Extraction in EnglishKiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besançon. [doi]
- DT-Neg: Tutorial Dialogues Annotated for Negation Scope and Focus in ContextRajendra Banjade, Vasile Rus. [doi]
- Translation Errors and Incomprehensibility: a Case Study using Machine-Translated Second Language Proficiency TestsTakuya Matsuzaki, Akira Fujita, Naoya Todo, Noriko H. Arai. [doi]
- A Framework for Automatic Acquisition of Croatian and Serbian Verb Aspect from CorporaTanja Samardzic, Maja Milicevic. [doi]
- ELRA Activities and ServicesKhalid Choukri, Valérie Mapelli, Hélène Mazo, Vladimir Popescu. [doi]
- Measuring Lexical Quality of a Historical Finnish Newspaper Collection ― Analysis of Garbled OCR Data with Basic Language Technology Tools and MeansKimmo Kettunen, Tuula Pääkkönen. [doi]
- Korean TimeML and Korean TimeBankYoung-Seob Jeong, Won-Tae Joo, Hyun-Woo Do, Chae-Gyun Lim, Key-Sun Choi, Ho-Jin Choi. [doi]
- A Machine Learning based Music Retrieval and Recommendation SystemNaziba Mostafa, Yan Wan, Unnayan Amitabh, Pascale Fung. [doi]
- LanguageCrawl: A Generic Tool for Building Language Models Upon Common-CrawlSzymon Roziewski, Wojciech Stokowiec. [doi]
- Nine Features in a Random Forest to Learn Taxonomical Semantic RelationsEnrico Santus, Alessandro Lenci, Tin Shing Chiu, Qin Lu, Chu-Ren Huang. [doi]
- A Taxonomy of Spanish Nouns, a Statistical Algorithm to Generate it and its Implementation in Open Source CodeRogelio Nazar, Irene Renau. [doi]
- Learning Tone and Attribution for Financial Text MiningMahmoud El-Haj, Paul Rayson, Steven Young 0001, Andrew Moore, Martin Walker, Thomas Schleicher, Vasiliki Athanasakou. [doi]
- Collecting Language Resources for the Latvian e-Government Machine Translation PlatformRoberts Rozis, Andrejs Vasiljevs, Raivis Skadins. [doi]
- Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous SpeechRoberto Seara, Marta Martinez, Rocío Varela, Carmen García-Mateo, Elisa Fernández Rei, Xose Luis Regueira. [doi]
- Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of WolofElodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese, Uriel Pascal Elingui. [doi]
- Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in SpanishAndre Quispersaravia, Walter Perez, Marco Sobrevilla, Fernando Alva-Manchengo. [doi]
- Metonymy Analysis Using Associative Relations between WordsTakehiro Teraoka. [doi]
- Towards a Corpus of Violence Acts in Arabic Social MediaAyman Alhelbawy, Massimo Poesio, Udo Kruschwitz. [doi]
- The VU Sound Corpus: Adding More Fine-grained Annotations to the Freesound DatabaseEmiel van Miltenburg, Benjamin Timmermans, Lora Aroyo. [doi]
- DART: a Dataset of Arguments and their Relations on TwitterTom Bosc, Elena Cabrio, Serena Villata. [doi]
- Creation of comparable corpora for English-Urdu, Arabic, PersianMurad Abouammoh, Kashif Shah, Ahmet Aker. [doi]
- Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion RecognitionNurul Lubis, Randy Gomez, Sakriani Sakti, Keisuke Nakamura, Koichiro Yoshino, Satoshi Nakamura, Kazuhiro Nakadai. [doi]
- A New Integrated Open-source Morphological Analyzer for HungarianAttila Novák, Borbála Siklósi, Csaba Oravecz. [doi]
- Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)Damir Cavar, Malgorzata Cavar, Lwin Moe. [doi]
- Combining Ontologies and Neural Networks for Analyzing Historical Language Varieties. A Case Study in Middle Low GermanMaria Sukhareva, Christian Chiarcos. [doi]
- AVAB-DBS: an Audio-Visual Affect Bursts Database for SynthesisKevin El Haddad, Hüseyin Çakmak, Stéphane Dupont, Thierry Dutoit. [doi]
- The COPLE2 corpus: a learner corpus for PortugueseAmália Mendes, Sandra Antunes, Maarten Janssen, Anabela Gonçalves. [doi]
- The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language ResourcesGeorg Rehm. [doi]
- Affective Lexicon Creation for the Greek LanguageElisavet Palogiannidi, Polychronis Koutsakis, Elias Iosif, Alexandros Potamianos. [doi]
- Effect Functors for Opinion InferenceJosef Ruppenhofer, Jasper Brandes. [doi]
- Discovering Fuzzy Synsets from the Redundancy in Different Lexical-Semantic ResourcesHugo Gonçalo Oliveira, Fábio Santos. [doi]
- Quality Assessment of the Reuters Vol. 2 Multilingual CorpusRobin Eriksson. [doi]
- Crowdsourcing Salient Information from News and TweetsOana Inel, Tommaso Caselli, Lora Aroyo. [doi]
- A Corpus of Gesture-Annotated Dialogues for Monologue-to-Dialogue Generation from Personal NarrativesZhichao Hu, Michelle Dick, Chung-Ning Chang, Kevin Bowden, Michael Neff, Jean E. Fox Tree, Marilyn A. Walker. [doi]
- A Multimodal Corpus for the Assessment of Public Speaking Ability and AnxietyMathieu Chollet, Torsten Wörtwein, Louis-Philippe Morency, Stefan Scherer. [doi]
- A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic AdultsVictoria Yaneva, Irina P. Temnikova, Ruslan Mitkov. [doi]
- Evaluating the Noisy Channel Model for the Normalization of Historical Texts: Basque, Spanish and SloveneIzaskun Etxeberria, Iñaki Alegria, Larraitz Uria, Mans Hulden. [doi]
- A Hungarian Sentiment Corpus Manually Annotated at Aspect LevelMartina Katalin Szabó, Veronika Vincze, Katalin Ilona Simkó, Viktor Varga, Viktor Hangya. [doi]
- DALILA: The Dialectal Arabic Linguistic Learning AssistantSalam Khalifa, Houda Bouamor, Nizar Habash. [doi]
- Distributional Thesauri for Information Retrieval and vice versaVincent Claveau, Ewa Kijak. [doi]
- Leveraging RDF Graphs for Crossing Multiple Bilingual DictionariesMarta Villegas, Maite Melero, Núria Bel, Jorge Gracia. [doi]
- 1 Million Captioned Dutch Newspaper ImagesDesmond Elliott, Martijn Kleppe. [doi]
- The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media DocumentsJohann Poignant, Mateusz Budnik, Hervé Bredin, Claude Barras, Mickaël Stefas, Pierrick Bruneau, Gilles Adda, Laurent Besacier, Hazim Kemal Ekenel, Gil Francopoulo, Javier Hernando, Joseph Mariani, Ramon Morros, Georges Quénot, Sophie Rosset, Thomas Tamisier. [doi]
- Extracting Structured Scholarly Information from the Machine Translation LiteratureEunsol Choi, Matic Horvat, Jonathan May, Kevin Knight, Daniel Marcu. [doi]
- Extracting Weighted Language Lexicons from WikipediaGregory Grefenstette. [doi]
- Large Multi-lingual, Multi-level and Multi-genre Annotation CorpusXuansong Li, Martha Palmer, Nianwen Xue, Lance A. Ramshaw, Mohamed Maamouri, Ann Bies, Kathryn Conger, Stephen Grimes, Stephanie Strassel. [doi]
- Corpus Resources for Dispute Mediation DiscourseMathilde Janier, Chris Reed. [doi]
- AMISCO: The Austrian German Multi-Sensor CorpusHannes Pessentheiner, Thomas Pichler, Martin Hagmüller. [doi]
- Multilevel Annotation of Agreement and Disagreement in Italian News BlogsFabio Celli, Giuseppe Riccardi, Firoj Alam. [doi]
- User, who art thou? User Profiling for Oral Corpus PlatformsChristian Fandrych, Elena Frick, Hanna Hedeland, Anna Iliash, Daniel Jettka, Cordula Meißner, Thomas Schmidt, Franziska Wallner, Kathrin Weigert, Swantje Westpfahl. [doi]
- SlangNet: A WordNet like resource for English SlangShehzaad Dhuliawala, Diptesh Kanojia, Pushpak Bhattacharyya. [doi]
- Towards a Linguistic Ontology with an Emphasis on Reasoning and Knowledge ReuseArtemis Parvizi, Matt Kohl, Meritxell Gonzàlez, Roser Saurí. [doi]
- How does Dictionary Size Influence Performance of Vietnamese Word Segmentation?Wuying Liu, Lin Wang. [doi]
- MEANTIME, the NewsReader Multilingual Event and Time CorpusAnne-Lyse Minard, Manuela Speranza, Ruben Urizar, Begoña Altuna, Marieke van Erp, Anneleen Schoen, Chantal van Son. [doi]
- Homing in on Twitter Users: Evaluating an Enhanced Geoparser for User Profile LocationsBeatrice Alex, Clare Llewellyn, Claire Grover, Jon Oberlander, Richard Tobin. [doi]
- Design and Development of the MERLIN Learner Corpus PlatformVerena Lyding, Karin Schöne. [doi]
- Quantitative Analysis of Gazes and Grounding Acts in L1 and L2 ConversationsIchiro Umata, Koki Ijuin, Mitsuru Ishida, Moe Takeuchi, Seiichi Yamamoto. [doi]
- RankDCG: Rank-Ordering Evaluation MeasureDenys Katerenchuk, Andrew Rosenberg. [doi]
- D(H)ante: A New Set of Tools for XIII Century ItalianAngelo Basile, Federico Sangati. [doi]
- Semantic Layer of the Valence Dictionary of Polish WalentyElzbieta Hajnicz, Anna Andrzejczuk, Tomasz Bartosiak. [doi]
- Can Topic Modelling benefit from Word Sense Information?Adriana Ferrugento, Hugo Gonçalo Oliveira, Ana Oliveira Alves, Filipe Rodrigues. [doi]
- NileULex: A Phrase and Word Level Sentiment Lexicon for Egyptian and Modern Standard ArabicSamhaa R. El-Beltagy. [doi]
- New Developments in the LRE MapVladimir Popescu, Lin Liu, Riccardo Del Gratta, Khalid Choukri, Nicoletta Calzolari. [doi]
- Using the TED Talks to Evaluate Spoken Post-editing of Machine TranslationJeevanthi Liyanapathirana, Andrei Popescu-Belis. [doi]
- An Arabic-Moroccan Darija Code-Switched CorpusYounes Samih, Wolfgang Maier. [doi]
- A Multilingual, Multi-style and Multi-granularity Dataset for Cross-language Textual Similarity DetectionJérémy Ferrero, Frédéric Agnès, Laurent Besacier, Didier Schwab. [doi]
- Improving POS Tagging of German Learner Language in a Reading Comprehension ScenarioLena Keiper, Andrea Horbach, Stefan Thater. [doi]
- Integration of Lexical and Semantic Knowledge for Sentiment Analysis in SMSWejdene Khiari, Mathieu Roche, Asma Bouhafs Hafsia. [doi]
- The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and AnalysisBeáta Megyesi, Jesper Näsman, Anne Palmér. [doi]
- Generating a Yiddish Speech Corpus, Forced Aligner and Basic ASR System for the AHEYM ProjectMalgorzata Cavar, Damir Cavar, Dov-Ber Kerler, Anya Quilitzsch. [doi]
- Universal Dependencies for PersianMojgan Seraji, Filip Ginter, Joakim Nivre. [doi]
- A Bilingual Discourse Corpus and Its ApplicationsYang Liu, Jiajun Zhang, Chengqing Zong, Yating Yang, Xi Zhou. [doi]
- Enhancing Cross-border EU E-commerce through Machine Translation: Needed Language Resources, Challenges and OpportunitiesMeritxell Fernández Barrera, Vladimir Popescu, Antonio Toral, Federico Gaspari, Khalid Choukri. [doi]
- Classifying Out-of-vocabulary Terms in a Domain-Specific Social Media CorpusSohyun Park, Afsaneh Fazly, Annie Lee, Brandon Seibel, Wenjie Zi, Paul Cook. [doi]
- Finding Definitions in Large Corpora with Sketch EngineVojtech Kovár, Monika Mociariková, Pavel Rychlý. [doi]
- Creating a General Russian Sentiment LexiconNatalia V. Loukachevitch, Anatolii Levchik. [doi]
- EasyTree: A Graphical Tool for Dependency Tree AnnotationAlexa Little, Stephen Tratz. [doi]
- The Denoised Web Treebank: Evaluating Dependency Parsing under Noisy Input ConditionsJoachim Daiber, Rob van der Goot. [doi]
- A Semi-Supervised Approach for Gender IdentificationJuan Soler, Leo Wanner. [doi]
- Exploiting Arabic Diacritization for High Quality Automatic AnnotationNizar Habash, Anas Shahrour, Muhamed al Khalil. [doi]
- Named Entity Resources - Overview and OutlookMaud Ehrmann, Damien Nouvel, Sophie Rosset. [doi]
- PROTEST: A Test Suite for Evaluating Pronouns in Machine TranslationLiane Guillou, Christian Hardmeier. [doi]
- The SpeDial datasets: datasets for Spoken Dialogue Systems analyticsJosé Lopes, Arodami Chorianopoulou, Elisavet Palogiannidi, Helena Moniz, Alberto Abad, Katerina Louka, Elias Iosif, Alexandros Potamianos. [doi]
- Building A Case-based Semantic English-Chinese Parallel TreebankHuaxing Shi, Tiejun Zhao, Keh-Yih Su. [doi]
- A Corpus of Images and Text in Online NewsLaura Hollink, Adriatik Bedjeti, Martin van Harmelen, Desmond Elliott. [doi]
- Two Architectures for Parallel Processing of Huge Amounts of TextMathijs Kattenberg, Zuhaitz Beloki, Aitor Soroa, Xabier Artola, Antske Fokkens, Paul Huygen, Kees Verstoep. [doi]
- From Interoperable Annotations towards Interoperable Resources: A Multilingual Approach to the Analysis of DiscourseEkaterina Lapshinova-Koltunski, Kerstin Anna Kunz, Anna Nedoluzhko. [doi]
- Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in LatvianMarcis Pinnis, Askars Salimbajevs, Ilze Auzina. [doi]
- Automatically Generated Affective Norms of Abstractness, Arousal, Imageability and Valence for 350 000 German LemmasMaximilian Köper, Sabine Schulte im Walde. [doi]
- Typology of Adjectives Benchmark for Compositional Distributional ModelsDaria Ryzhova, Maria Kyuseva, Denis Paperno. [doi]
- Analyzing Pre-processing Settings for Urdu Single-document Extractive SummarizationMuhammad Humayoun, Hwanjo Yu. [doi]
- Adapting the TANL tool suite to Universal DependenciesMaria Simi, Giuseppe Attardi. [doi]
- FLAT: Constructing a CLARIN Compatible Home for Language ResourcesMenzo Windhouwer, Marc Kemps-Snijders, Paul Trilsbeek, André Moreira, Bas Van der Veen, Guilherme Silva, Daniel Von Reihn. [doi]
- A Longitudinal Bilingual Frisian-Dutch Radio Broadcast Database Designed for Code-Switching ResearchEmre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits Van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David A. van Leeuwen. [doi]
- Building Evaluation Datasets for Consumer-Oriented Information RetrievalLorraine Goeuriot, Liadh Kelly, Guido Zuccon, João R. M. Palotti. [doi]
- Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment CorpusJames Ravenscroft, Anika Oellrich, Shyamasree Saha, Maria Liakata. [doi]
- Emotion Corpus Construction Based on Selection from HashtagsMinglei Li, Yunfei Long, Lu Qin, Wenjie Li. [doi]
- A sense-based lexicon of count and mass expressions: The Bochum English Countability LexiconTibor Kiss, Francis Jeffry Pelletier, Halima Husic, Roman Nino Simunic, Johanna Marie Poppek. [doi]
- Combining Semantic Annotation of Word Sense & Semantic Roles: A Novel Annotation Scheme for VerbNet Roles on German Language DataÉva Mújdricza-Maydt, Silvana Hartmann, Iryna Gurevych, Anette Frank. [doi]
- Spanish Word Vectors from WikipediaMathías Etcheverry, Dina Wonsever. [doi]
- Comparing Speech and Text Classification on ICNALESergiu Nisioi. [doi]
- Phrase Level Segmentation and Labelling of Machine Translation ErrorsFrédéric Blain, Varvara Logacheva, Lucia Specia. [doi]
- Farasa: A New Fast and Accurate Arabic Word SegmenterKareem Darwish, Hamdy Mubarak. [doi]
- AppDialogue: Multi-App Dialogues for Intelligent AssistantsMing Sun, Yun-Nung Chen, Zhenhao Hua, Yulian Tamres-Rudnicky, Arnab Dash, Alexander I. Rudnicky. [doi]
- More than Word Cooccurrence: Exploring Support and Opposition in International Climate Negotiations with Semantic ParsingPablo Ruiz 0001, Clément Plancq, Thierry Poibeau. [doi]
- An Empirical Exploration of Moral Foundations Theory in Partisan News SourcesDean Fulgoni, Jordan Carpenter, Lyle H. Ungar, Daniel Preotiuc-Pietro. [doi]
- The Scielo Corpus: a Parallel Corpus of Scientific Publications for BiomedicineMariana L. Neves, Antonio Jimeno-Yepes, Aurélie Névéol. [doi]
- Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word EmbeddingsEda Okur, Hakan Demir, Arzucan Özgür. [doi]
- Edit Categories and Editor Role Identification in WikipediaDiyi Yang, Aaron Halfaker, Robert E. Kraut, Eduard H. Hovy. [doi]
- Evaluating the Readability of Text Simplification Output for Readers with Cognitive DisabilitiesVictoria Yaneva, Irina P. Temnikova, Ruslan Mitkov. [doi]
- A Corpus of Argument Networks: Using Graph Properties to Analyse Divisive IssuesBarbara Konat, John Lawrence, Joonsuk Park, Katarzyna Budzynska, Chris Reed. [doi]
- mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE ProcessingSilvio Cordeiro, Carlos Ramisch, Aline Villavicencio. [doi]
- Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-artSteffen Eger, Rüdiger Gleim, Alexander Mehler. [doi]
- A Novel Evaluation Method for Morphological SegmentationJavad Nouri, Roman Yangarber. [doi]
- IMS HotCoref DE: A Data-driven Co-reference Resolver for GermanIna Rösiger, Jonas Kuhn. [doi]
- Aspectual Flexibility Increases with Agentivity and Concreteness\\ A Computational Classification Experiment on Polysemous VerbsIngrid Falk, Fabienne Martin. [doi]
- Providing a Catalogue of Language Resources for Commercial UsersBente Maegaard, Lina Henriksen, Andrew Joscelyne, Vesna Lusicky, Margaretha Mazura, Sussi Olsen, Claus Povlsen, Philippe Wacker. [doi]
- The OFAI Multi-Modal Task Description CorpusStephanie Schreitter, Brigitte Krenn. [doi]
- Adding Semantic Relations to a Large-Coverage Connective Lexicon of GermanTatjana Scheffler, Manfred Stede. [doi]
- Information structure in the Potsdam Commentary Corpus: TopicsManfred Stede, Sara Mamprin. [doi]
- Tweeting and Being Ironic in the Debate about a Political Reform: the French Annotated Corpus TWitter-MariagePourTousCristina Bosco, Mirko Lai, Viviana Patti, Daniela Virone. [doi]
- Creating Linked Data Morphological Language Resources with MMoOn - The Hebrew Morpheme InventoryBettina Klimek, Natanael Arndt, Sebastian Krause, Timotheus Arndt. [doi]
- Enriching TimeBank: Towards a more precise annotation of temporal relations in a textVolker Gast, Lennart Bierkandt, Stephan Druskat, Christoph Rzymski. [doi]
- ANEW+: Automatic Expansion and Validation of Affective Norms of Words Lexicons in Multiple LanguagesSamira Shaikh, Kit Cho, Tomek Strzalkowski, Laurie Feldman, John Lien, Ting Liu, George Aaron Broadwell. [doi]
- EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment AnalysisJasy Suet Yan Liew, Howard R. Turtle, Elizabeth D. Liddy. [doi]
- EVALution-MAN: A Chinese Dataset for the Training and Evaluation of DSMsHongchao Liu, Karl Neergaard, Enrico Santus, Chu-Ren Huang. [doi]
- Automatic Construction of Discourse Corpora for Dialogue TranslationLongyue Wang, Xiaojun Zhang, Zhaopeng Tu, Andy Way, Qun Liu. [doi]
- What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL DatasetsEnrico Santus, Alessandro Lenci, Tin Shing Chiu, Qin Lu, Chu-Ren Huang. [doi]
- A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source PerformanceFelix Burkhardt, Uwe D. Reichel. [doi]
- Crossmodal Network-Based Distributional Semantic ModelsElias Iosif, Alexandros Potamianos. [doi]
- A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat SummarizationFajri Koto. [doi]
- Discontinuous Verb Phrases in Parsing and Machine Translation of English and GermanSharid Loáiciga, Kristina Gulordava. [doi]
- The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly PeopleMichel Vacher, Saïda Bouakaz, Marc-Eric Bobillier-Chaumon, Frédéric Aman, Rizwan Ahmed Khan, Salima Body-Bekkadja, François Portet, Erwan Guillou, Solange Rossato, Benjamin Lecouteux. [doi]
- Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it?Shammur Absar Chowdhury, Evgeny A. Stepanov, Giuseppe Riccardi. [doi]
- Accurate Deep Syntactic Parsing of Graphs: The Case of FrenchCorentin Ribeyre, Éric Villemonte de la Clergerie, Djamé Seddah. [doi]
- Temporal Information Annotation: Crowd vs. ExpertsTommaso Caselli, Rachele Sprugnoli, Oana Inel. [doi]
- Combining Manual and Automatic Prosodic Annotation for Expressive Speech SynthesisSandrine Brognaux, Thomas François, Marco Saerens. [doi]
- Cross-lingual Linking of Multi-word Entities and their corresponding AcronymsGuillaume Jacquet, Maud Ehrmann, Ralf Steinberger, Jaakko Väyrynen. [doi]
- Joining-in-type Humanoid Robot Assisted Language Learning SystemAlBara Khalifa, Tsuneo Kato, Seiichi Yamamoto. [doi]
- Graphical Annotation for Syntax-Semantics MappingKôiti Hasida. [doi]
- A Shared Task for Spoken CALL?Claudia Baur, Johanna Gerlach, Manny Rayner, Martin Russell, Helmer Strik. [doi]
- Detecting Word Usage Errors in Chinese Sentences for Learning Chinese as a Foreign LanguageYow-Ting Shiue, Hsin-Hsi Chen. [doi]
- Rule-based Automatic Multi-word Term Extraction and LemmatizationRanka Stankovic, Cvetana Krstev, Ivan Obradovic, Biljana Lazic, Aleksandra Trtovac. [doi]
- Semi-automatic Parsing for Web Knowledge Extraction through Semantic AnnotationMaria Pia di Buono. [doi]
- Creating Annotated Dialogue Resources: Cross-domain Dialogue Act ClassificationDilafruz Amanova, Volha Petukhova, Dietrich Klakow. [doi]
- A Singing Voice Database in Basque for Statistical Singing Synthesis of BertsolaritzaXabier Sarasola, Eva Navas, David Tavarez, Daniel Erro, Ibon Saratxaga, Inma Hernáez. [doi]
- Operational Assessment of Keyword Search on Oral HistoryElizabeth Salesky, Jessica Ray, Wade Shen. [doi]
- Annotating Topic Development in Information Seeking QueriesMarta Andersson, Adnan Ozturel, Silvia Pareti. [doi]
- Towards Comparability of Linguistic Graph Banks for Semantic ParsingStephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajic, Angelina Ivanova, Zdenka Uresová. [doi]
- The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition MethodsBehrang QasemiZadeh, Anne-Kathrin Schumann. [doi]
- CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for PortugueseRita de Carvalho, Andreia Querido, Marisa Campos, Rita Valadas Pereira, João Ricardo Silva, António Branco. [doi]
- A Japanese Chess Commentary CorpusShinsuke Mori, John Richardson, Atsushi Ushiku, Tetsuro Sasada, Hirotaka Kameko, Yoshimasa Tsuruoka. [doi]
- Al Qamus al Muhit, a Medieval Arabic Lexicon in LMFOuafae Nahli, Francesca Frontini, Monica Monachini, Fahad Khan, Arsalane Zarghili, Mustapha Khalfi. [doi]
- Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation ModelsSteven Neale, Luís Gomes 0002, Eneko Agirre, Oier Lopez de Lacalle, António Branco. [doi]
- The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological FactorsTilia Ellendorff, Simon Foster, Fabio Rinaldi. [doi]
- A Multi-domain Corpus of Swedish Word Sense AnnotationRichard Johansson, Yvonne Adesam, Gerlof Bouma, Karin Hedberg. [doi]
- Automatic Classification of Tweets for Analyzing Communication Behavior of MuseumsNicolas Foucault, Antoine Courtin. [doi]
- The Universal Dependencies Treebank of Spoken SlovenianKaja Dobrovoljc, Joakim Nivre. [doi]
- Optimizing Computer-Assisted Transcription Quality with Iterative User InterfacesMatthias Sperber, Graham Neubig, Satoshi Nakamura, Alex Waibel. [doi]
- Corpus Query Lingua Franca (CQLF)Piotr Banski, Elena Frick, Andreas Witt. [doi]
- Privacy Issues in Online Machine Translation Services - European PerspectivePawel Kamocki, Jim O'Regan. [doi]
- A Large DataBase of Hypernymy Relations Extracted from the WebJulian Seitner, Christian Bizer, Kai Eckert 0001, Stefano Faralli, Robert Meusel, Heiko Paulheim, Simone Paolo Ponzetto. [doi]
- Fast and Robust POS tagger for Arabic Tweets Using Agreement-based BootstrappingFahad Albogamy, Allan Ramsay. [doi]
- Accessing and Elaborating Walenty - a Valence Dictionary of Polish - via Internet BrowserBartlomiej Niton, Tomasz Bartosiak, Elzbieta Hajnicz. [doi]
- Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot StudySilvie Cinková, Ema Krejcová, Anna Vernerová, Vít Baisa. [doi]
- A Corpus of Read and Spontaneous Upper Saxon German Speech for ASR EvaluationRobert Herms, Laura Seelig, Stefanie Münch, Maximilian Eibl. [doi]
- Passing a USA National Bar Exam: a First Corpus for ExperimentationBiralatei Fawei, Adam Z. Wyner, Jeff Z. Pan. [doi]
- Extracting Interlinear Glossed Text from LaTeX DocumentsMathias Schenner, Sebastian Nordhoff. [doi]
- The Alaskan Athabascan Grammar DatabaseSebastian Nordhoff, Siri Tuttle, Olga Lovick. [doi]
- Detecting Expressions of Blame or Praise in TextOrizu Udochukwu, Yulan He. [doi]
- PARSEME Survey on MWE ResourcesGyri Smørdal Losnegaard, Federico Sangati, Carla Parra Escartín, Agata Savary, Sascha Bargmann, Johanna Monti. [doi]
- Ensemble Classification of Grants using LDA-based FeaturesIoannis Korkontzelos, Beverley Thomas, Makoto Miwa, Sophia Ananiadou. [doi]
- MarsaGram: an excursion in the forests of parsing treesPhilippe Blache, Stéphane Rauzy, Grégoire de Montcheuil. [doi]
- Compilation of an Arabic Children's CorpusLatifa Al-Sulaiti, Noorhan Abbas, Claire Brierley, Eric Atwell, Ayman Alghamdi. [doi]
- Poly-GrETEL: Cross-Lingual Example-based Querying of Syntactic ConstructionsLiesbeth Augustinus, Vincent Vandeghinste, Tom Vanallemeersch. [doi]
- Explicit Fine grained Syntactic and Semantic Annotation of the Idafa Construction in ArabicAbdelati Hawwari, Mohammed Attia, Mahmoud-Ghoneim, Mona T. Diab. [doi]
- Can Tweets Predict TV Ratings?Bridget Sommerdijk, Eric Sanders, Antal van den Bosch. [doi]
- MARMOT: A Toolkit for Translation Quality Estimation at the Word LevelVarvara Logacheva, Chris Hokamp, Lucia Specia. [doi]
- The REAL Corpus: A Crowd-Sourced Corpus of Human Generated and Evaluated Spatial References to Real-World Urban ScenesPhil J. Bartie, William A. Mackaness, Dimitra Gkatzia, Verena Rieser. [doi]
- A Rule-based Shallow-transfer Machine Translation System for Scots and EnglishGavin Abercrombie. [doi]
- Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuolaMarco Stranisci, Cristina Bosco, Delia Irazú Hernández Farías, Viviana Patti. [doi]
- Evaluating Entity Linking: An Analysis of Current Benchmark Datasets and a Roadmap for Doing a Better JobMarieke van Erp, Pablo N. Mendes, Heiko Paulheim, Filip Ilievski, Julien Plu, Giuseppe Rizzo 0002, Jörg Waitelonis. [doi]
- Language Resource Citation: the ISLRN Dissemination and Further DevelopmentsValérie Mapelli, Vladimir Popescu, Lin Liu, Khalid Choukri. [doi]
- Rapid Development of Morphological Analyzers for Typologically Diverse LanguagesSeth Kulick, Ann Bies. [doi]
- A Dependency Treebank of the Chinese Buddhist CanonTak-sum Wong, John Lee. [doi]
- Introducing the SEA_AP: an Enhanced Tool for Automatic Prosodic AnalysisMarta Martinez, Rocío Varela, Carmen García-Mateo, Elisa Fernández Rei, Adela Martínez-Calvo. [doi]
- EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment AnalysisDavid Vilares, Miguel A. Alonso, Carlos Gómez-Rodríguez. [doi]
- VPS-GradeUp: Graded Decisions on Usage PatternsVít Baisa, Silvie Cinková, Ema Krejcová, Anna Vernerová. [doi]
- Exploring Language Variation Across Europe - A Web-based Tool for Computational SociolinguisticsDirk Hovy, Anders Johannsen. [doi]
- On Developing Resources for Patient-level Information RetrievalStephen T. Wu, Tamara Timmons, Amy Yates, Meikun Wang, Steven Bedrick, William R. Hersh, Hongfang Liu. [doi]
- Publishing the Trove Newspaper CorpusSteve Cassidy. [doi]
- PROMETHEUS: A Corpus of Proverbs Annotated with MetaphorsGözde Özbal, Carlo Strapparava, Serra Sinem Tekiroglu. [doi]
- Dialogue System Characterisation by Back-channelling Patterns Extracted from Dialogue CorpusMasashi Inoue, Hiroshi Ueno. [doi]
- QUEMDISSE? Reported speech in PortugueseCláudia Freitas, Bianca Freitas, Diana Santos. [doi]
- Legal Text Interpretation: Identifying Hohfeldian Relations from TextWim Peters, Adam Z. Wyner. [doi]
- Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA ProjectGuntis Barzdins, Steve Renals, Didzis Gosko. [doi]
- PE2rr Corpus: Manual Error Annotation of Automatically Pre-annotated MT Post-editsMaja Popovic, Mihael Arcan. [doi]
- Mining the Spoken Wikipedia for Speech Data and BeyondArne Köhn, Florian Stegen, Timo Baumann. [doi]
- Complementarity, F-score, and NLP EvaluationLeon Derczynski. [doi]
- Domain Adaptation for Named Entity Recognition Using CRFsTian Tian, Marco Dinarelli, Isabelle Tellier, Pedro Miguel Dias Cardoso. [doi]
- LibN3L: A Lightweight Package for Neural NLPMeishan Zhang, Jie Yang, Zhiyang Teng, Yue Zhang. [doi]
- Effects of Sampling on Twitter Trend DetectionAndrew Yates, Alek Kolcz, Nazli Goharian, Ophir Frieder. [doi]
- Discriminative Analysis of Linguistic Features for Typological StudyHiroya Takamura, Ryo Nagata, Yoshifumi Kawasaki. [doi]
- PreMOn: a Lemon Extension for Exposing Predicate Models as Linked DataFrancesco Corcoglioniti, Marco Rospocher, Alessio Palmero Aprosio, Sara Tonelli. [doi]
- Two Decades of Terminology: European Framework Programmes TitlesGabriella Pardelli, Sara Goggi, Silvia Giannini, Stefania Biagioni. [doi]
- TLT-CRF: A Lexicon-supported Morphological Tagger for Latin Based on Conditional Random FieldsTim vor der Brück, Alexander Mehler. [doi]
- Parallel Speech Corpora of Japanese DialectsKoichiro Yoshino, Naoki Hirayama, Shinsuke Mori, Fumihiko Takahashi, Katsutoshi Itoyama, Hiroshi G. Okuno. [doi]
- Using Word Embeddings to Translate Named EntitiesOctavia-Maria Sulea, Sergiu Nisioi, Liviu P. Dinu. [doi]
- An Annotated Corpus of Direct SpeechJohn Lee, Chak Yan Yeung. [doi]
- Adapting an Entity Centric Model for Portuguese Coreference ResolutionEvandro Brasil da Fonseca, Renata Vieira, Aline A. Vanin. [doi]
- POS-tagging of Historical DutchDieuwke Hupkes, Rens Bod. [doi]
- Best of Both Worlds: Making Word Sense Embeddings InterpretableAlexander Panchenko. [doi]
- On the Use of a Serious Game for Recording a Speech Corpus of People with Intellectual DisabilitiesMario Corrales-Astorgano, David Escudero Mancebo, Yurena Gutiérrez-González, Valle Flores-Lucas, César González Ferreras, Valentín Cardeñoso-Payo. [doi]
- First Steps Towards Coverage-Based Sentence AlignmentLuís Gomes 0002, Gabriel Pereira Lopes. [doi]
- Mapping Ontologies Using Ontologies: Cross-lingual Semantic Role Information TransferBalázs Indig, Márton Miháltz, András Simonyi. [doi]
- Latin Vallex. A Treebank-based Semantic Valency Lexicon for LatinMarco Passarotti, Berta González Saavedra, Christophe Onambélé. [doi]
- A Large-scale Recipe and Meal Data Collection as Infrastructure for Food ResearchJun Harashima, Michiaki Ariga, Kenta Murata, Masayuki Ioki. [doi]
- AfriBooms: An Online Treebank for AfrikaansLiesbeth Augustinus, Peter Dirix, Daniel R. van Niekerk, Ineke Schuurman, Vincent Vandeghinste, Frank Van Eynde, Gerhard B. Van Huyssteen. [doi]
- The on-line version of Grammatical Dictionary of PolishMarcin Wolinski, Witold Kieras. [doi]
- ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing ToolXiaofeng Wu, Jinhua Du, Qun Liu, Andy Way. [doi]
- Controlled Propagation of Concept Annotations in Textual CorporaCyril Grouin. [doi]
- Universal Dependencies v1: A Multilingual Treebank CollectionJoakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajic, Christopher D. Manning, Ryan T. McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman. [doi]
- WTF-LOD - A New Resource for Large-Scale NER EvaluationLubomir Otrusina, Pavel Smrz. [doi]
- BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance DomainsNecati Cihan Camgöz, Ahmet Alp Kindiroglu, Serpil Karabüklü, Meltem Kelepir, A. Sumru Özsoy, Lale Akarun. [doi]
- Building an Arabic Machine Translation Post-Edited Corpus: Guidelines and AnnotationWajdi Zaghouani, Nizar Habash, Ossama Obeid, Behrang Mohit, Houda Bouamor, Kemal Oflazer. [doi]
- Could Speaker, Gender or Age Awareness be beneficial in Speech-based Emotion Recognition?Maxim Sidorov, Alexander Schmitt, Eugene Semenkin, Wolfgang Minker. [doi]
- Corpus Annotation within the French FrameNet: a Domain-by-domain MethodologyMarianne Djemaa, Marie Candito, Philippe Muller, Laure Vieu. [doi]
- Towards a Multi-dimensional Taxonomy of Stories in DialogueKathryn J. Collins, David R. Traum. [doi]
- DUEL: A Multi-lingual Multimodal Dialogue Corpus for Disfluency, Exclamations and LaughterJulian Hough, Ye Tian, Laura E. de Ruiter, Simon Betz, Spyros Kousidis, David Schlangen, Jonathan Ginzburg. [doi]
- Corpus for Customer Purchase Behavior Prediction in Social MediaShigeyuki Sakaki, Francine Chen, Mandy Korpusik, Yan-Ying Chen. [doi]
- A Morphological Lexicon of Esperanto with Morpheme FrequenciesEckhard Bick. [doi]
- Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veteransHenk van den Heuvel, Nelleke Oostdijk. [doi]
- Deriving Morphological Analyzers from Example InflectionsMarkus Forsberg, Mans Hulden. [doi]
- EstNLTK - NLP Toolkit for EstonianSiim Orasmaa, Timo Petmanson, Alexander Tkachenko, Sven Laur, Heiki Jaan Kaalep. [doi]
- Parallel Sentence Extraction from Comparable Corpora with Neural Network FeaturesChenhui Chu, Raj Dabre, Sadao Kurohashi. [doi]
- A Comparative Study of Text Preprocessing Approaches for Topic Detection of User UtterancesRoman B. Sergienko, Muhammad Shan, Wolfgang Minker. [doi]
- A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C)Franco Salvetti, John B. Lowe, James H. Martin. [doi]
- "He Said She Said" ― a Male/Female Corpus of PolishFilip Gralinski, Lukasz Borchmann, Piotr Wierzchon. [doi]
- Exploiting a Large Strongly Comparable CorpusThierry Etchegoyhen, Andoni Azpeitia, Naiara Pérez. [doi]
- ARRAU: Linguistically-Motivated Annotation of Anaphoric DescriptionsOlga Uryupina, Ron Artstein, Antonella Bristot, Federica Cavicchio, Kepa Joseba Rodríguez, Massimo Poesio. [doi]
- A Dataset for Detecting Stance in TweetsSaif Mohammad, Svetlana Kiritchenko, Parinaz Sobhani, Xiao-Dan Zhu, Colin Cherry. [doi]
- Hidden Resources ― Strategies to Acquire and Exploit Potential Spoken Language Resources in National ArchivesJens Edlund, Joakim Gustafson. [doi]
- Evaluating a Topic Modelling Approach to Measuring Corpus SimilarityRichard Fothergill, Paul Cook, Timothy Baldwin. [doi]
- A Regional News Corpora for Contextualized Entity Discovery and LinkingAdrian Brasoveanu, Lyndon J. B. Nixon, Albert Weichselbraun, Arno Scharl. [doi]
- Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource LanguagesStefan Ecker, Andrea Horbach, Stefan Thater. [doi]
- Question-Answering with Logic Specific to Video GamesCorentin Dumont, Ran Tian, Kentaro Inui. [doi]
- Urdu Summary CorpusMuhammad Humayoun, Rao Muhammad Adeel Nawab, Muhammad Uzair, Saba Aslam, Omer Farzand. [doi]
- Managing Linguistic and Terminological Variation in a Medical Dialogue SystemLeonardo Campillos Llanos, Dhouha Bouamor, Pierre Zweigenbaum, Sophie Rosset. [doi]
- Construction of an English Dependency Corpus incorporating Compound Function WordsAkihiko Kato, Hiroyuki Shindo, Yuji Matsumoto. [doi]
- Towards Multiple Antecedent Coreference Resolution in Specialized DiscourseAlicia Burga, Sergio Cajal, Joan Codina-Filbà, Leo Wanner. [doi]
- Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space ModelsAmir Hazem, Emmanuel Morin. [doi]
- Recent Advances in Development of a Lexicon-Grammar of Polish: PolNet 3.0Zygmunt Vetulani, Grazyna Vetulani, Bartlomiej Kochanowski. [doi]
- Palabras: Crowdsourcing Transcriptions of L2 SpeechEric Sanders, Pepi Burgos, Catia Cucchiarini, Roeland Van Hout. [doi]
- LREC as a Graph: People and Resources in a NetworkRiccardo Del Gratta, Francesca Frontini, Monica Monachini, Gabriella Pardelli, Irene Russo, Roberto Bartolini, Fahad Khan, Claudia Soria, Nicoletta Calzolari. [doi]
- Improving corpus search via parsingNatalia Klyueva, Pavel Stranák. [doi]
- Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP TasksCarole Lailler, Anaïs Landeau, Frédéric Béchet, Yannick Estève, Paul Deléglise. [doi]
- Automatic Enrichment of WordNet with Common-Sense KnowledgeLuigi Di Caro, Guido Boella. [doi]
- SCALE: A Scalable Language Engineering ToolkitJoris Pelemans, Lyan Verwimp, Kris Demuynck, Hugo Van Hamme, Patrick Wambacq. [doi]
- Tools and Guidelines for Principled Machine Translation DevelopmentNora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ondrej Klejch, Martin Popel, Maja Popovic. [doi]
- Typed Entity and Relation Annotation on Computer Science PapersYuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao, Akiko Aizawa. [doi]
- A Corpus of Native, Non-native and Translated TextsSergiu Nisioi, Ella Rabinovich, Liviu P. Dinu, Shuly Wintner. [doi]
- Wikification for Scriptio ContinuaYugo Murawaki, Shinsuke Mori. [doi]
- Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-levelMarcos Garcia. [doi]
- Corpora for Learning the Mutual Relationship between Semantic Relatedness and Textual EntailmentNgoc Phuoc An Vo, Octavian Popescu. [doi]
- Evaluating Context Selection Strategies to Build Emotive Vector Space ModelsLucia C. Passaro, Alessandro Lenci. [doi]
- Detecting Implicit Expressions of Affect from Text using Semantic Knowledge on Common Concept PropertiesAlexandra Balahur, Hristo Tanev. [doi]
- SweLL on the rise: Swedish Learner Language corpus for European Reference Level studiesElena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg, Monica Sandell. [doi]
- Crowdsourcing Ontology LexiconsBettina Lanser, Christina Unger, Philipp Cimiano. [doi]
- The United Nations Parallel Corpus v1.0Michal Ziemski, Marcin Junczys-Dowmunt, Bruno Pouliquen. [doi]
- An Empirical Study of Arabic Formulaic Sequence Extraction MethodsAyman Alghamdi, Eric Atwell, Claire Brierley. [doi]
- Text Segmentation of Digitized Clinical TextsCyril Grouin. [doi]
- Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of SloveneNikola Ljubesic, Tomaz Erjavec. [doi]
- Coreference Annotation Scheme and Relation Types for HindiVandan Mujadia, Palash Gupta, Dipti Misra Sharma. [doi]
- ArchiMob - A Corpus of Spoken Swiss GermanTanja Samardzic, Yves Scherrer, Elvira Glaser. [doi]
- An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in TextEric Yeh, John Niekrasz, Dayne Freitag, Richard Rohwer. [doi]
- Semantic Relation Extraction with Semantic Patterns Experiment on Radiology ReportsMathieu Lafourcade, Lionel Ramadier. [doi]
- A Language Independent Method for Generating Large Scale Polarity LexiconsGiuseppe Castellucci, Danilo Croce, Roberto Basili. [doi]
- Training & Quality Assessment of an Optical Character Recognition Model for Northern HaidaIsabell Hubert, Antti Arppe, Jordan Lachler, Eddie Antonio Santos. [doi]
- Curation of Dutch Regional DictionariesHenk van den Heuvel, Eric Sanders, Nicoline van der Sijs. [doi]
- Markov Logic Networks for Text Mining: A Qualitative and Empirical Comparison with Integer Linear ProgrammingLuis Gerardo Mojica, Vincent Ng. [doi]
- Universal Dependencies for JapaneseTakaaki Tanaka, Yusuke Miyao, Masayuki Asahara, Sumire Uematsu, Hiroshi Kanayama, Shinsuke Mori, Yuji Matsumoto. [doi]
- QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six LanguagesArantxa Otegi, Nora Aranberri, António Branco, Jan Hajic, Martin Popel, Kiril Ivanov Simov, Eneko Agirre, Petya Osenova, Rita Valadas Pereira, João Ricardo Silva, Steven Neale. [doi]
- Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex ResourceAnaïs Tack, Thomas François, Anne-Laure Ligozat, Cédrick Fairon. [doi]
- An Interaction-Centric Dataset for Learning Automation Rules in Smart HomesKai Frederic Engelmann, Patrick Holthaus, Britta Wrede, Sebastian Wrede. [doi]
- Evaluation of the KIT Lecture Translation SystemMarkus Müller, Sarah Fünfer, Sebastian Stüker, Alex Waibel. [doi]
- CLARIN-EL Web-based Annotation ToolIoannis Manousos Katakis, Georgios Petasis, Vangelis Karkaletsis. [doi]
- Comparison of Emotional Understanding in Modality-Controlled Environments using Multimodal Online Emotional Communication CorpusYoshiko Arimoto, Kazuo Okanoya. [doi]
- Odin's Runes: A Rule Language for Information ExtractionMarco Antonio Valenzuela-Escárcega, Gus Hahn-Powell, Mihai Surdeanu. [doi]
- Finding Recurrent Features of Image Schema Gestures: the FIGURE corpusAndy Lücking, Alexander Mehler, Désirée Walther, Marcel Mauri, Dennis Kurfürst. [doi]
- Using SMT for OCR Error Correction of Historical TextsHaithem Afli, Zhengwei Qiu, Andy Way, Páraic Sheridan. [doi]
- New release of Mixer-6: Improved validity for phonetic study of speaker variation and identificationEleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur, John Godfrey. [doi]
- How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio NewsImran A. Sheikh, Irina Illina, Dominique Fohr. [doi]
- Error Typology and Remediation Strategies for Requirements Written in English by Non-Native SpeakersMarie Garnier, Patrick Saint-Dizier. [doi]
- A Finite-State Morphological Analyser for SindhiRaveesh Motlani, Francis M. Tyers, Dipti Misra Sharma. [doi]
- Wiktionnaire's Wikicode GLAWIfied: a Workable French Machine-Readable DictionaryNabil Hathout, Franck Sajous. [doi]
- MWEs in Treebanks: From Survey to GuidelinesVictoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejcek, Agata Savary, Petya Osenova. [doi]
- Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and EvaluationGorka Labaka, Iñaki Alegria, Kepa Sarasola. [doi]
- GATE-Time: Extraction of Temporal Expressions and EventsLeon Derczynski, Jannik Strötgen, Diana Maynard, Mark A. Greenwood, Manuel Jung. [doi]
- Multi-prototype Chinese Character EmbeddingYanan Lu, Yue Zhang, Dong-Hong Ji. [doi]
- The TYPALOC Corpus: A Collection of Various Dysarthric Speech Recordings in Read and Spontaneous StylesChristine Meunier, Cécile Fougeron, Corinne Fredouille, Brigitte Bigi, Lise Crevier-Buchman, Elisabeth Delais-Roussarie, Laurianne Georgeton, Alain Ghio, Imed Laaridh, Thierry Legou, Claire Pillot-Loiseau, Gilles Pouchoulin. [doi]
- Review on the Existing Language Resources for Languages of FranceThibault Grouas, Valérie Mapelli, Quentin Samier. [doi]
- The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015Mijail A. Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Marc Poch, Hugo Zaragoza. [doi]
- Detection of Major ASL Sign Types in Continuous Signing For ASL RecognitionPolina Yanovich, Carol Neidle, Dimitris N. Metaxas. [doi]
- Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual IntelligibilityAndrea Fischer, Klara Jagrova, Irina Stenger, Tania Avgustinova, Dietrich Klakow, Roland Marti. [doi]
- IRIS: English-Irish Machine Translation SystemMihael Arcan, Caoilfhionn Lane, Eoin Ó Droighneáin, Paul Buitelaar. [doi]
- FlexTag: A Highly Flexible PoS Tagging FrameworkTorsten Zesch, Tobias Horsmann. [doi]
- English-to-Japanese Translation vs. Dictation vs. Post-editing: Comparing Translation Modes in a Multilingual SettingMichael Carl, Akiko Aizawa, Masaru Yamada. [doi]
- OPFI: A Tool for Opinion Finding in PolishAleksander Wawer. [doi]
- Features for Generic Corpus QueryingThomas Eckart, Christoph Kuras, Uwe Quasthoff. [doi]
- Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption ContestDragomir R. Radev, Amanda Stent, Joel R. Tetreault, Aasish Pappu, Aikaterini Iliakopoulou, Agustin Chanfreau, Paloma de Juan, Jordi Vallmitjana, Alejandro Jaimes, Rahul Jha, Robert Mankoff. [doi]
- Enriching a Portuguese WordNet using Synonyms from a Monolingual DictionaryAlberto Simões, Xavier Gómez Guinovart, José João Almeida. [doi]
- The Query of Everything: Developing Open-Domain, Natural-Language Queries for BOLT Information RetrievalKira Griffitt, Stephanie Strassel. [doi]
- BulPhonC: Bulgarian Speech Corpus for the Development of ASR TechnologyNeli Hateva, Petar Mitankin, Stoyan Mihov. [doi]
- The BAS Speech Data RepositoryUwe D. Reichel, Florian Schiel, Thomas Kisler, Christoph Draxler, Nina Pörner. [doi]
- UIMA-Based JCoRe 2.0 Goes GitHub and Maven Central ― State-of-the-Art Software Resource Engineering and Distribution of NLP PipelinesUdo Hahn, Franz Matthies, Erik Faessler, Johannes Hellrich. [doi]
- Bilbo-Val: Automatic Identification of Bibliographical Zone in PapersAmal Htait, Sébastien Fournier, Patrice Bellot. [doi]
- Entity Linking with a Paraphrase FlavorMaria Pershina, Yifan He, Ralph Grishman. [doi]
- Comprehensive and Consistent PropBank Light Verb AnnotationClaire Bonial, Martha Palmer. [doi]
- The Methodius Corpus of Rhetorical Discourse Structures and Generated TextsAmy Isard. [doi]
- Detecting Annotation Scheme Variation in Out-of-Domain TreebanksYannick Versley, Julius Steen. [doi]
- Parallel Discourse Annotations on a Corpus of Short TextsManfred Stede, Stergos D. Afantenos, Andreas Peldszus, Nicholas Asher, Jérémy Perret. [doi]
- A Gold Standard for Scalar AdjectivesBryan Wilkinson, Tim Oates. [doi]
- OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV SubtitlesPierre Lison, Jörg Tiedemann. [doi]
- Axolotl: a Web Accessible Parallel Corpus for Spanish-NahuatlXimena Gutierrez-Vasques, Gerardo Sierra, Isaac Hernandez Pompa. [doi]
- Using BabelNet to Improve OOV Coverage in SMTJinhua Du, Andy Way, Andrzej Zydron. [doi]
- FABIOLE, a Speech Database for Forensic Speaker ComparisonMoez Ajili, Jean-François Bonastre, Juliette Kahn, Solange Rossato, Guillaume Bernard. [doi]
- Studying the Temporal Dynamics of Word Co-occurrences: An Application to Event DetectionDaniel Preotiuc-Pietro, P. K. Srijith, Mark Hepple, Trevor Cohn. [doi]
- Corpus Analysis based on Structural Phenomena in Texts: Exploiting TEI Encoding for Linguistic ResearchSusanne Haaf. [doi]
- Exploitation of Co-reference in Distributional SemanticsDominik Schlechtweg. [doi]
- OSMAN ― A Novel Arabic Readability MetricMahmoud El-Haj, Paul Rayson. [doi]
- Building a Dataset for Possessions Identification in TextCarmen Banea, Xi Chen, Rada Mihalcea. [doi]
- NNBlocks: A Deep Learning Framework for Computational Linguistics Neural Network ModelsFrederico Tommasi Caroli, André Freitas, João Carlos Pereira da Silva, Siegfried Handschuh. [doi]
- Mirroring Facial Expressions and Emotions in Dyadic ConversationsCostanza Navarretta. [doi]
- "LVF-lemon ― Towards a Linked Data Representation of ""Les Verbes français"""Ingrid Falk, Achim Stein. [doi]
- Laughter in French Spontaneous Conversational DialogsBrigitte Bigi, Roxane Bertrand. [doi]
- TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality ProfilingBen Verhoeven, Walter Daelemans, Barbara Plank. [doi]
- Lexical Resources to Enrich English Malayalam Machine TranslationSreelekha S, Pushpak Bhattacharyya. [doi]
- Towards Automatic Identification of Effective Clues for Team Word-Guessing GamesEli Pincus, David R. Traum. [doi]
- Using a Small Lexicon with CRFs Confidence Measure to Improve POS Tagging AccuracyMohamed Outahajala, Paolo Rosso. [doi]
- A Database of Laryngeal High-Speed Videos with Simultaneous High-Quality Audio Recordings of Pathological and Non-Pathological VoicesPhilipp Aichinger, Imme Roesner, Matthias Leonhard, Doris-Maria Denk-Linnert, Wolfgang Bigenzahn, Berit Schneider-Stickler. [doi]
- TGermaCorp - A (Digital) Humanities Resource for (Computational) LinguisticsAndy Lücking, Armin Hoenen, Alexander Mehler. [doi]
- Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of ViewAchim Stein. [doi]
- Government Domain Named Entity Recognition for South African LanguagesRoald Eiselen. [doi]
- Summ-it++: an Enriched Version of the Summ-it CorpusEvandro Brasil da Fonseca, André Antonitsch, Sandra Collovini, Daniela O. F. do Amaral, Renata Vieira, Anny Figueira. [doi]
- A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb CompoundsAndrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner, Manfred Pinkal. [doi]
- UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and ParsingMilan Straka, Jan Hajic, Jana Straková. [doi]
- A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog SystemsPatricia Braunger, Hansjörg Hofmann, Steffen Werner, Maria Schmidt. [doi]
- CEPLEXicon ― A Lexicon of Child European PortugueseAna Lúcia Santos, Maria João Freitas, Aida Cardoso. [doi]
- Cross-lingual RDF Thesauri InterlinkingTatiana Lesnikova, Jérôme David, Jérôme Euzenat. [doi]
- The DialogBankHarry Bunt, Volha Petukhova, Andrei Malchanau, Kars Wijnhoven, Alex Chengyu Fang. [doi]
- Two Years of Aranea: Increasing Counts and Tuning the PipelineVladimír Benko. [doi]
- Using lexical and Dependency Features to Disambiguate Discourse Connectives in HindiRohit Jain, Himanshu Sharma, Dipti Misra Sharma. [doi]
- Leveraging Native Data to Correct Preposition Errors in Learners' DutchLennart Kloppenburg, Malvina Nissim. [doi]
- A Corpus of Clinical Practice Guidelines Annotated with the Importance of RecommendationsJonathon Read, Erik Velldal, Marc Cavazza, Gersende Georg. [doi]
- Filtering Wiktionary Triangles by Linear Mbetween Distributed Word ModelsMárton Makrai. [doi]
- Crosswalking from CMDI to Dublin Core and MARC 21Claus Zinn, Thorsten Trippel, Steve Kaminski, Emanuel Dima. [doi]
- Argument Mining: the Bottleneck of Knowledge and Language ResourcesPatrick Saint-Dizier. [doi]
- Automatic identification of Mild Cognitive Impairment through the analysis of Italian spontaneous speech productionsDaniela Beltrami, Laura Calzà, Gloria Gagliardi, Enrico Ghidoni, Norina Marcello, Rema Rossini Favretti, Fabio Tamburini. [doi]
- A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue SystemAjda Gokcen, Evan Jaffe, Johnsey Erdmann, Michael White, Douglas Danforth. [doi]
- Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and CorpusVolha Petukhova, Christopher A. Stevens, Harmen de Weerd, Niels Taatgen, Fokie Cnossen, Andrei Malchanau. [doi]
- Sentiment Analysis in Social Networks through Topic modelingDebashis Naskar, Sidahmed Mokaddem, Miguel Rebollo, Eva Onaindia. [doi]
- A Finite-state Morphological Analyser for TuvanFrancis M. Tyers, Aziyana Bayyr-ool, Aelita Salchak, Jonathan Washington. [doi]
- Automatic Biomedical Term Polysemy DetectionJuan Antonio Lossio Ventura, Clement Jonquet, Mathieu Roche, Maguelonne Teisseire. [doi]
- Chatbot Technology with Synthetic Voices in the Acquisition of an Endangered Language: Motivation, Development and Evaluation of a Platform for IrishNeasa Ní Chiaráin, Ailbhe Ní Chasaide. [doi]
- The SemDaX Corpus ― Sense Annotations with Scalable Sense InventoriesBolette S. Pedersen, Anna Braasch, Anders Johannsen, Héctor Martínez Alonso, Sanni Nimb, Sussi Olsen, Anders Søgaard, Nicolai Hartvig Sørensen. [doi]
- Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated NovelHardik Vala, Stefan Dimitrov, David Jurgens, Andrew Piper, Derek Ruths. [doi]
- CItA: an L1 Italian Learners Corpus to Study the Development of Writing CompetenceAlessia Barbagli, Pietro Lucisano, Felice dell'Orletta, Simonetta Montemagni, Giulia Venturi. [doi]
- Parallel Chinese-English Entities, Relations and Events CorporaJustin Mott, Ann Bies, Zhiyi Song, Stephanie Strassel. [doi]
- Embedding Open-domain Common-sense Knowledge from TextTravis Goodwin, Sanda M. Harabagiu. [doi]
- Evaluation Set for Slovak News Information RetrievalDaniel Hládek, Ján Stas, Jozef Juhár. [doi]
- A Sequence Model Approach to Relation Extraction in PortugueseSandra Collovini, Gabriel Machado, Renata Vieira. [doi]
- The IPR-cleared Corpus of Contemporary Written and Spoken Romanian LanguageDan Tufis, Verginica Barbu Mititelu, Elena Irimia, Stefan Daniel Dumitrescu, Tiberiu Boros. [doi]
- Relation- and Phrase-level Linking of FrameNet with Sar-graphsAleksandra Gabryszak, Sebastian Krause, Leonhard Hennig, Feiyu Xu, Hans Uszkoreit. [doi]
- Facilitating Metadata Interoperability in CLARIN-DKLene Offersgaard, Dorte Haltrup Hansen. [doi]
- Accuracy of Automatic Cross-Corpus Emotion Labeling for Conversational Speech Corpus CommonizationHiroki Mori, Atsushi Nagaoka, Yoshiko Arimoto. [doi]
- Cysill Ar-lein: A Corpus of Written Contemporary Welsh Compiled from an On-line Spelling and Grammar CheckerDelyth Prys, Gruffudd Prys, Dewi Bryn Jones. [doi]
- Cognitively Motivated Distributional Representations of MeaningElias Iosif, Spiros Georgiladakis, Alexandros Potamianos. [doi]
- Factuality Annotation and Learning in Spanish TextsDina Wonsever, Aiala Rosá, Marisa Malcuori. [doi]
- Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb RelationsMaria Sukhareva, Judith Eckle-Kohler, Ivan Habernal, Iryna Gurevych. [doi]
- Compasses, Magnets, Water Microscopes: Annotation of Terminology in a Diachronic Corpus of Scientific TextsAnne-Kathrin Schumann, Stefan Fischer. [doi]
- A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService CorpusMauro Nicolao, Heidi Christensen, Stuart P. Cunningham, Phil D. Green, Thomas Hain. [doi]
- SPLIT: Smart Preprocessing (Quasi) Language Independent ToolMohamed Al-Badrashiny, Arfath Pasha, Mona T. Diab, Nizar Habash, Owen Rambow, Wael Salloum, Ramy Eskander. [doi]
- GRaSP: A Multilayered Annotation Scheme for PerspectivesChantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo, Piek Vossen. [doi]
- DRANZIERA: An Evaluation Protocol For Multi-Domain Opinion MiningMauro Dragoni, Andrea Tettamanzi, Célia da Costa Pereira. [doi]
- NLP and Public Engagement: The Case of the Italian School ReformTommaso Caselli, Giovanni Moretti, Rachele Sprugnoli, Sara Tonelli, Damien Lanfrey, Donatella Solda Kutzmann. [doi]
- Arabic Corpora for Credibility AnalysisAyman Al Zaatari, Rim El Ballouli, Shady Elbassuoni, Wassim El-Hajj, Hazem M. Hajj, Khaled B. Shaban, Nizar Habash, Emad Yahya. [doi]
- Specialising Paragraph Vectors for Text Polarity DetectionFabio Tamburini. [doi]
- Universal Dependencies for NorwegianLilja Øvrelid, Petter Hohle. [doi]
- Standard Test Collection for English-Persian Cross-Lingual Word Sense DisambiguationNavid Rekabsaz, Serwah Sabetghadam, Mihai Lupu, Linda Andersson, Allan Hanbury. [doi]
- Learning from Within? Comparing PoS Tagging Approaches for Historical TextSarah Schulz, Jonas Kuhn. [doi]
- CORILSE: a Spanish Sign Language Repository for Linguistic AnalysisMaría del Carmen Cabeza-Pereiro, José M. García-Miguel, Carmen García-Mateo, José Luis Alba-Castro. [doi]
- Differentia compositionem facit. A Slower-Paced and Reliable Parser for LatinEdoardo Maria Ponti, Marco Passarotti. [doi]
- Multi-language Speech Collection for NIST LREKaren Jones, Stephanie Strassel, Kevin Walker, David Graff, Jonathan Wright. [doi]
- Solving the AL Chicken-and-Egg Corpus and Model Problem: Model-free Active Learning for Phenomena-driven Corpus ConstructionDain Kaplan, Neil Rubens, Simone Teufel, Takenobu Tokunaga. [doi]
- Building Concept Graphs from Monolingual Dictionary EntriesGábor Recski. [doi]
- OCR Post-Correction Evaluation of Early Dutch Books Online - RevisitedMartin Reynaert. [doi]
- Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease EventsStephen T. Wu, Chung-Il Wi, Sunghwan Sohn, Hongfang Liu, Young J. Juhn. [doi]
- The hunvec framework for NN-CRF-based sequential taggingKatalin Pajkossy, Attila Zséder. [doi]
- Evaluating Machine Translation in a Usage ScenarioRosa Del Gaudio, Aljoscha Burchardt, António Branco. [doi]
- Annotating Named Entities in Consumer Health QuestionsHalil Kilicoglu, Asma Ben Abacha, Yassine Mrabet, Kirk Roberts, Laritza Rodriguez, Sonya E. Shooshan, Dina Demner-Fushman. [doi]
- A lexicon of perception for the identification of synaesthetic metaphors in corporaFrancesca Strik Lievers, Chu-Ren Huang. [doi]
- Graph-Based Induction of Word Senses in CroatianMarko Bekavac, Jan Snajder. [doi]
- The LetsRead Corpus of Portuguese Children Reading Aloud for Performance EvaluationJorge Proença, Dirce Celorico, Sara Candeias, Carla Lopes, Fernando Perdigão. [doi]
- Medical Concept Embeddings via Labeled Background CorporaEneldo Loza Mencía, Gerard de Melo, Jinseok Nam. [doi]
- Assessing the Prosody of Non-Native Speakers of English: Measures and Feature SetsEduardo Coutinho, Florian Hönig, Yue Zhang, Simone Hantke, Anton Batliner, Elmar Nöth, Björn W. Schuller. [doi]
- Constructing a Norwegian Academic WordlistJanne Bondi Johannessen, Arash Saidi, Kristin Hagen. [doi]
- A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational PairsJackson Tolins, Kris Liu, Yingying Wang, Jean E. Fox Tree, Marilyn A. Walker, Michael Neff. [doi]
- Open Data Vocabularies for Assigning Usage Rights to Data Resources from Translation ProjectsDavid Lewis, Kaniz Fatema, Alfredo Maldonado, Brian Walshe, Arturo Calvo. [doi]
- Evaluating Unsupervised Dutch Word Embeddings as a Linguistic ResourceStéphan Tulkens, Chris Emmery, Walter Daelemans. [doi]
- Using Contextual Information for Machine Translation EvaluationMarina Fomicheva, Núria Bel. [doi]
- Detecting Optional Arguments of VerbsAndrás Kornai, Dávid Márk Nemeskey, Gábor Recski. [doi]
- SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language LearnersThomas François, Elena Volodina, Ildikó Pilán, Anaïs Tack. [doi]
- Simultaneous Sentence Boundary Detection and Alignment with Pivot-based Machine Translation Generated LexiconsAntoine Bourlon, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi. [doi]
- A Document Repository for Social Media and Speech ConversationsAdam Funk, Robert J. Gaizauskas, Benoît Favre. [doi]
- CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright LawsRoland Schäfer. [doi]
- Semi-automatically Alignment of Predicates between Speech and OntoNotes dataNiraj Shrestha, Marie-Francine Moens. [doi]
- Cross-validating Image Description Datasets and Evaluation MetricsJosiah Wang, Robert J. Gaizauskas. [doi]
- Identification of Drug-Related Medical Conditions in Social MediaFrançois Morlane-Hondère, Cyril Grouin, Pierre Zweigenbaum. [doi]
- Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific LiteratureKata Gábor, Haïfa Zargayouna, Davide Buscaldi, Isabelle Tellier, Thierry Charnois. [doi]
- Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous SpeechImed Laaridh, Corinne Fredouille, Christine Meunier. [doi]
- Phoneme Alignment Using the Information on Phonological Processes in Continuous SpeechDaniil Kocharov. [doi]
- Lin|gu|is|tik: Building the Linguist's Pathway to Bibliographies, Libraries, Language Resources and Linked Open DataChristian Chiarcos, Christian Fäth, Heike Renner-Westermann, Frank Abromeit, Vanya Dimitrova. [doi]
- The OpenCourseWare Metadiscourse (OCWMD) CorpusGhada AlHarbi, Thomas Hain. [doi]
- EDISON: Feature Extraction for NLP, SimplifiedMark Sammons, Christos Christodoulopoulos, Parisa KordJamshidi, Daniel Khashabi, Vivek Srikumar, Dan Roth. [doi]
- Lexical Coverage Evaluation of Large-scale Multilingual Semantic Lexicons for Twelve LanguagesScott Piao, Paul Rayson, Dawn Archer, Francesca Bianchi, Carmen Dayrell, Mahmoud El-Haj, Ricardo-María Jiménez, Dawn Knight, Michal Kren, Laura Löfberg, Rao Muhammad Adeel Nawab, Jawad Shafi, Phoey-Lee Teh, Olga Mudraya. [doi]
- The IFCASL Corpus of French and German Non-native and Native Read SpeechJürgen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius, Frank Zimmerer. [doi]
- Wow! What a Useful Extension! Introducing Non-Referential Concepts to WordnetLuís Morgado da Costa, Francis Bond. [doi]
- CHATR the Corpus; a 20-year-old archive of Concatenative Speech SynthesisNick Campbell. [doi]
- Addressing the MFS Bias in WSD systemsMarten Postma, Rubén Izquierdo, Eneko Agirre, German Rigau, Piek Vossen. [doi]
- Finding Alternative Translations in a Large Corpus of Movie SubtitleJörg Tiedemann. [doi]
- The dialogue breakdown detection challenge: Task description, datasets, and evaluation metricsRyuichiro Higashinaka, Kotaro Funakoshi, Yuka Kobayashi, Michimasa Inaba. [doi]
- A Semantically Compositional Annotation Scheme for Time NormalizationSteven Bethard, Jonathan Parker. [doi]
- TweetMT: A Parallel Microblog CorpusIñaki San Vicente, Iñaki Alegria, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez Garcia, Antonio Toral, Arkaitz Zubiaga, Nora Aranberri. [doi]
- Transfer-Based Learning-to-Rank Assessment of Medical Term TechnicalityDhouha Bouamor, Leonardo Campillos Llanos, Anne-Laure Ligozat, Sophie Rosset, Pierre Zweigenbaum. [doi]
- A Large Scale Corpus of Gulf ArabicSalam Khalifa, Nizar Habash, Dana Abdulrahim, Sara Hassan. [doi]
- Applying the Cognitive Machine Translation Evaluation Approach to ArabicIrina P. Temnikova, Wajdi Zaghouani, Stephan Vogel, Nizar Habash. [doi]
- Subtask Mining from Search Query Logs for How-Knowledge AccelerationChung-Lun Kuo, Hsin-Hsi Chen. [doi]
- Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability ClassificationSimone Hantke, Erik Marchi, Björn W. Schuller. [doi]
- South African National Centre for Digital Language ResourcesJustus Roux. [doi]
- E-TIPSY: Search Query Corpus Annotated with Entities, Term Importance, POS Tags, and Syntactic ParsesYuval Marton, Kristina Toutanova. [doi]
- The Language Application Grid and GalaxyNancy Ide, Keith Suderman, James Pustejovsky, Marc Verhagen, Christopher Cieri. [doi]
- The Royal Society Corpus: From Uncharted Data to CorpusHannah Kermes, Stefania Degaetano-Ortlieb, Ashraf Khamis, Jörg Knappen, Elke Teich. [doi]
- Comparing the Level of Code-Switching in CorporaBjörn Gambäck, Amitava Das. [doi]
- The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real DataMiguel Matos, Alberto Abad, António Joaquim Serralheiro. [doi]
- Analysis of English Spelling Errors in a Word-Typing GameRyuichi Tachibana, Mamoru Komachi. [doi]
- AIMU: Actionable Items for Meeting UnderstandingYun-Nung Chen, Dilek Z. Hakkani-Tür. [doi]
- Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love AffairNikola Ljubesic, Miquel Esplà-Gomis, Antonio Toral, Sergio Ortiz-Rojas, Filip Klubicka. [doi]
- Creating a Large Multi-Layered Representational Repository of Linguistic Code Switched Arabic DataMona T. Diab, Mahmoud-Ghoneim, Abdelati Hawwari, Fahad AlGhamdi, Nada AlMarwani, Mohamed Al-Badrashiny. [doi]
- Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic ClassificationMaximilian Köper, Melanie Zaiß, Qi Han, Steffen Koch, Sabine Schulte im Walde. [doi]
- Selection Criteria for Low Resource Language ProgramsChristopher Cieri, Mike Maxwell, Stephanie Strassel, Jennifer Tracey. [doi]
- European Union Language Resources in Sketch EngineVít Baisa, Jan Michelfeit, Marek Medved, Milos Jakubícek. [doi]
- The Open Linguistics Working Group: Developing the Linguistic Linked Open Data CloudJohn P. McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora, Jonathan Pool. [doi]
- Word Segmentation for Akkadian CuneiformTimo Homburg, Christian Chiarcos. [doi]
- Modelling a Parallel Corpus of French and French Belgian Sign LanguageLaurence Meurant, Maxime Gobert, Anthony Cleve. [doi]
- Event Coreference Resolution with Multi-Pass SievesJing Lu, Vincent Ng. [doi]
- Word Embedding Evaluation and CombinationSahar Ghannay, Benoît Favre, Yannick Estève, Nathalie Camelin. [doi]
- PentoRef: A Corpus of Spoken References in Task-oriented DialoguesSina Zarrieß, Julian Hough, Casey Kennington, Ramesh R. Manuvinakurike, David DeVault, Raquel Fernández, David Schlangen. [doi]
- Sentence Similarity based on Dependency Tree Kernels for Multi-document SummarizationSaziye Betül Özates, Arzucan Özgür, Dragomir R. Radev. [doi]
- Challenges of Adjective Mapping between plWordNet and Princeton WordNetEwa Rudnicka, Wojciech Witkowski, Katarzyna Podlaska. [doi]
- Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni ArabicFaisal Al-Shargi, Aidan Kaplan, Ramy Eskander, Nizar Habash, Owen Rambow. [doi]
- Focus Annotation of Task-based Data: A Comparison of Expert and Crowd-Sourced Annotation in a Reading Comprehension CorpusKordula De Kuthy, Ramon Ziai, Detmar Meurers. [doi]
- Concepticon: A Resource for the Linking of Concept ListsJohann-Mattis List, Michael Cysouw, Robert Forkel. [doi]
- What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter EmojisFrancesco Barbieri, Francesco Ronzano, Horacio Saggion. [doi]
- Name Translation based on Fine-grained Named Entity Recognition in a Single LanguageKugatsu Sadamitsu, Itsumi Saito, Taichi Katayama, Hisako Asano, Yoshihiro Matsuo. [doi]
- Syntax-based Multi-system Machine TranslationMatiss Rikters, Inguna Skadina. [doi]
- Arabic to English Person Name Transliteration using TwitterHamdy Mubarak, Ahmed Abdelali. [doi]
- A Large-Scale Multilingual Disambiguation of GlossesJosé Camacho-Collados, Claudio Delli Bovi, Alessandro Raganato, Roberto Navigli. [doi]
- The Public License Selector:
Making Open Licensing EasierPawel Kamocki, Pavel Stranák, Michal Sedlák. [doi]
- Annotating and Detecting Medical Events in Clinical NotesPrescott Klassen, Fei Xia, Meliha Yetisgen. [doi]
- LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource LanguagesStephanie Strassel, Jennifer Tracey. [doi]
- Riddle Generation using Word AssociationsPaloma Galvan, Virginia Francisco, Raquel Hervás, Gonzalo Méndez 0001. [doi]
- Hypergraph Modelization of a Syntactically Annotated English Wikipedia DumpEdmundo-Pavel Soriano-Morales, Julien Ah-Pine, Sabine Loudcher. [doi]
- Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward CompatibilityAnaïs Lefeuvre-Halftermeyer, Jean-Yves Antoine, Alain Couillault, Emmanuel Schang, Lotfi Abouda, Agata Savary, Denis Maurel, Iris Eshkol-Taravella, Delphine Battistelli. [doi]
- SCARE ― The Sentiment Corpus of App Reviews with Fine-grained Annotations in GermanMario Sänger, Ulf Leser, Steffen Kemmerer, Peter Adolphs, Roman Klinger. [doi]
- Enhancing Access to Online Education: Quality Machine Translation of MOOC ContentValia Kordoni, Antal van den Bosch, Katia Lida Kermanidis, Vilelmini Sosoni, Kostadin Cholakov, Iris Hendrickx, Matthias Huck, Andy Way. [doi]
- Domain-Specific Corpus Expansion with Focused WebcrawlingSteffen Remus, Chris Biemann. [doi]
- B2SG: a TOEFL-like Task for PortugueseRodrigo Wilkens, Leonardo Zilio, Eduardo Ferreira, Aline Villavicencio. [doi]
- A Verbal and Gestural Corpus of Story Retellings to an Expressive Embodied Virtual CharacterJackson Tolins, Kris Liu, Michael Neff, Marilyn A. Walker, Jean E. Fox Tree. [doi]
- Japanese Word―Color Associations with and without ContextsJun Harashima. [doi]
- metaTED: a Corpus of Metadiscourse for Spoken LanguageRui Correia, Nuno J. Mamede, Jorge Baptista, Maxine Eskénazi. [doi]
- ALT Explored: Integrating an Online Dialectometric Tool and an Online Dialect AtlasMartijn Wieling, Eva Sassolini, Sebastiana Cucurullo, Simonetta Montemagni. [doi]
- Evaluating the Impact of Light Post-Editing on UsabilitySheila Castilho, Sharon O'Brien. [doi]
- Sentiframes: A Resource for Verb-centered German Sentiment InferenceManfred Klenner, Michael Amsler. [doi]
- Distribution of Valency Complements in Czech Complex Predicates: Between Verb and NounVáclava Kettnerová, Eduard Bejcek. [doi]
- Automatic Corpus Extension for Data-driven Natural Language GenerationElena Manishina, Bassam Jabaian, Stéphane Huet, Fabrice Lefèvre. [doi]
- Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource LanguagesJohn Sylak-Glassman, Christo Kirov, David Yarowsky. [doi]
- Bootstrapping a Hybrid MT System to a New Language PairJoão António Rodrigues, Nuno Rendeiro, Andreia Querido, Sanja Stajner, António Branco. [doi]
- A comparison of Named-Entity Disambiguation and Word Sense DisambiguationAngel X. Chang, Valentin I. Spitkovsky, Christopher D. Manning, Eneko Agirre. [doi]
- Towards Lexical Encoding of Multi-Word Expressions in Spanish DialectsDiana Bogantes, Eric Rodríguez, Alejandro Arauco, Alejandro Rodríguez, Agata Savary. [doi]
- South African Language Resources: Phrase ChunkingRoald Eiselen. [doi]
- Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media StoriesProkopis Prokopidis, Vassilis Papavassiliou, Stelios Piperidis. [doi]
- Refurbishing a Morphological Database for GermanPetra Steiner. [doi]
- TermoPL - a Flexible Tool for Terminology ExtractionMalgorzata Marciniak, Agnieszka Mykowiecka, Piotr Rychlik. [doi]
- Learning Thesaurus Relations from Distributional FeaturesRosa Tsegaye Aga, Christian Wartena, Lucas Drumond, Lars Schmidt-Thieme. [doi]
- PersonaBank: A Corpus of Personal Narratives and Their Story Intention GraphsStephanie M. Lukin, Kevin Bowden, Casey Barackman, Marilyn A. Walker. [doi]
- Discriminating Similar Languages: Evaluations and ExplorationsCyril Goutte, Serge Léger, Shervin Malmasi, Marcos Zampieri. [doi]
- Identifying Content Types of Messages Related to Open Source Software ProjectsIoannis Korkontzelos, Paul Thompson, Sophia Ananiadou. [doi]
- A Multilingual Predicate MatrixMaddalen Lopez de Lacalle, Egoitz Laparra, Itziar Aldabe, German Rigau. [doi]
- DeQue: A Lexicon of Complex Prepositions and Conjunctions in FrenchCarlos Ramisch, Alexis Nasr, André Valli, José Deulofeu. [doi]
- Modeling Language Change in Historical Corpora: The Case of PortugueseMarcos Zampieri, Shervin Malmasi, Mark Dras. [doi]
- Building Tempo-HindiWordNet: A resource for effective temporal information access in HindiDipawesh Pawar, Mohammed Hasanuzzaman, Asif Ekbal. [doi]
- Punctuation Prediction for Unsegmented Transcript Based on Word VectorXiaoyin Che, Cheng Wang, Haojin Yang, Christoph Meinel. [doi]
- Reliable Baselines for Sentiment Analysis in Resource-Limited Languages: The Serbian Movie Review DatasetVuk Batanovic, Bosko Nikolic, Milan Milosavljevic. [doi]
- An Extension of the Slovak Broadcast News Corpus based on Semi-Automatic AnnotationPeter Viszlay, Ján Stas, Tomás Koctúr, Martin Lojka, Jozef Juhár. [doi]
- Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and CrowdsourcingManuel Burghardt, Daniel Granvogl, Christian Wolff. [doi]
- Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015Johann Poignant, Hervé Bredin, Claude Barras, Mickaël Stefas, Pierrick Bruneau, Thomas Tamisier. [doi]
- Age and Gender Prediction on Health Forum DataPrasha Shrestha, Nicolas Rey-Villamizar, Farig Sadeque, Ted Pedersen, Steven Bethard, Thamar Solorio. [doi]
- A Turkish-German Code-Switching CorpusÖzlem Çetinoglu. [doi]
- SPA: Web-based Platform for easy Access to Speech Processing ModulesFernando Batista, Pedro Curto, Isabel Trancoso, Alberto Abad, Jaime Ferreira, Eugénio Ribeiro, Helena Moniz, David Martins de Matos, Ricardo Ribeiro 0001. [doi]
- A Large Rated Lexicon with French Medical WordsNatalia Grabar, Thierry Hamon. [doi]
- Multimodal Resources for Human-Robot Communication ModellingStavroula-Evita Fotinea, Eleni Efthimiou, Maria Koutsombogera, Athanasia-Lida Dimou, Theodore Goulas, Kyriaki Vasilaki. [doi]
- Coordinating Communication in the Wild: The Artwalk Dialogue Corpus of Pedestrian Navigation and Mobile Referential CommunicationKris Liu, Jean E. Fox Tree, Marilyn A. Walker. [doi]
- corpus-tools.org: An Interoperable Generic Software Tool Set for Multi-layer Linguistic CorporaStephan Druskat, Volker Gast, Thomas Krause, Florian Zipser. [doi]
- An Open Corpus for Named Entity Recognition in Historic NewspapersClemens Neudecker. [doi]
- Summarizing Behaviours: An Experiment on the Annotation of Call-Centre ConversationsMorena Danieli, Balamurali A. R., Evgeny A. Stepanov, Benoît Favre, Frédéric Béchet, Giuseppe Riccardi. [doi]
- Domain Ontology Learning Enhanced by Optimized Relation Instance in DBpediaLiumingjing Xiao, Chong Ruan, an Yang, Junhao Zhang, Junfeng Hu. [doi]
- Corpus-Based Diacritic Restoration for South Slavic LanguagesNikola Ljubesic, Tomaz Erjavec, Darja Fiser. [doi]
- What's the Issue Here?: Task-based Evaluation of Reader Comment Summarization SystemsEmma Barker, Monica Lestari Paramita, Adam Funk, Emina Kurtic, Ahmet Aker, Jonathan Foster, Mark Hepple, Robert J. Gaizauskas. [doi]
- SuperCAT: The (New and Improved) Corpus Analysis ToolkitK. Bretonnel Cohen, William A. Baumgartner Jr., Irina P. Temnikova. [doi]
- Datasets for Aspect-Based Sentiment Analysis in FrenchMarianna Apidianaki, Xavier Tannier, Cécile Richart. [doi]
- Neural Scoring Function for MST ParserJindrich Libovický. [doi]
- DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training CorpusMartin Brümmer, Milan Dojchinovski, Sebastian Hellmann. [doi]
- The Trials and Tribulations of Predicting Post-Editing ProductivityLena Marg. [doi]
- ANTUSD: A Large Chinese Sentiment DictionaryShih-Ming Wang, Lun-Wei Ku. [doi]
- Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on RomanianLauriane Aufrant, Guillaume Wisniewski, François Yvon. [doi]
- Bilingual Lexicon Extraction at the Morpheme Level Using Distributional AnalysisAmir Hazem, Béatrice Daille. [doi]
- CoRuSS - a New Prosodically Annotated Corpus of Russian Spontaneous SpeechTatiana Kachkovskaia, Daniil Kocharov, Pavel A. Skrelin, Nina B. Volskaya. [doi]
- A Study of Reuse and Plagiarism in LREC papersGil Francopoulo, Joseph Mariani, Patrick Paroubek. [doi]
- Tēzaurs.lv: the Largest Open Lexical Database for LatvianAndrejs Spektors, Ilze Auzina, Roberts Dargis, Normunds Gruzitis, Peteris Paikens, Lauma Pretkalnina, Laura Rituma, Baiba Saulite. [doi]
- Annotating Discourse Relations in Spoken Language: A Comparison of the PDTB and CCR FrameworksInes Rehbein, Merel Scholman, Vera Demberg. [doi]
- Predictive Modeling: Guessing the NLP Terms of TomorrowGil Francopoulo, Joseph Mariani, Patrick Paroubek. [doi]
- Extending Monolingual Semantic Textual Similarity Task to Multiple Cross-lingual SettingsYoshihiko Hayashi, Wentao Luo. [doi]
- Wikipedia Titles As Noun Tag PredictorsArmin Hoenen. [doi]
- Coreference in Prague Czech-English Dependency TreebankAnna Nedoluzhko, Michal Novák, Silvie Cinková, Marie Mikulová, Jirí Mírovský. [doi]
- Fostering digital representation of EU regional and minority languages: the Digital Language Diversity ProjectClaudia Soria, Irene Russo, Valeria Quochi, Davyth Hicks, Antton Gurrutxaga, Anneli Sarhimaa, Matti Tuomisto. [doi]
- Polarity Lexicon Building: to what Extent Is the Manual Effort Worth?Iñaki San Vicente, Xabier Saralegi. [doi]
- Fine-Grained Chinese Discourse Relation LabellingHuan-Yuan Chen, Wan-Shan Liao, Hen-Hsen Huang, Hsin-Hsi Chen. [doi]
- CLARIAH in the NetherlandsJan Odijk. [doi]
- Cro36WSD: A Lexical Sample for Croatian Word Sense DisambiguationDomagoj Alagic, Jan Snajder. [doi]
- Forecasting Emerging Trends from Scientific LiteratureKartik Asooja, Georgeta Bordea, Gabriela Vulcu, Paul Buitelaar. [doi]
- Analyzing Time Series Changes of Correlation between Market Share and Concerns on Companies measured through Search Engine SuggestsTakakazu Imada, Yusuke Inoue, Lei Chen, Syunya Doi, Tian Nie, Chen Zhao, Takehito Utsuro, Yasuhide Kawada. [doi]
- Manual and Automatic Paraphrases for MT EvaluationAles Tamchyna, Petra Barancikova. [doi]
- A Proposition Bank of UrduMaaz Anwar, Riyaz Ahmad Bhat, Dipti Misra Sharma, Ashwini Vaidya, Martha Palmer, Tafseer Ahmed Khan. [doi]
- Using a Language Technology Infrastructure for German in order to Anonymize German Sign Language Corpus DataJulian Bleicken, Thomas Hanke, Uta Salden, Sven Wagner. [doi]
- New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and SerbianNikola Ljubesic, Filip Klubicka, Zeljko Agic, Ivo-Pavao Jazbec. [doi]
- LELIO: An Auto-Adaptative System to Acquire Domain Lexical Knowledge in Technical Texts