945 | -- | 961 | Yubin Qian. Mining culture from professional discourse: a lexicon-based hybrid method |
963 | -- | 984 | Martha Yifiru Tachbelie, Solomon Teferra Abate. Lexical modeling for the development of Amharic automatic speech recognition systems |
985 | -- | 1009 | Ege Kesim, Tuge Numanoglu, Öykü Zeynep Bayramoglu, Bekir Berker Turker, Nusrah Hussain, T. Metin Sezgin, Yucel Yemez, Engin Erzin. The eHRI database: a multimodal database of engagement in human-robot interactions |
1011 | -- | 1043 | Nabil Ababou, Azzeddine Mazroui, Rachid Belehbib. From extended chunking to dependency parsing using traditional Arabic grammar |
1045 | -- | 1079 | Veronika Laippala, Samuel Rönnqvist, Miika Oinonen, Aki-Juhani Kyröläinen, Anna Salmela, Douglas Biber, Jesse Egbert, Sampo Pyysalo. Register identification from the unrestricted open Web using the Corpus of Online Registers of English |
1081 | -- | 1105 | Sean Trott, Benjamin K. Bergen 0001, Eva Wittenberg. Spontaneous, controlled acts of reference between friends and strangers |
1107 | -- | 1137 | Tommaso Caselli, Johan Bos. Investigating interoperable event corpora: limitations of reusability of resources and portability of models |
1139 | -- | 1171 | Arnaldo Cândido Júnior, Edresson Casanova, Anderson da Silva Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio. CORAA ASR: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese |
1173 | -- | 1206 | Oliver Hellwig, Sebastian Nehrdich, Sven Sellmer. Data-driven dependency parsing of Vedic Sanskrit |
1207 | -- | 1262 | Natalie Weber, Tyler Brown, Joshua Celli, McKenzie Denham, Hailey Dykstra, Rodrigo Hernandez-Merlin, Evan Hochstein, Pinyu Hwang, Nico Kidd, Diana Kulmizev, Hannah Morrison, Matty Norris, Lena Venkatraman. Blackfoot Words: a database of Blackfoot lexical forms |
1263 | -- | 1293 | Mingzhou Xu, Longyue Wang, Siyou Liu, Derek F. Wong, Shuming Shi 0001, Zhaopeng Tu. A benchmark dataset and evaluation methodology for Chinese zero pronoun translation |
1295 | -- | 1327 | Anssi Moisio, Dejan Porjazovski, Aku Rouhe, Yaroslav Getman, Anja Virkkunen, Ragheb Al-Ghezi, Mietta Lennes, Tamás Grósz, Krister Lindén, Mikko Kurimo. Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks |
1329 | -- | 1359 | Yftah Ziser, Bonnie Webber, Shay B. Cohen. Rant or rave: variation over time in the language of online reviews |
1361 | -- | 1387 | Rachana Gusain, Satya Ranjan Dash, Shantipriya Parida, Girish Nath Jha. Automatic language identification: a case study of Pahari languages |
1389 | -- | 1403 | Chenhui Chu, Zhuoyuan Mao, Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohashi. SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain |
1405 | -- | 1422 | Brygida Sawicka-Stepinska. Design and construction of Guayaquil radio speech corpus (CHARG) |
1423 | -- | 1430 | Xi Huang 0001, Hong Xu. Rei Miyata: controlled document authoring in a machine translation age |