- Frontmatter
- Entity Recognition at First Sight: Improving NER with Eye Movement Information. Nora Hollenstein, Ce Zhang. 1-10
- The emergence of number and syntax units in LSTM language models. Yair Lakretz, Germán Kruszewski, Theo Desbordes, Dieuwke Hupkes, Stanislas Dehaene, Marco Baroni. 11-20
- Neural Self-Training through Spaced Repetition. Hadi Amiri. 21-31
- Neural language models as psycholinguistic subjects: Representations of syntactic state. Richard Futrell, Ethan Wilcox, Takashi Morita, Peng Qian, Miguel Ballesteros, Roger Levy. 32-42
- Understanding language-elicited EEG data by predicting it from a fine-tuned language model. Dan Schwartz, Tom M. Mitchell. 43-57
- Pre-training on high-resource speech recognition improves low-resource speech-to-text translation. Sameer Bansal, Herman Kamper, Karen Livescu, Adam Lopez, Sharon Goldwater. 58-68
- Measuring the perceptual availability of phonological features during language acquisition using unsupervised binary stochastic autoencoders. Cory Shain, Micha Elsner. 69-85
- Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection. Vicky Zayats, Mari Ostendorf. 86-95
- Massively Multilingual Adversarial Speech Recognition. Oliver Adams, Matthew Wiesner, Shinji Watanabe, David Yarowsky. 96-108
- Lost in Interpretation: Predicting Untranslated Terminology in Simultaneous Interpretation. Nikolai Vogler, Craig Stewart, Graham Neubig. 109-118
- AudioCaps: Generating Captions for Audios in The Wild. Chris Dongjoo Kim, Byeongchang Kim, Hyunmin Lee, Gunhee Kim. 119-132
- "President Vows to Cut Hair": Dataset and Analysis of Creative Text Editing for Humorous Headlines. Nabil Hossain, John Krumm, Michael Gamon. 133-142
- Answer-based Adversarial Training for Generating Clarification Questions. Sudha Rao, Hal Daumé III. 143-155
- Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data. Wei Zhao, Liang Wang, Kewei Shen, Ruoyu Jia, Jingming Liu. 156-165
- Topic-Guided Variational Auto-Encoder for Text Generation. Wenlin Wang, Zhe Gan, Hongteng Xu, Ruiyi Zhang, Guoyin Wang, Dinghan Shen, Changyou Chen, Lawrence Carin. 166-177
- Implementation of a Chomsky-Schützenberger n-best parser for weighted multiple context-free grammars. Thomas Ruprecht, Tobias Denkinger. 178-191
- Phylogenic Multi-Lingual Dependency Parsing. Mathieu Dehouck, Pascal Denis. 192-203
- Discontinuous Constituency Parsing with a Stack-Free Transition System and a Dynamic Oracle. Maximin Coavoux, Shay B. Cohen. 204-217
- How Bad are PoS Tagger in Cross-Corpora Settings? Evaluating Annotation Divergence in the UD Project. Guillaume Wisniewski, François Yvon. 218-227
- CCG Parsing Algorithm with Incremental Tree Rotation. Milos Stanojevic, Mark Steedman. 228-239
- Cyclical Annealing Schedule: A Simple Approach to Mitigating KL Vanishing. Hao Fu, Chunyuan Li, Xiaodong Liu, Jianfeng Gao, Asli Çelikyilmaz, Lawrence Carin. 240-250
- Recurrent models and lower bounds for projective syntactic decoding. Natalie Schluter. 251-260
- Evaluating Composition Models for Verb Phrase Elliptical Sentence Embeddings. Gijs Wijnholds, Mehrnoosh Sadrzadeh. 261-271
- Neural Finite-State Transducers: Beyond Rational Relations. Chu-Cheng Lin, Hao Zhu, Matthew R. Gormley, Jason Eisner. 272-283
- Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling. Prince Zizhuang Wang, William Yang Wang. 284-294
- A Study of Incorrect Paraphrases in Crowdsourced User Utterances. Mohammad-Ali Yaghoub-Zadeh-Fard, Boualem Benatallah, Moshe Chai Barukh, Shayan Zamanirad. 295-306
- ComQA: A Community-sourced Dataset for Complex Factoid Question Answering with Paraphrase Clusters. Abdalghani Abujabal, Rishiraj Saha Roy, Mohamed Yahya, Gerhard Weikum. 307-317
- FreebaseQA: A New Factoid QA Data Set Matching Trivia-Style Question-Answer Pairs with Freebase. Kelvin Jiang, Dekun Wu, Hui Jiang. 318-323
- Simple Question Answering with Subgraph Ranking and Joint-Scoring. Wenbo Zhao, Tagyoung Chung, Anuj Kumar Goyal, Angeliki Metallinou. 324-334
- Learning to Attend On Essential Terms: An Enhanced Retriever-Reader Model for Open-domain Question Answering. Jianmo Ni, Chenguang Zhu, Weizhu Chen, Julian McAuley. 335-344
- UHop: An Unrestricted-Hop Relation Extraction Framework for Knowledge-Based Question Answering. Zi-Yuan Chen, Chih-Hung Chang, Yi Pei Chen, Jijnasa Nayak, Lun-Wei Ku. 345-356
- BAG: Bi-directional Attention Entity Graph Convolutional Network for Multi-hop Reasoning Question Answering. Yu Cao, Meng Fang, Dacheng Tao. 357-362
- Vector of Locally-Aggregated Word Embeddings (VLAWE): A Novel Document-level Representation. Radu-Tudor Ionescu, Andrei M. Butnaru. 363-369
- Multi-task Learning for Multi-modal Emotion Recognition and Sentiment Analysis. Md. Shad Akhtar, Dushyant Singh Chauhan, Deepanway Ghosal, Soujanya Poria, Asif Ekbal, Pushpak Bhattacharyya. 370-379
- Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence. Chi Sun, Luyao Huang, Xipeng Qiu. 380-385
- A Variational Approach to Weakly Supervised Document-Level Multi-Aspect Sentiment Classification. Ziqian Zeng, Wenxuan Zhou, Xin Liu, Yangqiu Song. 386-396
- HiGRU: Hierarchical Gated Recurrent Units for Utterance-Level Emotion Recognition. Wenxiang Jiao, Haiqin Yang, Irwin King, Michael R. Lyu. 397-406
- Learning Interpretable Negation Rules via Weak Supervision at Document Level: A Reinforcement Learning Approach. Nicolas Pröllochs, Stefan Feuerriegel, Dirk Neumann. 407-413
- Simplified Neural Unsupervised Domain Adaptation. Timothy Miller. 414-419
- Learning Bilingual Sentiment-Specific Word Embeddings without Cross-lingual Supervision. Yanlin Feng, Xiaojun Wan. 420-429
- ReWE: Regressing Word Embeddings for Regularization of Neural Machine Translation Systems. Inigo Jauregi Unanue, Ehsan Zare Borzeshi, Nazanin Esmaili, Massimo Piccardi. 430-436
- Lost in Machine Translation: A Method to Reduce Meaning Loss. Reuben Cohn-Gordon, Noah Goodman. 437-441
- Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation. Xing Niu, Weijia Xu, Marine Carpuat. 442-448
- Code-Switching for Enhancing NMT with Pre-Specified Translation. Kai Song, Yue Zhang, Heng Yu, Weihua Luo, Kun Wang, Min Zhang. 449-459
- Aligning Vector-spaces with Noisy Supervised Lexicon. Noa Yehezkel Lubin, Jacob Goldberger, Yoav Goldberg. 460-465
- Understanding and Improving Hidden Representations for Neural Machine Translation. Guanlin Li, Lemao Liu, Xintong Li, Conghui Zhu, Tiejun Zhao, Shuming Shi. 466-477
- Content Differences in Syntactic and Semantic Representation. Daniel Hershcovich, Omri Abend, Ari Rappoport. 478-488
- Attentive Mimicking: Better Word Embeddings by Attending to Informative Contexts. Timo Schick, Hinrich Schütze. 489-494
- Evaluating Style Transfer for Text. Remi Mir, Bjarke Felbo, Nick Obradovich, Iyad Rahwan. 495-504
- Big BiRD: A Large, Fine-Grained, Bigram Relatedness Dataset for Examining Semantic Composition. Shima Asaadi, Saif Mohammad, Svetlana Kiritchenko. 505-516
- Outlier Detection for Improved Data Quality and Diversity in Dialog Systems. Stefan Larson, Anish Mahendran, Andrew Lee, Jonathan K. Kummerfeld, Parker Hill, Michael A. Laurenzano, Johann Hauswald, Lingjia Tang, Jason Mars. 517-527
- Asking the Right Question: Inferring Advice-Seeking Intentions from Personal Narratives. Liye Fu, Jonathan P. Chang, Cristian Danescu-Niculescu-Mizil. 528-541
- Seeing Things from a Different Angle: Discovering Diverse Perspectives about Claims. Sihao Chen, Daniel Khashabi, Wenpeng Yin, Chris Callison-Burch, Dan Roth. 542-557
- IMHO Fine-Tuning Improves Claim Detection. Tuhin Chakrabarty, Christopher Hidey, Kathy McKeown. 558-563
- Joint Multiple Intent Detection and Slot Labeling for Goal-Oriented Dialog. Rashmi Gangadharaiah, Balakrishnan Narayanaswamy. 564-569
- CITE: A Corpus of Image-Text Discourse Relations. Malihe Alikhani, Sreyasi Nag Chowdhury, Gerard de Melo, Matthew Stone. 570-575
- Improving Dialogue State Tracking by Discerning the Relevant Context. Sanuj Sharma, Prafulla Kumar Choubey, Ruihong Huang. 576-581
- CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog. Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach. 582-595
- Learning Outside the Box: Discourse-level Features Improve Metaphor Identification. Jesse Mu, Helen Yannakoudakis, Ekaterina Shutova. 596-601
- Detection of Abusive Language: the Problem of Biased Datasets. Michael Wiegand, Josef Ruppenhofer, Thomas Kleinbauer. 602-608
- Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them. Hila Gonen, Yoav Goldberg. 609-614
- Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings. Thomas Manzini, Yao Chong Lim, Alan W. Black, Yulia Tsvetkov. 615-621
- On Measuring Social Biases in Sentence Encoders. Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman, Rachel Rudinger. 622-628
- Gender Bias in Contextualized Word Embeddings. Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente Ordonez, Kai-Wei Chang. 629-634
- Combining Sentiment Lexica with a Multi-View Variational Autoencoder. Alexander Hoyle, Lawrence Wolf-Sonkin, Hanna M. Wallach, Ryan Cotterell, Isabelle Augenstein. 635-640
- Enhancing Opinion Role Labeling with Semantic-Aware Word Representations from Semantic Role Labeling. Meishan Zhang, Peili Liang, Guohong Fu. 641-646
- Frowning Frodo, Wincing Leia, and a Seriously Great Friendship: Learning to Classify Emotional Relationships of Fictional Characters. Evgeny Kim, Roman Klinger. 647-653
- Generalizing Unmasking for Short Texts. Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast. 654-659
- Adversarial Training for Satire Detection: Controlling for Confounding Variables. Robert McHardy, Heike Adel, Roman Klinger. 660-665
- Keyphrase Generation: A Text Summarization Struggle. Erion Çano, Ondrej Bojar. 666-672
- SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression. Christos Baziotis, Ion Androutsopoulos, Ioannis Konstas, Alexandros Potamianos. 673-681
- Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation. Ori Shapira, David Gabay, Yang Gao, Hadar Ronen, Ramakanth Pasunuru, Mohit Bansal, Yael Amsterdamer, Ido Dagan. 682-687
- Serial Recall Effects in Neural Language Modeling. Hassan Hajipoor, Hadi Amiri, Maseud Rahgozar, Farhad Oroumchian. 688-694
- Fast Concept Mention Grouping for Concept Map-based Multi-Document Summarization. Tobias Falke, Iryna Gurevych. 695-700
- Syntax-aware Neural Semantic Role Labeling with Supertags. Jungo Kasai, Dan Friedman, Robert Frank, Dragomir R. Radev, Owen Rambow. 701-709
- Left-to-Right Dependency Parsing with Pointer Networks. Daniel Fernández-González, Carlos Gómez-Rodríguez. 710-716
- Viable Dependency Parsing as Sequence Labeling. Michalina Strzyz, David Vilares, Carlos Gómez-Rodríguez. 717-723
- Pooled Contextualized Embeddings for Named Entity Recognition. Alan Akbik, Tanja Bergmann, Roland Vollgraf. 724-728
- Better Modeling of Incomplete Annotations for Named Entity Recognition. Zhanming Jie, Pengjun Xie, Wei Lu, Ruixue Ding, Linlin Li. 729-734
- Event Detection without Triggers. Shulin Liu, Yang Li, Feng Zhang, Tao Yang, Xinpeng Zhou. 735-744
- Sub-event detection from twitter streams as a sequence labeling problem. Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder. 745-750
- GraphIE: A Graph-Based Framework for Information Extraction. Yujie Qian, Enrico Santus, Zhijing Jin, Jiang Guo, Regina Barzilay. 751-761
- OpenKI: Integrating Open Information Extraction and Knowledge Bases with Relation Inference. Dongxu Zhang, Subhabrata Mukherjee, Colin Lockard, Luna Dong, Andrew McCallum. 762-772
- Imposing Label-Relational Inductive Bias for Extremely Fine-Grained Entity Typing. Wenhan Xiong, Jiawei Wu, Deren Lei, Mo Yu, Shiyu Chang, Xiaoxiao Guo, William Yang Wang. 773-784
- Improving Event Coreference Resolution by Learning Argument Compatibility from Unlabeled Data. Yin Jou Huang, Jing Lu, Sadao Kurohashi, Vincent Ng. 785-795
- Sentence Embedding Alignment for Lifelong Relation Extraction. Hong Wang, Wenhan Xiong, Mo Yu, Xiaoxiao Guo, Shiyu Chang, William Yang Wang. 796-806
- Description-Based Zero-shot Fine-Grained Entity Typing. Rasha Obeidat, Xiaoli Z. Fern, Hamed Shahbazi, Prasad Tadepalli. 807-814
- Adversarial Decomposition of Text Representation. Alexey Romanov, Anna Rumshisky, Anna Rogers, David Donahue. 815-825
- PoMo: Generating Entity-Specific Post-Modifiers in Context. Jun Seok Kang, Robert L. Logan IV, Zewei Chu, Yang Chen, Dheeru Dua, Kevin Gimpel, Sameer Singh, Niranjan Balasubramanian. 826-838
- Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting. J. Edward Hu, Huda Khayrallah, Ryan Culkin, Patrick Xia, Tongfei Chen, Matt Post, Benjamin Van Durme. 839-850
- Courteously Yours: Inducing courteous behavior in Customer Care responses using Reinforced Pointer Generator Network. Hitesh Golchha, Mauajama Firdaus, Asif Ekbal, Pushpak Bhattacharyya. 851-860
- How to Avoid Sentences Spelling Boring? Towards a Neural Approach to Unsupervised Metaphor Generation. Zhiwei Yu, Xiaojun Wan. 861-871
- Incorporating Context and External Knowledge for Pronoun Coreference Resolution. Hongming Zhang, Yan Song, Yangqiu Song. 872-881
- Unsupervised Deep Structured Semantic Models for Commonsense Reasoning. Shuohang Wang, Sheng Zhang, Yelong Shen, Xiaodong Liu, Jingjing Liu, Jianfeng Gao, Jing Jiang. 882-891
- Recovering dropped pronouns in Chinese conversations via modeling their referents. Jingxuan Yang, Jianzhuo Tong, Si Li, Sheng Gao, Jun Guo, Nianwen Xue. 892-901
- The problem with probabilistic DAG automata for semantic graphs. Ieva Vasiljeva, Sorcha Gilroy, Adam Lopez. 902-911
- A Systematic Study of Leveraging Subword Information for Learning Word Representations. Yi Zhu, Ivan Vulic, Anna Korhonen. 912-932
- Better Word Embeddings by Disentangling Contextual n-Gram Information. Prakhar Gupta, Matteo Pagliardini, Martin Jaggi. 933-939
- Integration of Knowledge Graph Embedding Into Topic Modeling with Hierarchical Dirichlet Process. Dingcheng Li, Siamak Zamani, Jingyuan Zhang, Ping Li. 940-950
- Correlation Coefficients and Semantic Textual Similarity. Vitalii Zhelezniak, Aleksandar Savkov, April Shen, Nils Y. Hammerla. 951-962
- Generating Token-Level Explanations for Natural Language Inference. James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Arpit Mittal. 963-969
- Strong Baselines for Complex Word Identification across Multiple Languages. Pierre Finnimore, Elisabeth Fritzsch, Daniel King, Alison Sneyd, Aneeq-ur Rehman, Fernando Alva-Manchego, Andreas Vlachos. 970-977
- Adaptive Convolution for Multi-Relational Learning. Xiaotian Jiang, Quan Wang, Bin Wang. 978-987
- Graph Pattern Entity Ranking Model for Knowledge Graph Completion. Takuma Ebisu, Ryutaro Ichise. 988-997
- Adversarial Training for Weakly Supervised Event Detection. Xiaozhi Wang, Xu Han, Zhiyuan Liu, Maosong Sun, Peng Li. 998-1008
- A Submodular Feature-Aware Framework for Label Subset Selection in Extreme Classification Problems. Elham J. Barezi, Ian D. Wood, Pascale Fung, Hamid R. Rabiee. 1009-1018
- Relation Extraction with Temporal Reasoning Based on Memory Augmented Distant Supervision. Jianhao Yan, Lin He, Ruqin Huang, Jian Li, Ying Liu. 1019-1030
- Integrating Semantic Knowledge to Tackle Zero-shot Text Classification. Jingqing Zhang, Piyawat Lertvittayakumjorn, Yike Guo. 1031-1040
- Word-Node2Vec: Improving Word Embedding with Document-Level Non-Local Word Co-occurrences. Procheta Sen, Debasis Ganguly, Gareth J. F. Jones. 1041-1051
- Cross-Topic Distributional Semantic Representations Via Unsupervised Mappings. Eleftheria Briakou, Nikos Athanasiou, Alexandros Potamianos. 1052-1061
- What just happened? Evaluating retrofitted distributional word vectors. Dmetri Hayes. 1062-1072
- Linguistic Knowledge and Transferability of Contextual Representations. Nelson F. Liu, Matt Gardner, Yonatan Belinkov, Matthew E. Peters, Noah A. Smith. 1073-1094
- Mutual Information Maximization for Simple and Accurate Part-Of-Speech Induction. Karl Stratos. 1095-1104
- Unsupervised Recurrent Neural Network Grammars. Yoon Kim, Alexander M. Rush, Lei Yu, Adhiguna Kuncoro, Chris Dyer, Gábor Melis. 1105-1117
- Cooperative Learning of Disjoint Syntax and Semantics. Serhii Havrylov, Germán Kruszewski, Armand Joulin. 1118-1128
- Unsupervised Latent Tree Induction with Deep Inside-Outside Recursive Auto-Encoders. Andrew Drozdov, Patrick Verga, Mohit Yadav, Mohit Iyyer, Andrew McCallum. 1129-1141
- Knowledge-Augmented Language Model and Its Application to Unsupervised Named-Entity Recognition. Angli Liu, Jingfei Du, Veselin Stoyanov. 1142-1150
- Syntax-Enhanced Neural Machine Translation with Syntax-Aware Word Representations. Meishan Zhang, Zhenghua Li, Guohong Fu, Min Zhang. 1151-1161
- Competence-based Curriculum Learning for Neural Machine Translation. Emmanouil Antonios Platanios, Otilia Stretcu, Graham Neubig, Barnabás Póczos, Tom M. Mitchell. 1162-1172
- Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation. Jiawei Wu, Xin Wang, William Yang Wang. 1173-1183
- Consistency by Agreement in Zero-Shot Neural Machine Translation. Maruan Al-Shedivat, Ankur P. Parikh. 1184-1197
- Modeling Recurrence for Transformer. Jie Hao, Xing Wang, Baosong Yang, Longyue Wang, Jinfeng Zhang, Zhaopeng Tu. 1198-1207
- Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models. Tiancheng Zhao, Kaige Xie, Maxine Eskénazi. 1208-1218
- Skeleton-to-Response: Dialogue Generation Guided by Retrieval Memory. Deng Cai, Yan Wang, Wei Bi, Zhaopeng Tu, Xiaojiang Liu, Wai Lam, Shuming Shi. 1219-1228
- Jointly Optimizing Diversity and Relevance in Neural Response Generation. Xiang Gao, Sungjin Lee, Yizhe Zhang, Chris Brockett, Michel Galley, Jianfeng Gao, Bill Dolan. 1229-1238
- Disentangling Language and Knowledge in Task-Oriented Dialogs. Dinesh Raghu, Nikhil Gupta, Mausam. 1239-1255
- Tensorized Self-Attention: Efficiently Modeling Pairwise and Global Dependencies Together. Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang. 1256-1266
- WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations. Mohammad Taher Pilehvar, José Camacho-Collados. 1267-1273
- Does My Rebuttal Matter? Insights from a Major NLP Conference. Yang Gao, Steffen Eger, Ilia Kuznetsov, Iryna Gurevych, Yusuke Miyao. 1274-1290
- Casting Light on Invisible Cities: Computationally Engaging with Literary Criticism. Shufan Wang, Mohit Iyyer. 1291-1297
- PAWS: Paraphrase Adversaries from Word Scrambling. Yuan Zhang, Jason Baldridge, Luheng He. 1298-1308
- Cross-Corpora Evaluation and Analysis of Grammatical Error Correction Models - Is Single-Corpus Evaluation Enough? Masato Mita, Tomoya Mizumoto, Masahiro Kaneko, Ryo Nagata, Kentaro Inui. 1309-1314
- Star-Transformer. Qipeng Guo, Xipeng Qiu, Pengfei Liu, Yunfan Shao, Xiangyang Xue, Zheng Zhang. 1315-1325
- Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation. Tasnim Mohiuddin, Thanh Tung Nguyen, Shafiq R. Joty. 1326-1336
- From legal to technical concept: Towards an automated classification of German political Twitter postings as criminal offenses. Frederike Zufall, Tobias Horsmann, Torsten Zesch. 1337-1347
- Joint Multi-Label Attention Networks for Social Text Annotation. Hang Dong, Wei Wang, Kaizhu Huang, Frans Coenen. 1348-1354
- Multi-Channel Convolutional Neural Network for Twitter Emotion and Sentiment Recognition. Jumayel Islam, Robert E. Mercer, Lu Xiao. 1355-1365
- Detecting Cybersecurity Events from Noisy Short Text. Semih Yagcioglu, Mehmet Saygin Seyfioglu, Begum Citamak, Batuhan Bardak, Seren Guldamlasioglu, Azmi Yuksel, Emin Islam Tatli. 1366-1372
- White-to-Black: Efficient Distillation of Black-Box Adversarial Attacks. Yotam Gil, Yoav Chai, Or Gorodissky, Jonathan Berant. 1373-1379
- Analyzing the Perceived Severity of Cybersecurity Threats Reported on Social Media. Shi Zong, Alan Ritter, Graham Mueller, Evan Wright. 1380-1390
- Fake News Detection using Deep Markov Random Fields. Duc-Minh Nguyen, Tien Huu Do, A. Robert Calderbank, Nikos Deligiannis. 1391-1400
- Issue Framing in Online Discussion Fora. Mareike Hartmann, Tallulah Jansen, Isabelle Augenstein, Anders Søgaard. 1401-1407
- Vector of Locally Aggregated Embeddings for Text Representation. Hadi Amiri, Mitra Mohtarami. 1408-1414
- Predicting the Type and Target of Offensive Posts in Social Media. Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar. 1415-1420
- Biomedical Event Extraction based on Knowledge-driven Tree-LSTM. Diya Li, Lifu Huang, Heng Ji, Jiawei Han. 1421-1430
- Detecting cognitive impairments by agreeing on interpretations of linguistic features. Zining Zhu, Jekaterina Novikova, Frank Rudzicz. 1431-1441
- Relation Extraction using Explicit Context Conditioning. Gaurav Singh, Parminder Bhatia. 1442-1447
- Conversation Model Fine-Tuning for Classifying Client Utterances in Counseling Dialogues. Sungjoon Park, Donghyun Kim, Alice Oh. 1448-1459
- Using Similarity Measures to Select Pretraining Data for NER. Xiang Dai, Sarvnaz Karimi, Ben Hachey, Cécile Paris. 1460-1470
- Predicting Annotation Difficulty to Improve Task Routing and Model Performance for Biomedical Information Extraction. Yinfei Yang, Oshin Agarwal, Chris Tar, Byron C. Wallace, Ani Nenkova. 1471-1480
- Detecting Depression in Social Media using Fine-Grained Emotions. Mario Ezra Aragón, Adrián Pastor López-Monroy, Luis Carlos González-Gurrola, Manuel Montes-y-Gómez. 1481-1486
- A Silver Standard Corpus of Human Phenotype-Gene Relations. Diana Sousa, Andre Lamurias, Francisco M. Couto. 1487-1492
- Improving Lemmatization of Non-Standard Languages with Joint Learning. Enrique Manjavacas, Ákos Kádár, Mike Kestemont. 1493-1503
- One Size Does Not Fit All: Comparing NMT Representations of Different Granularities. Nadir Durrani, Fahim Dalvi, Hassan Sajjad, Yonatan Belinkov, Preslav Nakov. 1504-1516
- A Simple Joint Model for Improved Contextual Neural Lemmatization. Chaitanya Malaviya, Shijie Wu, Ryan Cotterell. 1517-1528
- A Probabilistic Generative Model of Linguistic Typology. Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell, Isabelle Augenstein. 1529-1540
- Quantifying the morphosyntactic content of Brown Clusters. Manuel R. Ciosici, Leon Derczynski, Ira Assent. 1541-1550
- Analyzing Bayesian Crosslingual Transfer in Topic Models. Shudong Hao, Michael J. Paul. 1551-1565
- Recursive Subtree Composition in LSTM-Based Dependency Parsing. Miryam de Lhoneux, Miguel Ballesteros, Joakim Nivre. 1566-1576
- Cross-lingual CCG Induction. Kilian Evang. 1577-1587
- Density Matching for Bilingual Word Embedding. Chunting Zhou, Xuezhe Ma, Di Wang, Graham Neubig. 1588-1598
- Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing. Tal Schuster, Ori Ram, Regina Barzilay, Amir Globerson. 1599-1613
- Early Rumour Detection. Kaimin Zhou, Chang Shu, Binyang Li, Jey Han Lau. 1614-1623
- Microblog Hashtag Generation via Encoding Conversation Contexts. Yue Wang, Jing Li, Irwin King, Michael R. Lyu, Shuming Shi. 1624-1633
- Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems. Steffen Eger, Gözde Gül Sahin, Andreas Rücklé, Ji Ung Lee, Claudia Schulz, Mohsen Mesgar, Krishnkant Swarnkar, Edwin Simpson, Iryna Gurevych. 1634-1647
- Something's Brewing! Early Prediction of Controversy-causing Posts from Discussion Features. Jack Hessel, Lillian Lee. 1648-1659
- No Permanent Friends or Enemies: Tracking Relationships between Nations from News. Xiaochuang Han, Eunsol Choi, Chenhao Tan. 1660-1676
- Improving Human Text Comprehension through Semi-Markov CRF-based Neural Section Title Generation. Sebastian Gehrmann, Steven Layne, Franck Dernoncourt. 1677-1688
- Unifying Human and Statistical Evaluation for Natural Language Generation. Tatsunori B. Hashimoto, Hugh Zhang, Percy Liang. 1689-1701
- What makes a good conversation? How controllable attributes affect human judgments. Abigail See, Stephen Roller, Douwe Kiela, Jason Weston. 1702-1723
- An Empirical Investigation of Global and Local Normalization for Recurrent Neural Sequence Models Using a Continuous Relaxation to Beam Search. Kartik Goyal, Chris Dyer, Taylor Berg-Kirkpatrick. 1724-1733
- Pun Generation with Surprise. He He, Nanyun Peng, Percy Liang. 1734-1744
- Single Document Summarization as Tree Induction. Yang Liu, Ivan Titov, Mirella Lapata. 1745-1755
- Fixed That for You: Generating Contrastive Claims with Semantic Edits. Christopher Hidey, Kathy McKeown. 1756-1767
- Box of Lies: Multimodal Deception Detection in Dialogues. Felix Soldner, Verónica Pérez-Rosas, Rada Mihalcea. 1768-1777
- A Crowdsourced Corpus of Multiple Judgments and Disagreement on Anaphoric Interpretation. Massimo Poesio, Jon Chamberlain, Silviu Paun, Juntao Yu, Alexandra Uma, Udo Kruschwitz. 1778-1789
- A Streamlined Method for Sourcing Discourse-level Argumentation Annotations from the Crowd. Tristan Miller, Maria Sukhareva, Iryna Gurevych. 1790-1796
- Unsupervised Dialog Structure Learning. Weiyan Shi, Tiancheng Zhao, Zhou Yu. 1797-1807
- Modeling Document-level Causal Structures for Event Causal Relation Identification. Lei Gao, Prafulla Kumar Choubey, Ruihong Huang. 1808-1817
- Hierarchical User and Item Representation with Three-Tier Attention for Recommendation. Chuhan Wu, Fangzhao Wu, Junxin Liu, Yongfeng Huang. 1818-1826
- Text Similarity Estimation Based on Word Embeddings and Matrix Norms for Targeted Marketing. Tim vor der Brück, Marc Pouly. 1827-1836
- Glocal: Incorporating Global Information in Local Convolution for Keyphrase Extraction. Animesh Prasad, Min-Yen Kan. 1837-1846
- A Study of Latent Structured Prediction Approaches to Passage Reranking. Iryna Haponchyk, Alessandro Moschitti. 1847-1857
- Combining Distant and Direct Supervision for Neural Relation Extraction. Iz Beltagy, Kyle Lo, Waleed Ammar. 1858-1867
- Tweet Stance Detection Using an Attention based Neural Ensemble Model. Umme Aymun Siddiqua, Abu Nowshed Chy, Masaki Aono. 1868-1873
- Word Embedding-Based Automatic MT Evaluation Metric using Word Position Information. Hiroshi Echizen-ya, Kenji Araki, Eduard H. Hovy. 1874-1883
- Learning to Stop in Structured Prediction for Neural Machine Translation. Mingbo Ma, Renjie Zheng, Liang Huang. 1884-1889
- Learning Unsupervised Multilingual Word Embeddings with Incremental Multilingual Hubs. Geert Heyman, Bregt Verreet, Ivan Vulic, Marie-Francine Moens. 1890-1902
- Curriculum Learning for Domain Adaptation in Neural Machine Translation. Xuan Zhang, Pamela Shapiro, Gaurav Kumar, Paul McNamee, Marine Carpuat, Kevin Duh. 1903-1915
- Improving Robustness of Machine Translation with Synthetic Noise. Vaibhav, Sumeet Singh, Craig Stewart, Graham Neubig. 1916-1920
- Non-Parametric Adaptation for Neural Machine Translation. Ankur Bapna, Orhan Firat. 1921-1931
- Online Distilling from Checkpoints for Neural Machine Translation. Hao-Ran Wei, Shujian Huang, Ran Wang, Xin-Yu Dai, Jiajun Chen. 1932-1941
- Value-based Search in Execution Space for Mapping Instructions to Programs. Dor Muhlgay, Jonathan Herzig, Jonathan Berant. 1942-1954
- VQD: Visual Query Detection In Natural Scenes. Manoj Acharya, Karan Jariwala, Christopher Kanan. 1955-1961
- Improving Natural Language Interaction with Robots Using Advice. Nikhil Mehta, Dan Goldwasser. 1962-1967
- Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models. Victor Prokhorov, Mohammad Taher Pilehvar, Nigel Collier. 1968-1976
- Shifting the Baseline: Single Modality Performance on Visual Navigation & QA. Jesse Thomason, Daniel Gordon, Yonatan Bisk. 1977-1983
- ExCL: Extractive Clip Localization Using Natural Language Descriptions. Soham Ghosh, Anuva Agarwal, Zarana Parekh, Alexander G. Hauptmann. 1984-1990
- Detecting dementia in Mandarin Chinese using transfer learning from a parallel corpus. Bai Li, Yi-Te Hsu, Frank Rudzicz. 1991-1997
- Cross-lingual Visual Verb Sense Disambiguation. Spandana Gella, Desmond Elliott, Frank Keller. 1998-2004
- Subword-Level Language Identification for Intra-Word Code-Switching. Manuel Mager, Özlem Çetinoglu, Katharina Kann. 2005-2011
- MuST-C: a Multilingual Speech Translation Corpus. Mattia Antonino Di Gangi, Roldano Cattoni, Luisa Bentivogli, Matteo Negri, Marco Turchi. 2012-2017
- Contextualization of Morphological Inflection. Ekaterina Vylomova, Ryan Cotterell, Trevor Cohn, Timothy Baldwin, Jason Eisner. 2018-2024
- A Robust Abstractive System for Cross-Lingual Summarization. Jessica Ouyang, Boya Song, Kathy McKeown. 2025-2031
- Improving Neural Machine Translation with Neural Syntactic Distance. Chunpeng Ma, Akihiro Tamura, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao. 2032-2037
- Measuring Immediate Adaptation Performance for Neural Machine Translation. Patrick Simianer, Joern Wuebker, John DeNero. 2038-2046
- Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation. Weijia Xu, Xing Niu, Marine Carpuat. 2047-2053
- Reinforcement Learning based Curriculum Optimization for Neural Machine Translation. Gaurav Kumar, George Foster, Colin Cherry, Maxim Krikun. 2054-2061
- Overcoming Catastrophic Forgetting During Domain Adaptation of Neural Machine Translation. Brian Thompson, Jeremy Gwinnup, Huda Khayrallah, Kevin Duh, Philipp Koehn. 2062-2068
- Short-Term Meaning Shift: A Distributional Exploration. Marco Del Tredici, Raquel Fernández, Gemma Boleda. 2069-2075
- Detecting Derogatory Compounds - An Unsupervised Approach. Michael Wiegand, Maximilian Wolf, Josef Ruppenhofer. 2076-2081
- Personalized Neural Embeddings for Collaborative Filtering with Text. Guangneng Hu. 2082-2088
- An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models. Alexandra Chronopoulou, Christos Baziotis, Alexandros Potamianos. 2089-2095
- Incorporating Emoji Descriptions Improves Tweet Classification. Abhishek Singh, Eduardo Blanco, Wei Jin. 2096-2101
- Modeling Personal Biases in Language Use by Inducing Personalized Word Embeddings. Daisuke Oba, Naoki Yoshinaga, Shoetsu Sato, Satoshi Akasaki, Masashi Toyoda. 2102-2108
- Multi-Task Ordinal Regression for Jointly Predicting the Trustworthiness and the Leading Political Ideology of News Media. Ramy Baly, Georgi Karadzhov, Abdelrhman Saleh, James Glass, Preslav Nakov. 2109-2116
- Joint Detection and Location of English Puns. Yanyan Zou, Wei Lu. 2117-2123
- Harry Potter and the Action Prediction Challenge from Natural Language. David Vilares, Carlos Gómez-Rodríguez. 2124-2130
- Argument Mining for Understanding Peer Reviews. Xinyu Hua, Mitko Nikolov, Nikhil Badugu, Lu Wang. 2131-2137
- An annotated dataset of literary entities. David Bamman, Sejal Popat, Sheng Shen. 2138-2144
- Abusive Language Detection with Graph Convolutional Networks. Pushkar Mishra, Marco Del Tredici, Helen Yannakoudakis, Ekaterina Shutova. 2145-2150
- On the Importance of Distinguishing Word Meaning Representations: A Case Study on Reverse Dictionary Mapping. Mohammad Taher Pilehvar. 2151-2156
- Factorising AMR generation through syntax. Kris Cao, Stephen Clark. 2157-2163
- A Crowdsourced Frame Disambiguation Corpus with Ambiguity. Anca Dumitrache, Lora Aroyo, Chris Welty. 2164-2170
- Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets. Nelson F. Liu, Roy Schwartz, Noah A. Smith. 2171-2179
- A Capsule Network-based Embedding Model for Knowledge Graph Completion and Search Personalization. Dai Quoc Nguyen, Thanh Vu, Tu Dinh Nguyen, Dat Quoc Nguyen, Dinh Q. Phung. 2180-2189
- Partial Or Complete, That's The Question. Qiang Ning, Hangfeng He, Chuchu Fan, Dan Roth. 2190-2200
- Sequential Attention with Keyword Mask Model for Community-based Question Answering. Jianxin Yang, Wenge Rong, Libin Shi, Zhang Xiong. 2201-2211
- Simple Attention-Based Representation Learning for Ranking Short Social Media Posts. Peng Shi, Jinfeng Rao, Jimmy Lin. 2212-2217
- AttentiveChecker: A Bi-Directional Attention Flow Mechanism for Fact Verification. Santosh Tokala, Vishal G, Avirup Saha, Niloy Ganguly. 2218-2222
- Practical, Efficient, and Customizable Active Learning for Named Entity Recognition in the Digital Humanities. Alexander Erdmann, David Joseph Wrisley, Benjamin Allen, Christopher Brown, Sophie Cohen-Bodénès, Micha Elsner, Yukun Feng, Brian Joseph, Béatrice Joyeux-Prunel, Marie-Catherine de Marneffe. 2223-2234
- Doc2hash: Learning Discrete Latent variables for Documents Retrieval. Yifei Zhang, Hao Zhu. 2235-2240
- Evaluating Text GANs as Language Models. Guy Tevet, Gavriel Habib, Vered Shwartz, Jonathan Berant. 2241-2247
- Latent Code and Text-based Generative Adversarial Networks for Soft-text GenerationMd. Akmal Haidar, Mehdi Rezagholizadeh, Alan Do-Omri, Ahmad Rashid. 2248-2258 [doi]
- Neural Text Generation from Rich Semantic RepresentationsValerie Hajdik, Jan Buys, Michael Wayne Goodman, Emily M. Bender. 2259-2266 [doi]
- Step-by-Step: Separating Planning from Realization in Neural Data-to-Text GenerationAmit Moryossef, Yoav Goldberg, Ido Dagan. 2267-2277 [doi]
- Evaluating Rewards for Question Generation ModelsTom Hosking, Sebastian Riedel. 2278-2283 [doi]
- Text Generation from Knowledge Graphs with Graph TransformersRik Koncel-Kedziorski, Dhanush Bekal, Yi Luan, Mirella Lapata, Hannaneh Hajishirzi. 2284-2293 [doi]
- Open Information Extraction from Question-Answer PairsNikita Bhutani, Yoshihiko Suhara, Wang Chiew Tan, Alon Y. Halevy, H. V. Jagadish. 2294-2305 [doi]
- Question Answering by Reasoning Across Documents with Graph Convolutional NetworksNicola De Cao, Wilker Aziz, Ivan Titov. 2306-2317 [doi]
- A Qualitative Comparison of CoQA, SQuAD 2.0 and QuACMark Yatskar. 2318-2323 [doi]
- BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment AnalysisHu Xu, Bing Liu, Lei Shu, Philip S. Yu. 2324-2335 [doi]
- Old is Gold: Linguistic Driven Approach for Entity and Relation Linking of Short TextAhmad Sakor, Isaiah Onando Mulang, Kuldeep Singh, Saeedeh Shekarpour, Maria-Esther Vidal, Jens Lehmann, Sören Auer. 2336-2346 [doi]
- Be Consistent! Improving Procedural Text Comprehension using Label ConsistencyXinya Du, Bhavana Dalvi, Niket Tandon, Antoine Bosselut, Wen-tau Yih, Peter Clark, Claire Cardie. 2347-2356 [doi]
- MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based FormalismsAida Amini, Saadia Gabriel, Shanchuan Lin, Rik Koncel-Kedziorski, Yejin Choi, Hannaneh Hajishirzi. 2357-2367 [doi]
- DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over ParagraphsDheeru Dua, Yizhong Wang, Pradeep Dasigi, Gabriel Stanovsky, Sameer Singh, Matt Gardner. 2368-2378 [doi]
- An Encoding Strategy Based Word-Character LSTM for Chinese NERWei Liu, Tongge Xu, QingHua Xu, Jiayu Song, Yueran Zu. 2379-2389 [doi]
- Highly Effective Arabic Diacritization using Sequence to Sequence ModelingHamdy Mubarak, Ahmed Abdelali, Hassan Sajjad, Younes Samih, Kareem Darwish. 2390-2395 [doi]
- SC-LSTM: Learning Task-Specific Representations in Multi-Task Learning for Sequence LabelingPeng Lu, Ting Bai, Philippe Langlais. 2396-2406 [doi]
- Learning to Denoise Distantly-Labeled Data for Entity TypingYasumasa Onoe, Greg Durrett. 2407-2417 [doi]
- A Simple and Robust Approach to Detecting Subject-Verb Agreement ErrorsSimon Flachs, Ophélie Lacroix, Marek Rei, Helen Yannakoudakis, Anders Søgaard. 2418-2427 [doi]
- A Grounded Unsupervised Universal Part-of-Speech Tagger for Low-Resource LanguagesRonald Cardenas, Ying Lin, Heng Ji, Jonathan May. 2428-2439 [doi]
- On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency ParsingWasi Uddin Ahmad, Zhisong Zhang, Xuezhe Ma, Eduard H. Hovy, Kai-Wei Chang, Nanyun Peng. 2440-2452 [doi]
- A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence RepresentationsMingda Chen, Qingming Tang, Sam Wiseman, Kevin Gimpel. 2453-2464 [doi]
- Self-Discriminative Learning for Unsupervised Document EmbeddingHong-You Chen, Chin-Hua Hu, Leila Wehbe, Shou-de Lin. 2465-2474 [doi]
- Adaptive Convolution for Text ClassificationByung-Ju Choi, Jun-Hyung Park, SangKeun Lee. 2475-2485 [doi]
- Zero-Shot Cross-Lingual Opinion Target ExtractionSoufian Jebbara, Philipp Cimiano. 2486-2495 [doi]
- Adversarial Category Alignment Network for Cross-domain Sentiment ClassificationXiaoye Qu, Zhikang Zou, Yu Cheng, Yang Yang, Pan Zhou. 2496-2508 [doi]
- Target-oriented Opinion Words Extraction with Target-fused Neural Sequence LabelingZhifang Fan, Zhen Wu, Xin-Yu Dai, Shujian Huang, Jiajun Chen. 2509-2518 [doi]
- Abstractive Summarization of Reddit Posts with Multi-level Memory NetworksByeongchang Kim, Hyunwoo Kim, Gunhee Kim. 2519-2531 [doi]
- Automatic learner summary assessment for reading comprehensionMenglin Xia, Ekaterina Kochmar, Ted Briscoe. 2532-2542 [doi]
- Data-efficient Neural Text Compression with Interactive LearningAvinesh P. V. S, Christian M. Meyer. 2543-2554 [doi]
- Text Generation with Exemplar-based Adaptive DecodingHao Peng, Ankur P. Parikh, Manaal Faruqui, Bhuwan Dhingra, Dipanjan Das. 2555-2565 [doi]
- Guiding Extractive Summarization with Question-Answering RewardsKristjan Arumae, Fei Liu. 2566-2577 [doi]
- Beyond task success: A closer look at jointly learning to see, ask, and GuessWhatRavi Shekhar, Aashish Venkatesh, Tim Baumgärtner, Elia Bruni, Barbara Plank, Raffaella Bernardi, Raquel Fernández. 2578-2587 [doi]
- The World in My Mind: Visual Dialog with Adversarial Multi-modal Feature EncodingYiqun Yao, Jiaming Xu, Bo Xu. 2588-2598 [doi]
- Strong and Simple Baselines for Multimodal Utterance EmbeddingsPaul Pu Liang, Yao Chong Lim, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov, Louis-Philippe Morency. 2599-2609 [doi]
- Learning to Navigate Unseen Environments: Back Translation with Environmental DropoutHao Tan, Licheng Yu, Mohit Bansal. 2610-2621 [doi]
- Towards Content Transfer through Grounded Text GenerationShrimai Prabhumoye, Chris Quirk, Michel Galley. 2622-2632 [doi]
- Improving Machine Reading Comprehension with General Reading StrategiesKai Sun, Dian Yu, Dong Yu, Claire Cardie. 2633-2643 [doi]
- Multi-task Learning with Sample Re-weighting for Machine Reading ComprehensionYichong Xu, Xiaodong Liu, Yelong Shen, Jingjing Liu, Jianfeng Gao. 2644-2655 [doi]
- Semantically-Aligned Equation Generation for Solving and Reasoning Math Word ProblemsTing-Rui Chiang, Yun-Nung Chen. 2656-2668 [doi]
- Iterative Search for Weakly Supervised Semantic ParsingPradeep Dasigi, Matt Gardner, Shikhar Murty, Luke Zettlemoyer, Eduard H. Hovy. 2669-2680 [doi]
- Alignment over Heterogeneous Embeddings for Question AnsweringVikas Yadav, Steven Bethard, Mihai Surdeanu. 2681-2691 [doi]
- Bridging the Gap: Attending to Discontinuity in Identification of Multiword ExpressionsOmid Rohanian, Shiva Taslimipoor, Samaneh Kouchaki, Le An Ha, Ruslan Mitkov. 2692-2698 [doi]
- Incorporating Word Attention into Character-Based Word SegmentationShohei Higashiyama, Masao Utiyama, Eiichiro Sumita, Masao Ideuchi, Yoshiaki Oida, Yohei Sakamoto, Isaac Okada. 2699-2709 [doi]
- VCWE: Visual Character-Enhanced Word EmbeddingsChi Sun, Xipeng Qiu, Xuanjing Huang. 2710-2719 [doi]
- Subword Encoding in Lattice LSTM for Chinese Word SegmentationJie Yang, Yue Zhang, Shuailong Liang. 2720-2725 [doi]
- Improving Cross-Domain Chinese Word Segmentation with Word EmbeddingsYuxiao Ye, Weikang Li, Yue Zhang, Likun Qiu, Jian Sun. 2726-2735 [doi]
- Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech TaggingApostolos Kemos, Heike Adel, Hinrich Schütze. 2736-2743 [doi]
- Shrinking Japanese Morphological Analyzers With Neural Networks and Semi-supervised LearningArseny Tolmachev, Daisuke Kawahara, Sadao Kurohashi. 2744-2755 [doi]
- Neural Constituency Parsing of Speech TranscriptsParia Jamshid Lou, Yufei Wang, Mark Johnson. 2756-2765 [doi]
- Acoustic-to-Word Models with Conversational Context InformationSuyoun Kim, Florian Metze. 2766-2771 [doi]
- A Dynamic Speaker Model for Conversational InteractionsHao Cheng, Hao Fang, Mari Ostendorf. 2772-2785 [doi]
- Fluent Translations from Disfluent Speech in End-to-End Speech TranslationElizabeth Salesky, Matthias Sperber, Alexander H. Waibel. 2786-2792 [doi]
- Relation Classification Using Segment-Level Attention-based CNN and Dependency-based RNNVan-Hien Tran, Van-Thuy Phi, Hiroyuki Shindo, Yuji Matsumoto. 2793-2798 [doi]
- Document-Level Event Factuality Identification via Adversarial Neural NetworkZhong Qian, Peifeng Li, Qiaoming Zhu, Guodong Zhou. 2799-2809 [doi]
- Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag AttentionsZhi-Xiu Ye, Zhen-Hua Ling. 2810-2819 [doi]
- Ranking-Based Autoencoder for Extreme Multi-label ClassificationBingyu Wang, Li Chen, Wei Sun, Kechen Qin, Kefeng Li, Hui Zhou. 2820-2830 [doi]
- Posterior-regularized REINFORCE for Instance Selection in Distant SupervisionQi Zhang, Siliang Tang, Xiang Ren, Fei Wu, Shiliang Pu, Yueting Zhuang. 2831-2835 [doi]
- Scalable Collapsed Inference for High-Dimensional Topic ModelsRashidul Islam, James R. Foulds. 2836-2845 [doi]
- An Integrated Approach for Keyphrase Generation via Exploring the Power of Retrieval and ExtractionWang Chen, Hou Pong Chan, Piji Li, Lidong Bing, Irwin King. 2846-2856 [doi]
- Predicting Malware Attributes from Cybersecurity TextsArpita Roy, Youngja Park, Shimei Pan. 2857-2861 [doi]
- Improving Distantly-supervised Entity Typing with Compact Latent Space ClusteringBo Chen, Xiaotao Gu, Yufeng Hu, Siliang Tang, Guoping Hu, Yueting Zhuang, Xiang Ren. 2862-2872 [doi]
- Modelling Instance-Level Annotator Reliability for Natural Language Labelling TasksMaolin Li, Arvid Fahlström Myrman, Tingting Mu, Sophia Ananiadou. 2873-2883 [doi]
- Review-Driven Multi-Label Music Style Classification by Exploiting Style CorrelationsGuangxiang Zhao, Jingjing Xu, Qi Zeng, Xuancheng Ren, Xu Sun. 2884-2891 [doi]
- Fact Discovery from Knowledge Base via Facet DecompositionZihao Fu, Yankai Lin, Zhiyuan Liu, Wai Lam. 2892-2901 [doi]
- A Richer-but-Smarter Shortest Dependency Path with Attentive Augmentation for Relation ExtractionDuy-Cat Can, Hoang-Quynh Le, Quang-Thuy Ha, Nigel Collier. 2902-2912 [doi]
- Bidirectional Attentive Memory Networks for Question Answering over Knowledge BasesYu Chen, Lingfei Wu, Mohammed J. Zaki. 2913-2923 [doi]
- BoolQ: Exploring the Surprising Difficulty of Natural Yes/No QuestionsChristopher Clark, Kenton Lee, Ming-Wei Chang, Tom Kwiatkowski, Michael Collins, Kristina Toutanova. 2924-2936 [doi]
- Enhancing Key-Value Memory Neural Networks for Knowledge Based Question AnsweringKun Xu, Yuxuan Lai, Yansong Feng, Zhiguo Wang. 2937-2947 [doi]
- Repurposing Entailment for Multi-Hop Question Answering TasksHarsh Trivedi, Heeyoung Kwon, Tushar Khot, Ashish Sabharwal, Niranjan Balasubramanian. 2948-2958 [doi]
- GenderQuant: Quantifying Mention-Level GenderednessAnanya, Nitya Parthasarthi, Sameer Singh. 2959-2969 [doi]
- Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass ShootingsDorottya Demszky, Nikhil Garg, Rob Voigt, James Zou, Jesse Shapiro, Matthew Gentzkow, Dan Jurafsky. 2970-3005 [doi]
- Learning to Decipher Hate SymbolsJing Qian, Mai ElSherief, Elizabeth M. Belding, William Yang Wang. 3006-3015 [doi]
- Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution NetworksNingyu Zhang, Shumin Deng, Zhanlin Sun, Guanying Wang, Xi Chen, Wei Zhang, Huajun Chen. 3016-3025 [doi]
- GAN Driven Semi-distant Supervision for Relation ExtractionPengshuai Li, Xinsong Zhang, Weijia Jia, Hai Zhao. 3026-3035 [doi]
- A general framework for information extraction using dynamic span graphsYi Luan, Dave Wadden, Luheng He, Amy Shah, Mari Ostendorf, Hannaneh Hajishirzi. 3036-3046 [doi]
- OpenCeres: When Open Information Extraction Meets the Semi-Structured WebColin Lockard, Prashant Shiralkar, Xin Luna Dong. 3047-3056 [doi]
- Structured Minimally Supervised Learning for Neural Relation ExtractionFan Bai, Alan Ritter. 3057-3069 [doi]
- Neural Machine Translation of Text from Non-Native SpeakersAntonios Anastasopoulos, Alison Lui, Toan Q. Nguyen, David Chiang. 3070-3080 [doi]
- Improving Domain Adaptation Translation with Domain Invariant and Specific InformationShuhao Gu, Yang Feng, Qun Liu. 3081-3091 [doi]
- Selective Attention for Context-aware Neural Machine TranslationSameen Maruf, André F. T. Martins, Gholamreza Haffari. 3092-3102 [doi]
- On Evaluation of Adversarial Perturbations for Sequence-to-Sequence ModelsPaul Michel, Xian Li, Graham Neubig, Juan Miguel Pino. 3103-3114 [doi]
- Accelerated Reinforcement Learning for Sentence Generation by Vocabulary PredictionKazuma Hashimoto, Yoshimasa Tsuruoka. 3115-3125 [doi]
- Mitigating Uncertainty in Document ClassificationXuchao Zhang, Fanglan Chen, Chang-Tien Lu, Naren Ramakrishnan. 3126-3136 [doi]
- Complexity-Weighted Loss and Diverse Reranking for Sentence SimplificationReno Kriz, João Sedoc, Marianna Apidianaki, Carolina Zheng, Gaurav Kumar, Eleni Miltsakaki, Chris Callison-Burch. 3137-3147 [doi]
- Predicting Helpful Posts in Open-Ended Discussion Forums: A Neural ArchitectureKishaloy Halder, Min-Yen Kan, Kazunari Sugiyama. 3148-3157 [doi]
- Text Classification with Few Examples using Controlled GeneralizationAbhijit Mahabal, Jason Baldridge, Burcu Karagol-Ayan, Vincent Perot, Dan Roth. 3158-3167 [doi]
- Reinforcement Learning Based Text Style Transfer without Parallel Training CorpusHongyu Gong, Suma Bhat, Lingfei Wu, Jinjun Xiong, Wen-mei W. Hwu. 3168-3180 [doi]
- Adapting RNN Sequence Prediction Model to Multi-label Set PredictionKechen Qin, Cheng Li, Virgil Pavlu, Javed A. Aslam. 3181-3190 [doi]
- Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla LanguageSudipta Saha Shubha, Nafis Sadeq, Shafayat Ahmed, Md. Nahidul Islam, Muhammad Abdullah Adnan, Md. Yasin Ali Khan, Mohammad Zuberul Islam. 3191-3200 [doi]
- Connecting Language and Knowledge with Heterogeneous Representations for Neural Relation ExtractionPeng Xu, Denilson Barbosa. 3201-3206 [doi]
- Segmentation-free compositional n-gram embeddingGeewook Kim, Kazuki Fukui, Hidetoshi Shimodaira. 3207-3215 [doi]
- Exploiting Noisy Data in Distant Supervision Relation ClassificationKaijia Yang, Liang He, Xinyu Dai, Shujian Huang, Jiajun Chen. 3216-3225 [doi]
- Misspelling Oblivious Word EmbeddingsBora Edizel, Aleksandra Piktus, Piotr Bojanowski, Rui Ferreira, Edouard Grave, Fabrizio Silvestri. 3226-3234 [doi]
- Learning Relational Representations by Analogy using Hierarchical Siamese NetworksGaetano Rossiello, Alfio Gliozzo, Robert Farrell, Nicolas R. Fauceglia, Michael Glass. 3235-3245 [doi]
- An Effective Label Noise Model for DNN Text ClassificationIshan Jindal, Daniel Pressel, Brian Lester, Matthew S. Nokleby. 3246-3256 [doi]
- Understanding Learning Dynamics Of Language Models with SVCCANaomi Saphra, Adam Lopez. 3257-3267 [doi]
- Using Large Corpus N-gram Statistics to Improve Recurrent Neural Language ModelsYiben Yang, Ji-Ping Wang, Doug Downey. 3268-3273 [doi]
- Continual Learning for Sentence Representations Using ConceptorsTianlin Liu, Lyle Ungar, João Sedoc. 3274-3279 [doi]
- Relation Discovery with Out-of-Relation Knowledge Base as SupervisionYan Liang, Xin Liu, Jianwen Zhang, Yangqiu Song. 3280-3290 [doi]
- Corpora Generation for Grammatical Error CorrectionJared Lichtarge, Chris Alberti, Shankar Kumar, Noam Shazeer, Niki Parmar, Simon Tong. 3291-3301 [doi]
- Structural Supervision Improves Learning of Non-Local Grammatical DependenciesEthan Wilcox, Peng Qian, Richard Futrell, Miguel Ballesteros, Roger Levy. 3302-3312 [doi]
- Benchmarking Approximate Inference Methods for Neural Structured PredictionLifu Tu, Kevin Gimpel. 3313-3324 [doi]
- Evaluating and Enhancing the Robustness of Dialogue Systems: A Case Study on a Negotiation AgentMinhao Cheng, Wei Wei, Cho-Jui Hsieh. 3325-3335 [doi]
- Investigating Robustness and Interpretability of Link Prediction via Adversarial ModificationsPouya Pezeshkpour, Yifan Tian, Sameer Singh. 3336-3347 [doi]
- Analysis Methods in Neural Language Processing: A SurveyYonatan Belinkov, James Glass. 3348-3354 [doi]
- Transferable Neural Projection RepresentationsChinnadhurai Sankar, Sujith Ravi, Zornitsa Kozareva. 3355-3360 [doi]
- Semantic Role Labeling with Associated Memory NetworkChaoyu Guan, Yuhao Cheng, Hai Zhao. 3361-3371 [doi]
- Better, Faster, Stronger Sequence Tagging Constituent ParsersDavid Vilares, Mostafa Abdou, Anders Søgaard. 3372-3383 [doi]
- CAN-NER: Convolutional Attention Network for Chinese Named Entity RecognitionYuying Zhu, Guoxin Wang. 3384-3393 [doi]
- Decomposed Local Models for Coordinate Structure ParsingHiroki Teranishi, Hiroyuki Shindo, Yuji Matsumoto. 3394-3403 [doi]
- Multi-Task Learning for Japanese Predicate Argument Structure AnalysisHikaru Omori, Mamoru Komachi. 3404-3414 [doi]
- Domain adaptation for part-of-speech tagging of noisy user-generated textLuisa März, Dietrich Trautmann, Benjamin Roth. 3415-3420 [doi]
- Neural Chinese Address ParsingHao Li, Wei Lu, Pengjun Xie, Linlin Li. 3421-3431 [doi]
- Learning Hierarchical Discourse-level Structure for Fake News DetectionHamid Karimi, Jiliang Tang. 3432-3442 [doi]
- DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence FusionMor Geva, Eric Malmi, Idan Szpektor, Jonathan Berant. 3443-3455 [doi]
- Linguistically-Informed Specificity and Semantic Plausibility for Dialogue GenerationWei-Jen Ko, Greg Durrett, Junyi Jessy Li. 3456-3466 [doi]
- Learning to Describe Unknown Phrases with Local and Global ContextsShonosuke Ishiwatari, Hiroaki Hayashi, Naoki Yoshinaga, Graham Neubig, Shoetsu Sato, Masashi Toyoda, Masaru Kitsuregawa. 3467-3476 [doi]
- Mining Discourse Markers for Unsupervised Sentence Representation LearningDamien Sileo, Tim Van de Cruys, Camille Pradel, Philippe Muller. 3477-3486 [doi]
- How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary SelectionWenhu Chen, Yu Su, Yilin Shen, ZhiYu Chen, Xifeng Yan, William Yang Wang. 3487-3497 [doi]
- Subword-based Compact Reconstruction of Word EmbeddingsShota Sasaki, Jun Suzuki, Kentaro Inui. 3498-3508 [doi]
- Bayesian Learning for Neural Dependency ParsingEhsan Shareghi, Yingzhen Li, Yi Zhu, Roi Reichart, Anna Korhonen. 3509-3519 [doi]
- AutoSeM: Automatic Task Selection and Mixing in Multi-Task LearningHan Guo, Ramakanth Pasunuru, Mohit Bansal. 3520-3531 [doi]
- Studying the Inductive Biases of RNNs with Synthetic Variations of Natural LanguagesShauli Ravfogel, Yoav Goldberg, Tal Linzen. 3532-3542 [doi]
- Attention is not ExplanationSarthak Jain, Byron C. Wallace. 3543-3556 [doi]
- Playing Text-Adventure Games with Graph-Based Deep Reinforcement LearningPrithviraj Ammanabrolu, Mark Riedl. 3557-3565 [doi]
- Information Aggregation for Multi-Head Attention with Routing-by-AgreementJian Li, Baosong Yang, Zi-Yi Dou, Xing Wang, Michael R. Lyu, Zhaopeng Tu. 3566-3575 [doi]
- Context Dependent Semantic Parsing over Temporally Structured DataCharles Chen, Razvan C. Bunescu. 3576-3585 [doi]
- Structural Scaffolds for Citation Intent Classification in Scientific PublicationsArman Cohan, Waleed Ammar, Madeleine van Zuylen, Field Cady. 3586-3596 [doi]
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence InferenceMandar Joshi, Eunsol Choi, Omer Levy, Daniel S. Weld, Luke Zettlemoyer. 3597-3608 [doi]
- Submodular Optimization-based Diverse Paraphrasing and its Effectiveness in Data AugmentationAshutosh Kumar, Satwik Bhattamishra, Manik Bhandari, Partha Talukdar. 3609-3619 [doi]
- Let's Make Your Request More Persuasive: Modeling Persuasive Strategies via Semi-Supervised Neural Nets on Crowdfunding PlatformsDiyi Yang, Jiaao Chen, Zichao Yang, Dan Jurafsky, Eduard H. Hovy. 3620-3630 [doi]
- Recursive Routing Networks: Learning to Compose Modules for Language UnderstandingIgnacio Cases, Clemens Rosenbaum, Matthew Riemer, Atticus Geiger, Tim Klinger, Alex Tamkin, Olivia Li, Sandhini Agarwal, Joshua D. Greene, Dan Jurafsky, Christopher Potts, Lauri Karttunen. 3631-3648 [doi]
- Structural Neural Encoders for AMR-to-text GenerationMarco Damonte, Shay B. Cohen. 3649-3658 [doi]
- Multilingual prediction of Alzheimer's disease through domain adaptation and concept-based language modellingKathleen C. Fraser, Nicklas Linz, Bai Li, Kristina Lundholm Fors, Frank Rudzicz, Alexandra König, Jan Alexandersson, Philippe H. Robert, Dimitrios Kokkinakis. 3659-3670 [doi]
- Ranking and Selecting Multi-Hop Knowledge Paths to Better Predict Human NeedsDebjit Paul, Anette Frank. 3671-3681 [doi]
- NLP Whack-A-Mole: Challenges in Cross-Domain Temporal Expression ExtractionAmy L. Olex, Luke Maffey, Bridget T. McInnes. 3682-3692 [doi]
- Document-Level N-ary Relation Extraction with Multiscale Representation LearningRobin Jia, Cliff Wong, Hoifung Poon. 3693-3704 [doi]
- Inferring Which Medical Treatments Work from Reports of Clinical TrialsEric Lehman, Jay DeYoung, Regina Barzilay, Byron C. Wallace. 3705-3717 [doi]
- Decay-Function-Free Time-Aware Attention to Context and Speaker Indicator for Spoken Language UnderstandingJonggu Kim, Jong-Hyeok Lee. 3718-3726 [doi]
- Dialogue Act Classification with Context-Aware Self-AttentionVipul Raheja, Joel R. Tetreault. 3727-3733 [doi]
- Affect-Driven Dialog GenerationPierre Colombo, Wojciech Witon, Ashutosh Modi, James Kennedy, Mubbasir Kapadia. 3734-3743 [doi]
- Multi-Level Memory for Task Oriented DialogsRevanth Reddy, Danish Contractor, Dinesh Raghu, Sachindra Joshi. 3744-3754 [doi]
- Topic Spotting using Hierarchical Networks with Self AttentionPooja Chitkara, Ashutosh Modi, Pravalika Avvaru, Sepehr Janghorbani, Mubbasir Kapadia. 3755-3761 [doi]
- Top-Down Structurally-Constrained Neural Response Generation with Lexicalized Probabilistic Context-Free GrammarWenchao Du, Alan W. Black. 3762-3771 [doi]
- What do Entity-Centric Models Learn? Insights from Entity Linking in Multi-Party DialogueLaura Aina, Carina Silberer, Ionut-Teodor Sorodoc, Matthijs Westera, Gemma Boleda. 3772-3783 [doi]
- Continuous Learning for Large-scale Personalized Domain ClassificationHan Li, Jihwan Lee, Sidharth Mudgal, Ruhi Sarikaya, Young-Bum Kim. 3784-3794 [doi]
- Cross-lingual Transfer Learning for Multilingual Task Oriented DialogSebastian Schuster, Sonal Gupta, Rushin Shah, Mike Lewis. 3795-3805 [doi]
- Evaluating Coherence in Dialogue Systems using EntailmentNouha Dziri, Ehsan Kamalloo, Kory Wallace Mathewson, Osmar R. Zaïane. 3806-3812 [doi]
- On Knowledge distillation from complex networks for response predictionSiddhartha Arora, Mitesh M. Khapra, Harish G. Ramaswamy. 3813-3822 [doi]
- Cross-lingual Multi-Level Adversarial Transfer to Enhance Low-Resource Name TaggingLifu Huang, Heng Ji, Jonathan May. 3823-3833 [doi]
- Unsupervised Extraction of Partial Translations for Neural Machine TranslationBenjamin Marie, Atsushi Fujita. 3834-3844 [doi]
- Low-Resource Syntactic Transfer with Unsupervised Source ReorderingMohammad Sadegh Rasooli, Michael Collins. 3845-3856 [doi]
- Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved TrainingTasnim Mohiuddin, Shafiq R. Joty. 3857-3867 [doi]
- Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource LanguagesV. Rudra Murthy, Anoop Kunchukuttan, Pushpak Bhattacharyya. 3868-3873 [doi]
- Massively Multilingual Neural Machine TranslationRoee Aharoni, Melvin Johnson, Orhan Firat. 3874-3884 [doi]
- A Large-Scale Comparison of Historical Text Normalization SystemsMarcel Bollmann. 3885-3898 [doi]
- Combining Discourse Markers and Cross-lingual Embeddings for Synonym-Antonym ClassificationMichael Roth, Shyam Upadhyay. 3899-3905 [doi]
- Context-Aware Cross-Lingual MappingHanan Aldarmaki, Mona T. Diab. 3906-3911 [doi]
- Polyglot Contextual Representations Improve Crosslingual TransferPhoebe Mulcaire, Jungo Kasai, Noah A. Smith. 3912-3918 [doi]
- Typological Features for Multilingual Delexicalised Dependency ParsingManon Scholivet, Franck Dary, Alexis Nasr, Benoît Favre, Carlos Ramisch. 3919-3930 [doi]
- Recommendations for Datasets for Source Code SummarizationAlexander LeClair, Collin McMillan. 3931-3937 [doi]
- Question Answering as an Automatic Evaluation Metric for News Article SummarizationMatan Eyal, Tal Baumel, Michael Elhadad. 3938-3948 [doi]
- Understanding the Behaviour of Neural Abstractive Summarizers using Contrastive ExamplesKrtin Kumar, Jackie Chi Kit Cheung. 3949-3954 [doi]
- Jointly Extracting and Compressing Documents with Summary State RepresentationsAfonso Mendes, Shashi Narayan, Sebastião Miranda, Zita Marinho, André F. T. Martins, Shay B. Cohen. 3955-3966 [doi]
- News Article Teaser Tweets and How to Generate ThemSanjeev Kumar Karn, Mark Buckley, Ulli Waltinger, Hinrich Schütze. 3967-3977 [doi]
- Cross-referencing Using Fine-grained Topic ModelingJeffrey Lund, Piper Armstrong, Wilson Fearn, Stephen Cowley, Emily Hales, Kevin D. Seppi. 3978-3987 [doi]
- Conversation Initiation by Diverse News Contents IntroductionSatoshi Akasaki, Nobuhiro Kaji. 3988-3998 [doi]
- Positional Encoding to Control Output Sequence LengthSho Takase, Naoaki Okazaki. 3999-4004 [doi]
- The Lower The Simpler: Simplifying Hierarchical Recurrent ModelsChao Wang, Hui Jiang. 4005-4009 [doi]
- Using Natural Language Relations between Answer Choices for Machine ComprehensionRajkumar Pujari, Dan Goldwasser. 4010-4015 [doi]
- Saliency Learning: Teaching the Model Where to Pay AttentionReza Ghaeini, Xiaoli Z. Fern, Hamed Shahbazi, Prasad Tadepalli. 4016-4025 [doi]
- Understanding Dataset Design Choices for Multi-hop ReasoningJifan Chen, Greg Durrett. 4026-4032 [doi]
- Neural Grammatical Error Correction with Finite State TransducersFelix Stahlberg, Christopher Bryant, Bill Byrne. 4033-4039 [doi]
- Convolutional Self-Attention NetworksBaosong Yang, Longyue Wang, Derek F. Wong, Lidia S. Chao, Zhaopeng Tu. 4040-4045 [doi]
- Rethinking Complex Neural Network Architectures for Document ClassificationAshutosh Adhikari, Achyudh Ram, Raphael Tang, Jimmy Lin. 4046-4051 [doi]
- Pre-trained language model representations for language generationSergey Edunov, Alexei Baevski, Michael Auli. 4052-4059 [doi]
- Pragmatically Informative Text GenerationSheng Shen, Daniel Fried, Jacob Andreas, Dan Klein. 4060-4067 [doi]
- Stochastic Wasserstein Autoencoder for Probabilistic Sentence GenerationHareesh Bahuleyan, Lili Mou, Hao Zhou, Olga Vechtomova. 4068-4076 [doi]
- Benchmarking Hierarchical Script KnowledgeYonatan Bisk, Jan Buys, Karl Pichotta, Yejin Choi. 4077-4085 [doi]
- A large-scale study of the effects of word frequency and predictability in naturalistic readingCory Shain. 4086-4094 [doi]
- Augmenting word2vec with latent Dirichlet allocation within a clinical applicationAkshay Budhkar, Frank Rudzicz. 4095-4099 [doi]
- On the Idiosyncrasies of the Mandarin Chinese Classifier SystemShijia Liu, Hongyuan Mei, Adina Williams, Ryan Cotterell. 4100-4106 [doi]
- Joint Learning of Pre-Trained and Random Units for Domain Adaptation in Part-of-Speech TaggingSara Meftah, Youssef Tamaazousti, Nasredine Semmar, Hassane Essafi, Fatiha Sadat. 4107-4112 [doi]
- Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling BaselinesEhsan Shareghi, Daniela Gerz, Ivan Vulic, Anna Korhonen. 4113-4118 [doi]
- Training Data Augmentation for Context-Sensitive Neural Lemmatizer Using Inflection Tables and Raw TextToms Bergmanis, Sharon Goldwater. 4119-4128 [doi]
- A Structural Probe for Finding Syntax in Word RepresentationsJohn Hewitt, Christopher D. Manning. 4129-4138 [doi]
- CNM: An Interpretable Complex-valued Network for MatchingQiuchi Li, Benyou Wang, Massimo Melucci. 4139-4148 [doi]
- CommonsenseQA: A Question Answering Challenge Targeting Commonsense KnowledgeAlon Talmor, Jonathan Herzig, Nicholas Lourie, Jonathan Berant. 4149-4158 [doi]
- Probing the Need for Visual Context in Multimodal Machine TranslationOzan Caglayan, Pranava Madhyastha, Lucia Specia, Loïc Barrault. 4159-4170 [doi]
- BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingJacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova. 4171-4186 [doi]
- What's in a Name? Reducing Bias in Bios without Access to Protected AttributesAlexey Romanov, Maria De-Arteaga, Hanna M. Wallach, Jennifer T. Chayes, Christian Borgs, Alexandra Chouldechova, Sahin Cem Geyik, Krishnaram Kenthapadi, Anna Rumshisky, Adam Kalai. 4187-4195 [doi]