Abstract is missing.
- Frontmatter [doi]
- Fully Quantized Transformer for Machine TranslationGabriele Prato, Ella Charlaix, Mehdi Rezagholizadeh. 1-14 [doi]
- Summarizing Chinese Medical Answer with Graph Convolution Networks and Question-focused Dual AttentionNingyu Zhang, Shumin Deng, Juan Li, Xi Chen, Wei Zhang, Huajun Chen. 15-24 [doi]
- Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking ConversationsPeng Qi 0003, Yuhao Zhang, Christopher D. Manning. 25-40 [doi]
- Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example SentencesBoon Peng Yap, Andrew Koh, Eng Siong Chng. 41-46 [doi]
- Adversarial Text Generation via Sequence Contrast DiscriminationKe Wang, Xiaojun Wan 0001. 47-53 [doi]
- GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment AnalysisHuaishao Luo, Lei Ji, Tianrui Li, Daxin Jiang, Nan Duan. 54-64 [doi]
- Reducing Sentiment Bias in Language Models via Counterfactual EvaluationPo-Sen Huang, Huan Zhang, Ray Jiang, Robert Stanforth, Johannes Welbl, Jack Rae, Vishal Maini, Dani Yogatama, Pushmeet Kohli. 65-83 [doi]
- Improving Text Understanding via Deep Syntax-Semantics CommunicationHao Fei 0001, Yafeng Ren, Donghong Ji. 84-93 [doi]
- GRUEN for Evaluating Linguistic Quality of Generated TextWanzheng Zhu, Suma Bhat. 94-108 [doi]
- A Greedy Bit-flip Training Algorithm for Binarized Knowledge Graph EmbeddingsKatsuhiko Hayashi, Koki Kishimoto, Masashi Shimbo. 109-114 [doi]
- Difference-aware Knowledge Selection for Knowledge-grounded Conversation GenerationChujie Zheng, Yunbo Cao, Daxin Jiang, Minlie Huang. 115-125 [doi]
- An Attentive Recurrent Model for Incremental Prediction of Sentence-final VerbsWenyan Li, Alvin Grissom II, Jordan L. Boyd-Graber. 126-136 [doi]
- Transformer-GCRF: Recovering Chinese Dropped Pronouns with General Conditional Random FieldsJingxuan Yang, Kerui Xu, Jun Xu, Si Li, Sheng Gao, Jun Guo 0002, Ji-Rong Wen, Nianwen Xue. 137-147 [doi]
- Neural Speed Reading AuditedAnders Søgaard. 148-153 [doi]
- Converting the Point of View of Message Spoken to Virtual AssistantsGunhee Lee, Vera Zu, Sai Srujana Buddi, Dennis Liang, Purva Kulkarni, Jack G. M. Fitzgerald. 154-163 [doi]
- Robustness to Modification with Shared Words in Paraphrase IdentificationZhouxing Shi, Minlie Huang. 164-171 [doi]
- Few-shot Natural Language Generation for Task-Oriented DialogBaolin Peng, Chenguang Zhu, Chunyuan Li, Xiujun Li, Jinchao Li, Michael Zeng, Jianfeng Gao. 172-182 [doi]
- Mimic and Conquer: Heterogeneous Tree Structure Distillation for Syntactic NLPHao Fei 0001, Yafeng Ren, Donghong Ji. 183-193 [doi]
- A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain PretrainingChenguang Zhu, Ruochen Xu, Michael Zeng, Xuedong Huang 0001. 194-203 [doi]
- Active Testing: An Unbiased Evaluation Method for Distantly Supervised Relation ExtractionPengshuai Li, Xinsong Zhang, Weijia Jia, Wei Zhao. 204-211 [doi]
- Semantic Matching via Optimal Partial TransportRuiyi Zhang, Changyou Chen, Xinyuan Zhang 0001, Ke Bai, Lawrence Carin. 212-222 [doi]
- How Decoding Strategies Affect the Verifiability of Generated TextLuca Massarelli, Fabio Petroni, Aleksandra Piktus, Myle Ott, Tim Rocktäschel, Vassilis Plachouras, Fabrizio Silvestri, Sebastian Riedel 0001. 223-235 [doi]
- Minimize Exposure Bias of Seq2Seq Models in Joint Entity and Relation ExtractionRanran Haoran Zhang, Qianying Liu, Aysa Xuemo Fan, Heng Ji, Daojian Zeng, Fei Cheng, Daisuke Kawahara, Sadao Kurohashi. 236-246 [doi]
- Gradient-based Analysis of NLP Models is ManipulableJunlin Wang, Jens Tuyls, Eric Wallace, Sameer Singh 0001. 247-258 [doi]
- Pretrain-KGE: Learning Knowledge Representation from Pretrained Language ModelsZhiyuan Zhang, Xiaoqian Liu, Yi Zhang, Qi Su 0001, Xu Sun 0001, Bin He. 259-266 [doi]
- A Self-Refinement Strategy for Noise Reduction in Grammatical Error CorrectionMasato Mita, Shun Kiyono, Masahiro Kaneko, Jun Suzuki, Kentaro Inui. 267-280 [doi]
- Understanding tables with intermediate pre-trainingJulian Martin Eisenschlos, Syrine Krichene, Thomas Müller. 281-296 [doi]
- Enhance Robustness of Sequence Labelling with Masked Adversarial TrainingLuoxin Chen, Xinyue Liu, Weitong Ruan, Jianhua Lu. 297-302 [doi]
- Multilingual Argument Mining: Datasets and AnalysisOrith Toledo-Ronen, Matan Orbach, Yonatan Bilu, Artem Spector, Noam Slonim. 303-317 [doi]
- Improving Grammatical Error Correction with Machine Translation PairsWangchunshu Zhou, Tao Ge, Chang Mu, Ke Xu 0001, Furu Wei, Ming Zhou 0001. 318-328 [doi]
- Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical DirectivesWon-Ik Cho, Young Ki Moon, Sangwhan Moon, Seok Min Kim, Nam Soo Kim. 329-339 [doi]
- The RELX Dataset and Matching the Multilingual Blanks for Cross-lingual Relation ClassificationAbdullatif Köksal, Arzucan Özgür. 340-350 [doi]
- Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text GenerationGiuseppe Russo, Nora Hollenstein, Claudiu Cristian Musat, Ce Zhang. 351-366 [doi]
- Open-Ended Visual Question Answering by Multi-Modal Domain AdaptationYiming Xu, Lin Chen, Zhongwei Cheng, Lixin Duan, Jiebo Luo. 367-376 [doi]
- Dual Low-Rank Multimodal FusionTao Jin, Siyu Huang, Yingming Li, Zhongfei Zhang. 377-387 [doi]
- Contextual Modulation for Relation-Level Metaphor IdentificationOmnia Zayed, John P. McCrae, Paul Buitelaar. 388-406 [doi]
- Context-aware Stand-alone Neural Spelling CorrectionXiangci Li, Hairong Liu, Liang Huang 0001. 407-414 [doi]
- A Novel Workflow for Accurately and Efficiently Crowdsourcing Predicate Senses and Argument LabelsYouxuan Jiang, Huaiyu Zhu 0001, Jonathan K. Kummerfeld, Yunyao Li 0001, Walter S. Lasecki. 415-421 [doi]
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language UnderstandingJiyeon Ham, Yo Joong Choe, Kyubyong Park, Ilji Choi, Hyungjoon Soh. 422-430 [doi]
- Dialogue Generation on Infrequent Sentence Functions via Structured Meta-LearningYifan Gao 0001, Piji Li, Wei Bi, Xiaojiang Liu, Michael R. Lyu, Irwin King. 431-440 [doi]
- Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer LearningZhaojiang Lin, Andrea Madotto, Pascale Fung. 441-459 [doi]
- A Fully Hyperbolic Neural Model for Hierarchical Multi-class ClassificationFederico López, Michael Strube 0001. 460-475 [doi]
- Claim Check-Worthiness Detection as Positive Unlabelled LearningDustin Wright, Isabelle Augenstein. 476-488 [doi]
- ConceptBert: Concept-Aware Representation for Visual Question AnsweringFrançois Gardères, Maryam Ziaeefard, Baptiste Abeloos, Freddy Lécué. 489-498 [doi]
- Bootstrapping a Crosslingual Semantic ParserTom Sherborne, Yumo Xu, Mirella Lapata. 499-517 [doi]
- Revisiting Representation Degeneration Problem in Language ModelingZhong Zhang 0004, Chongming Gao, Cong Xu, Rui Miao, Qinli Yang, Junming Shao. 518-527 [doi]
- The workweek is the best time to start a family - A Study of GPT-2 Based Claim GenerationShai Gretz, Yonatan Bilu, Edo Cohen-Karlik, Noam Slonim. 528-544 [doi]
- Dynamic Data Selection for Curriculum Learning via Ability EstimationJohn P. Lalor, Hong Yu 0001. 545-555 [doi]
- Fixed Encoder Self-Attention Patterns in Transformer-Based Machine TranslationAlessandro Raganato, Yves Scherrer, Jörg Tiedemann. 556-568 [doi]
- ZEST: Zero-shot Learning from Text Descriptions using Textual Similarity and Visual SummarizationTzuf Paz-Argaman, Reut Tsarfaty, Gal Chechik, Yuval Atzmon. 569-579 [doi]
- Few-Shot Multi-Hop Relation Reasoning over Knowledge BasesChuxu Zhang, Lu Yu 0006, Mandana Saebi, Meng Jiang 0001, Nitesh V. Chawla. 580-585 [doi]
- Sentiment Analysis with Weighted Graph Convolutional NetworksFanyu Meng, Junlan Feng, Danping Yin, Si Chen, Min Hu. 586-595 [doi]
- PBoS: Probabilistic Bag-of-Subwords for Generalizing Word EmbeddingJinman Zhao, Shawn Zhong, Xiaomin Zhang, Yingyu Liang. 596-611 [doi]
- Interpretable Entity Representations through Large-Scale TypingYasumasa Onoe, Greg Durrett. 612-624 [doi]
- Empirical Studies of Institutional Federated Learning For Natural Language ProcessingXinghua Zhu, Jianzong Wang, Zhenhou Hong, Jing Xiao. 625-634 [doi]
- NeuReduce: Reducing Mixed Boolean-Arithmetic Expressions by Recurrent Neural NetworkWeijie Feng, Binbin Liu, Dongpeng Xu, Qilong Zheng, Yun Xu. 635-644 [doi]
- From Language to Language-ish: How Brain-Like is an LSTM's Representation of Atypical Language Stimuli?Maryam Hashemzadeh, Greta Kaufeld, Martha White, Andrea E. Martin, Alona Fyshe. 645-656 [doi]
- Revisiting Pre-Trained Models for Chinese Natural Language ProcessingYiming Cui, Wanxiang Che, Ting Liu 0001, Bing Qin 0001, Shijin Wang 0001, Guoping Hu. 657-668 [doi]
- Cascaded Semantic and Positional Self-Attention Network for Document ClassificationJuyong Jiang, Jie Zhang, Kai Zhang. 669-677 [doi]
- Toward Recognizing More Entity Types in NER: An Efficient Implementation using Only Entity LexiconsMinlong Peng, Ruotian Ma, Qi Zhang 0001, Lujun Zhao, Mengxi Wei, Changlong Sun, Xuanjing Huang. 678-688 [doi]
- From Disjoint Sets to Parallel Data to Train Seq2Seq Models for Sentiment TransferPaulo R. Cavalin, Marisa Vasconcelos, Marcelo Grave, Claudio S. Pinhanez, Victor Henrique Alves Ribeiro. 689-698 [doi]
- Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language NavigationJiannan Xiang, Xin Wang 0061, William Yang Wang. 699-707 [doi]
- Document Ranking with a Pretrained Sequence-to-Sequence ModelRodrigo Nogueira, Zhiying Jiang, Ronak Pradeep, Jimmy Lin. 708-718 [doi]
- Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity PriorZi Lin, Jeremiah Z. Liu, Zi Yang, Nan Hua, Dan Roth. 719-730 [doi]
- Rethinking Self-Attention: Towards Interpretability in Neural ParsingKhalil Mrini, Franck Dernoncourt, Quan Hung Tran, Trung Bui, Walter Chang, Ndapa Nakashole. 731-742 [doi]
- PolicyQA: A Reading Comprehension Dataset for Privacy PoliciesWasi Uddin Ahmad, Jianfeng Chi, Yuan Tian 0001, Kai-Wei Chang. 743-749 [doi]
- A Linguistic Analysis of Visually Grounded Dialogues Based on Spatial ExpressionsTakuma Udagawa, Takato Yamazaki, Akiko Aizawa. 750-765 [doi]
- Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State TrackingSu Zhu, Jieyu Li, Lu Chen 0002, Kai Yu 0004. 766-781 [doi]
- Syntactic and Semantic-driven Learning for Open Information ExtractionJialong Tang, Yaojie Lu 0001, Hongyu Lin, Xianpei Han, Le Sun 0001, Xinyan Xiao, Hua Wu 0003. 782-792 [doi]
- Group-wise Contrastive Learning for Neural Dialogue GenerationHengyi Cai, Hongshen Chen, Yonghao Song, Zhuoye Ding, Yongjun Bao, Weipeng Yan, Xiaofang Zhao. 793-802 [doi]
- E-BERT: Efficient-Yet-Effective Entity Embeddings for BERTNina Pörner, Ulli Waltinger, Hinrich Schütze. 803-818 [doi]
- A Multi-task Learning Framework for Opinion Triplet ExtractionChen Zhang, Qiuchi Li, Dawei Song 0001, Benyou Wang. 819-828 [doi]
- Event Extraction as Multi-turn Question AnsweringFayuan Li, Weihua Peng, Yuguang Chen, Quan Wang, Lu Pan, Yajuan Lyu, Yong Zhu. 829-838 [doi]
- Improving QA Generalization by Concurrent Modeling of Multiple BiasesMingzhu Wu, Nafise Sadat Moosavi, Andreas Rücklé, Iryna Gurevych. 839-853 [doi]
- Actor-Double-Critic: Incorporating Model-Based Critic for Task-Oriented Dialogue SystemsYen-Chen Wu, Bo-Hsiang Tseng, Milica Gasic. 854-863 [doi]
- Controlled Hallucinations: Learning to Generate Faithfully from Noisy DataKatja Filippova. 864-870 [doi]
- Sequential Span Classification with Neural Semi-Markov CRFs for Biomedical AbstractsKosuke Yamada, Tsutomu Hirao, Ryohei Sasano, Koichi Takeda, Masaaki Nagata. 871-877 [doi]
- Where to Submit? Helping Researchers to Choose the Right VenueKonstantin Kobs, Tobias Koopmann, Albin Zehe, David Fernes, Philipp Krop, Andreas Hotho. 878-883 [doi]
- AirConcierge: Generating Task-Oriented Dialogue via Efficient Large-Scale Knowledge RetrievalChieh-Yang Chen, Pei-Hsin Wang, Shih-Chieh Chang, Da-Cheng Juan, Wei Wei 0025, Jia-Yu Pan. 884-897 [doi]
- DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form UnderstandingZilong Wang, Mingjie Zhan, Xuebo Liu 0001, Ding Liang. 898-908 [doi]
- Pretrained Language Models for Dialogue Generation with Multiple Input SourcesYu Cao, Wei Bi, Meng Fang, Dacheng Tao. 909-917 [doi]
- A Study in Improving BLEU Reference Coverage with Diverse Automatic ParaphrasingRachel Bawden, Biao Zhang, Lisa Yankovskaya, Andre Tättar, Matt Post. 918-932 [doi]
- Cross-lingual Alignment Methods for Multilingual BERT: A Comparative StudySaurabh Kulshreshtha, José Luis Redondo García, Ching-Yun Chang. 933-942 [doi]
- Hybrid Emoji-Based Masked Language Models for Zero-Shot Abusive Language DetectionMichele Corazza, Stefano Menini, Elena Cabrio, Sara Tonelli, Serena Villata. 943-949 [doi]
- SeNsER: Learning Cross-Building Sensor Metadata TaggerYang Jiao, Jiacheng Li, Jiaman Wu, Dezhi Hong, Rajesh E. Gupta, Jingbo Shang. 950-960 [doi]
- Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech TaggingEhsan Doostmohammadi, Minoo Nassajian, Adel Rahimi. 961-971 [doi]
- Scene Graph Modification Based on Natural Language CommandsXuanli He, Quan Hung Tran, Gholamreza Haffari, Walter Chang, Zhe Lin, Trung Bui, Franck Dernoncourt, Nhan Dam. 972-990 [doi]
- LiMiT: The Literal Motion in Text DatasetIrene Manotas, Ngoc Phuoc An Vo, Vadim Sheinin. 991-1000 [doi]
- Transition-based Parsing with Stack-TransformersRamón Fernandez Astudillo, Miguel Ballesteros, Tahira Naseem, Austin Blodgett, Radu Florian. 1001-1007 [doi]
- G-DAug: Generative Data Augmentation for Commonsense ReasoningYiben Yang, Chaitanya Malaviya, Jared Fernandez, Swabha Swayamdipta, Ronan Le Bras, Ji-Ping Wang, Chandra Bhagavatula, Yejin Choi, Doug Downey. 1008-1025 [doi]
- HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual DataWenhu Chen, Hanwen Zha, ZhiYu Chen, Wenhan Xiong, Hong Wang, William Yang Wang. 1026-1036 [doi]
- PhoBERT: Pre-trained language models for VietnameseDat Quoc Nguyen, Anh Tuan Nguyen. 1037-1042 [doi]
- ESTeR: Combining Word Co-occurrences and Word Associations for Unsupervised Emotion DetectionSujatha Das Gollapalli, Polina Rozenshtein, See-Kiong Ng. 1043-1056 [doi]
- Make Templates Smarter: A Template Based Data2Text System Powered by Text Stitch ModelBingfeng Luo, Zuo Bai, Kunfeng Lai, Jianping Shen. 1057-1062 [doi]
- GCDST: A Graph-based and Copy-augmented Multi-domain Dialogue State TrackingPeng Wu, Bowei Zou, Ridong Jiang, AiTi Aw. 1063-1073 [doi]
- Incorporating Stylistic Lexical Preferences in Generative Language ModelsHrituraj Singh, Gaurav Verma, Balaji Vasan Srinivasan. 1074-1079 [doi]
- Why do you think that? Exploring faithful sentence-level rationales without supervisionMax Glockner, Ivan Habernal, Iryna Gurevych. 1080-1095 [doi]
- Semi-Supervised Learning for Video CaptioningKe Lin, Zhuoxin Gan, Liwei Wang. 1096-1106 [doi]
- 2OIE: Multilingual Open Information Extraction based on Multi-Head Attention with BERTYoungbin Ro, Yukyung Lee, Pilsung Kang 0001. 1107-1117 [doi]
- LGPSolver - Solving Logic Grid Puzzles AutomaticallyElgun Jabrayilzade, Selma Tekir. 1118-1123 [doi]
- Using the Past Knowledge to Improve Sentiment ClassificationQi Qin, Wenpeng Hu, Bing Liu. 1124-1133 [doi]
- High-order Semantic Role LabelingZuchao Li, Hai Zhao, Rui Wang, Kevin Parnow. 1134-1151 [doi]
- Undersensitivity in Neural Reading ComprehensionJohannes Welbl, Pasquale Minervini, Max Bartolo, Pontus Stenetorp, Sebastian Riedel 0001. 1152-1165 [doi]
- HyperText: Endowing FastText with Hyperbolic GeometryYudong Zhu, Di Zhou, JingHui Xiao, Xin Jiang, Xiao Chen, Qun Liu. 1166-1171 [doi]
- AutoETER: Automated Entity Type Representation with Relation-Aware Attention for Knowledge Graph EmbeddingGuanglin Niu, Bo Li, Yongfei Zhang, Shiliang Pu, Jingyang Li. 1172-1181 [doi]
- Learning Robust and Multilingual Speech RepresentationsKazuya Kawakami, Luyu Wang, Chris Dyer, Phil Blunsom, Aäron Van Den Oord. 1182-1192 [doi]
- FQuAD: French Question Answering DatasetMartin d'Hoffschmidt, Wacim Belblidia, Quentin Heinrich, Tom Brendlé, Maxime Vidal. 1193-1208 [doi]
- Semantic Matching and Aggregation Network for Few-shot Intent DetectionHoang Nguyen, Chenwei Zhang, Congying Xia, Philip S. Yu. 1209-1218 [doi]
- Quantifying the Contextualization of Word Representations with Semantic Class ProbingMengjie Zhao, Philipp Dufter, Yadollah Yaghoobzadeh, Hinrich Schütze. 1219-1234 [doi]
- Learning to Generate Clinically Coherent Chest X-Ray ReportsJustin R. Lovelace, Bobak Mortazavi. 1235-1243 [doi]
- FELIX: Flexible Text Editing Through Tagging and InsertionJonathan Mallinson, Aliaksei Severyn, Eric Malmi, Guillermo Garrido. 1244-1255 [doi]
- What Can We Do to Improve Peer Review in NLP?Anna Rogers, Isabelle Augenstein. 1256-1262 [doi]
- Unsupervised Relation Extraction from Language Models using Constrained Cloze CompletionAnkur Goswami, Akshata Bhat, Hadar Ohana, Theodoros Rekatsinas. 1263-1276 [doi]
- Biomedical Event Extraction on Graph Edge-conditioned Attention Networks with Hierarchical Knowledge GraphsKung-Hsiang Huang, Mu Yang, Nanyun Peng. 1277-1285 [doi]
- Constraint Satisfaction Driven Natural Language Generation: A Tree Search Embedded MCMC ApproachMaosen Zhang, Nan Jiang, Lei Li, Yexiang Xue. 1286-1298 [doi]
- Examining the Ordering of Rhetorical Strategies in Persuasive RequestsOmar Shaikh, Jiaao Chen, Jon Saad-Falcon, Polo Chau, Diyi Yang. 1299-1306 [doi]
- Evaluating Models' Local Decision Boundaries via Contrast SetsMatt Gardner 0001, Yoav Artzi, Victoria Basmova, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hannaneh Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh 0001, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, Ally Zhang, Ben Zhou. 1307-1323 [doi]
- Parsing with Multilingual BERT, a Small Treebank, and a Small CorpusEthan C. Chau, Lucy H. Lin, Noah A. Smith. 1324-1334 [doi]
- OptSLA: an Optimization-Based Approach for Sequential Label AggregationNasim Sabetpour, Adithya Kulkarni, Qi Li. 1335-1340 [doi]
- Optimizing Word Segmentation for Downstream TaskTatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki. 1341-1351 [doi]
- Dynamically Updating Event Representations for Temporal Relation Classification with Multi-category LearningFei Cheng, Masayuki Asahara, Ichiro Kobayashi, Sadao Kurohashi. 1352-1357 [doi]
- A Compare Aggregate Transformer for Understanding Document-grounded DialogueLongxuan Ma, Wei-Nan Zhang 0003, Runxin Sun, Ting Liu 0001. 1358-1367 [doi]
- TextHide: Tackling Data Privacy for Language Understanding TasksYangsibo Huang, Zhao Song, Danqi Chen, Kai Li, Sanjeev Arora. 1368-1382 [doi]
- Modeling Intra and Inter-modality Incongruity for Multi-Modal Sarcasm DetectionHongliang Pan, Zheng Lin, Peng Fu 0008, Yatao Qi, Weiping Wang 0005. 1383-1392 [doi]
- Investigating Transferability in Pretrained Language ModelsAlex Tamkin, Trisha Singh, Davide Giovanardi, Noah D. Goodman. 1393-1401 [doi]
- Improving Knowledge-Aware Dialogue Response Generation by Using Human-Written Prototype DialoguesSixing Wu, Ying Li 0015, Dawei Zhang 0003, Zhonghai Wu. 1402-1411 [doi]
- Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based ChatbotsJia-Chen Gu, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Xiaodan Zhu. 1412-1422 [doi]
- Privacy-Preserving News Recommendation Model LearningTao Qi, Fangzhao Wu, Chuhan Wu, Yongfeng Huang, Xing Xie 0001. 1423-1432 [doi]
- exBERT: Extending Pre-trained Models with Domain-specific Vocabulary Under Constrained Training ResourcesWen Tai, H. T. Kung 0001, Xin Dong 0009, Marcus Z. Comiter, Chang-Fu Kuo. 1433-1439 [doi]
- Balancing via Generation for Multi-Class Text Classification ImprovementNaama Tepper, Esther Goldbraich, Naama Zwerdling, George Kour, Ateret Anaby-Tavor, Boaz Carmeli. 1440-1452 [doi]
- Conditional Neural Generation using Sub-Aspect Functions for Extractive News SummarizationZhengyuan Liu, Ke Shi, Nancy F. Chen. 1453-1463 [doi]
- Research Replication Prediction Using Weakly Supervised LearningTianyi Luo, Xingyu Li, Hainan Wang, Yang Liu. 1464-1474 [doi]
- Open Domain Question Answering based on Text Enhanced Knowledge Graph with Hyperedge InfusionJiale Han, Bo Cheng 0001, Xu Wang. 1475-1481 [doi]
- Inexpensive Domain Adaptation of Pretrained Language Models: Case Studies on Biomedical NER and Covid-19 QANina Pörner, Ulli Waltinger, Hinrich Schütze. 1482-1490 [doi]
- Semantically Driven Sentence Fusion: Modeling and EvaluationEyal Ben-David, Orgad Keller, Eric Malmi, Idan Szpektor, Roi Reichart. 1491-1505 [doi]
- Pseudo-Bidirectional Decoding for Local Sequence TransductionWangchunshu Zhou, Tao Ge, Ke Xu 0001. 1506-1511 [doi]
- Predicting Responses to Psychological Questionnaires from Participants' Social Media Posts and Question Text EmbeddingsHuy Vu, Suhaib Abdurahman, Sudeep Bhatia, Lyle Ungar. 1512-1524 [doi]
- Will it Unblend?Yuval Pinter, Cassandra L. Jacobs, Jacob Eisenstein. 1525-1535 [doi]
- CodeBERT: A Pre-Trained Model for Programming and Natural LanguagesZhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin 0001, Ting Liu, Daxin Jiang, Ming Zhou. 1536-1547 [doi]
- StyleDGPT: Stylized Response Generation with Pre-trained Language ModelsZe Yang, Wei Wu 0014, Can Xu, Xinnian Liang, Jiaqi Bai, Liran Wang, Wei Wang, Zhoujun Li. 1548-1559 [doi]
- Enhancing Automated Essay Scoring Performance via Cohesion Measurement and Combination of Regression and RankingRuosong Yang, Jiannong Cao 0001, Zhiyuan Wen, Youzheng Wu, Xiaodong He 0002. 1560-1569 [doi]
- Neural Dialogue State Tracking with Temporally Expressive NetworksJunfan Chen, Richong Zhang, Yongyi Mao, Jie Xu 0007. 1570-1579 [doi]
- Inferring about fraudulent collusion risk on Brazilian public works contracts in official texts using a Bi-LSTM approachMarcos Lima, Roberta Silva, Felipe Lopes de Souza Mendes, Leonardo R. de Carvalho, Aleteia Araujo, Flavio de Barros Vidal. 1580-1588 [doi]
- Record-to-Text Generation with Style ImitationShuai Lin, Wentao Wang, Zichao Yang, Xiaodan Liang, Frank F. Xu, Eric P. Xing, Zhiting Hu. 1589-1598 [doi]
- Teaching Machine Comprehension with Compositional ExplanationsQinyuan Ye, Xiao Huang, Elizabeth Boschee, Xiang Ren. 1599-1615 [doi]
- A Knowledge-driven Approach to Classifying Object and Attribute Coreferences in Opinion MiningJiahua Chen, Shuai Wang 0020, Sahisnu Mazumder, Bing Liu 0001. 1616-1626 [doi]
- SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized EmbeddingsMasoud Jalili Sabet, Philipp Dufter, François Yvon, Hinrich Schütze. 1627-1643 [doi]
- TweetEval: Unified Benchmark and Comparative Evaluation for Tweet ClassificationFrancesco Barbieri, José Camacho-Collados, Luis Espinosa Anke, Leonardo Neves. 1644-1650 [doi]
- Octa: Omissions and Conflicts in Target-Aspect Sentiment AnalysisZhe Zhang 0004, Chung-Wei Hang, Munindar P. Singh. 1651-1662 [doi]
- On the Language Neutrality of Pre-trained Multilingual RepresentationsJindrich Libovický, Rudolf Rosa, Alexander Fraser. 1663-1674 [doi]
- Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social MediaXiang Dai, Sarvnaz Karimi, Ben Hachey, Cécile Paris. 1675-1681 [doi]
- TopicBERT for Energy Efficient Document ClassificationYatin Chaudhary, Pankaj Gupta, Khushbu Saxena, Vivek Kulkarni, Thomas A. Runkler, Hinrich Schütze. 1682-1690 [doi]
- Improving Constituency Parsing with Span AttentionYuanhe Tian, Yan Song, Fei Xia, Tong Zhang. 1691-1703 [doi]
- Optimizing BERT for Unlabeled Text-Based Items SimilarityItzik Malkiel, Oren Barkan, Avi Caciularu, Noam Razin, Ori Katz, Noam Koenigstein. 1704-1714 [doi]
- Multi-Agent Mutual Learning at Sentence-Level and Token-Level for Neural Machine TranslationBaohao Liao, Yingbo Gao, Hermann Ney. 1715-1724 [doi]
- DomBERT: Domain-oriented Language Model for Aspect-based Sentiment AnalysisHu Xu, Bing Liu, Lei Shu, Philip S. Yu. 1725-1731 [doi]
- RMM: A Recursive Mental Model for Dialog NavigationHomero Roman Roman, Yonatan Bisk, Jesse Thomason, Asli Çelikyilmaz, Jianfeng Gao. 1732-1745 [doi]
- Will this Idea Spread Beyond Academia? Understanding Knowledge Transfer of Scientific Concepts across Text CorporaHancheng Cao, Mengjie Cheng, Zhepeng Cen, Daniel A. McFarland, Xiang Ren. 1746-1757 [doi]
- Recurrent Inference in Text EditingNing Shi, Ziheng Zeng, Haotian Zhang, Yichen Gong. 1758-1769 [doi]
- An Empirical Exploration of Local Ordering Pre-training for Structured LearningZhisong Zhang, Xiang Kong, Lori S. Levin, Eduard H. Hovy. 1770-1783 [doi]
- Unsupervised Extractive Summarization by Pre-training Hierarchical TransformersShusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei, Ming Zhou. 1784-1795 [doi]
- Active Learning Approaches to Enhancing Neural Machine Translation: An Empirical StudyYuekai Zhao, Haoran Zhang, Shuchang Zhou, Zhihua Zhang. 1796-1806 [doi]
- Towards Fine-Grained Transfer: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slot FillingLibo Qin, Xiao Xu, Wanxiang Che, Ting Liu. 1807-1816 [doi]
- Continual Learning Long Short Term MemoryXin Guo, Yu Tian, Qinghan Xue, Panos Lampropoulos, Steven Eliuk, Kenneth E. Barner, Xiaolong Wang 0006. 1817-1822 [doi]
- CommonGen: A Constrained Text Generation Challenge for Generative Commonsense ReasoningBill Yuchen Lin, Wangchunshu Zhou, Ming Shen, Pei Zhou, Chandra Bhagavatula, Yejin Choi, Xiang Ren. 1823-1840 [doi]
- Constrained Decoding for Computationally Efficient Named Entity Recognition TaggersBrian Lester, Daniel Pressel, Amy Hemmeter, Sagnik Ray Choudhury, Srinivas Bangalore. 1841-1848 [doi]
- On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL QueriesTianze Shi, Chen Zhao, Jordan L. Boyd-Graber, Hal Daumé III, Lillian Lee. 1849-1864 [doi]
- TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and DenoisingZiyi Yang, Chenguang Zhu, Robert Gmyr, Michael Zeng, Xuedong Huang 0001, Eric Darve. 1865-1874 [doi]
- Improving End-to-End Bangla Speech Recognition with Semi-supervised TrainingNafis Sadeq, Nafis Tahmid Chowdhury, Farhan Tanvir Utshaw, Shafayat Ahmed, Muhammad Abdullah Adnan. 1875-1883 [doi]
- No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform GesturesChaitanya Ahuja, Dong-Won Lee, Ryo Ishii, Louis-Philippe Morency. 1884-1895 [doi]
- UnifiedQA: Crossing Format Boundaries With a Single QA SystemDaniel Khashabi, Sewon Min, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter Clark, Hannaneh Hajishirzi. 1896-1907 [doi]
- Robust and Interpretable Grounding of Spatial References with Relation NetworksTsung-Yen Yang, Andrew S. Lan, Karthik Narasimhan. 1908-1923 [doi]
- Pragmatic Issue-Sensitive Image CaptioningAllen Nie, Reuben Cohn-Gordon, Christopher Potts. 1924-1938 [doi]
- PTUM: Pre-training User Model from Unlabeled User Behaviors via Self-supervisionChuhan Wu, Fangzhao Wu, Tao Qi, Jianxun Lian, Yongfeng Huang, Xing Xie 0001. 1939-1944 [doi]
- Adversarial Subword Regularization for Robust Neural Machine TranslationJungsoo Park, Mujeen Sung, Jinhyuk Lee, Jaewoo Kang. 1945-1953 [doi]
- Learning Visual-Semantic Embeddings for Reporting Abnormal Findings on Chest X-raysJianmo Ni, Chun-Nan Hsu, Amilcare Gentili, Julian J. McAuley. 1954-1960 [doi]
- SynET: Synonym Expansion using TransitivityJiale Yu, Yongliang Shen, Xinyin Ma, Chenghao Jia, Chen Chen, Weiming Lu 0001. 1961-1970 [doi]
- Scheduled DropHead: A Regularization Method for Transformer ModelsWangchunshu Zhou, Tao Ge, Furu Wei, Ming Zhou 0001, Ke Xu 0001. 1971-1980 [doi]
- Multi-Turn Dialogue Generation in E-Commerce Platform with the Context of Historical DialogueWeisheng Zhang, Kaisong Song, Yangyang Kang, Zhongqing Wang, Changlong Sun, Xiaozhong Liu, Shoushan Li, Min Zhang, Luo Si. 1981-1990 [doi]
- Automatically Identifying Gender Issues in Machine Translation using PerturbationsHila Gonen, Kellie Webster. 1991-1995 [doi]
- Ruler: Data Programming by Demonstration for Document LabelingSara Evensen, Chang Ge 0002, Çagatay Demiralp. 1996-2005 [doi]
- Dual Reconstruction: a Unifying Objective for Semi-Supervised Neural Machine TranslationWeijia Xu, Xing Niu, Marine Carpuat. 2006-2020 [doi]
- Focus-Constrained Attention Mechanism for CVAE-based Response GenerationZhi Cui, Yanran Li, Jiayi Zhang, Jianwei Cui, Chen Wei, Bin Wang. 2021-2030 [doi]
- Chunk-based Chinese Spelling Check with Global OptimizationZuyi Bao, Chen Li, Rui Wang. 2031-2040 [doi]
- Multi-pretraining for Large-scale Text ClassificationKang Min Kim, Bumsu Hyeon, Yeachan Kim, Jun-Hyung Park, SangKeun Lee 0001. 2041-2050 [doi]
- End-to-End Speech Recognition and Disfluency RemovalParia Jamshid Lou, Mark Johnson 0001. 2051-2061 [doi]
- Characterizing the Value of Information in Medical NotesChao-Chun Hsu, Shantanu Karnwal, Sendhil Mullainathan, Ziad Obermeyer, Chenhao Tan. 2062-2072 [doi]
- KLearn: Background Knowledge Inference from Summarization DataMaxime Peyrard, Robert West 0001. 2073-2085 [doi]
- Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-trainingDongha Choi, Hyunju Lee. 2086-2095 [doi]
- Logic2Text: High-Fidelity Natural Language Generation from Logical FormsZhiYu Chen, Wenhu Chen, Hanwen Zha, Xiyou Zhou, Yunkai Zhang, Sairam Sundaresan, William Yang Wang. 2096-2111 [doi]
- MedICaT: A Dataset of Medical Images, Captions, and Textual ReferencesSanjay Subramanian, Lucy Lu Wang, Ben Bogin, Sachin Mehta, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh 0001, Matt Gardner 0001, Hannaneh Hajishirzi. 2112-2120 [doi]
- TSDG: Content-aware Neural Response Generation with Two-stage Decoding ProcessJunsheng Kong, Zhicheng Zhong, Yi Cai, Xin Wu, Da Ren. 2121-2126 [doi]
- Unsupervised Cross-Lingual Adaptation of Dependency Parsers Using CRF AutoencodersZhao Li, Kewei Tu. 2127-2133 [doi]
- Diversify Question Generation with Continuous Content Selectors and Question Type ModelingZhen Wang, Siwei Rao, Jie Zhang, Zhen Qin, Guangjian Tian, Jun Wang. 2134-2143 [doi]
- Participatory Research for Low-resourced Machine Translation: A Case Study in African LanguagesWilhelmina Nekoto, Vukosi Marivate, Tshinondiwa Matsila, Timi E. Fasubaa, Taiwo Fagbohungbe, Solomon Oluwole Akinola, Shamsuddeen Hassan Muhammad, Salomon Kabongo Kabenamualu, Salomey Osei, Freshia Sackey, Rubungo Andre Niyongabo, Ricky Macharm, Perez Ogayo, Orevaoghene Ahia, Musie Meressa Berhe, Mofetoluwa Adeyemi, Masabata Mokgesi-Selinga, Lawrence Okegbemi, Laura Martinus, Kolawole Tajudeen, Kevin Degila, Kelechi Ogueji, Kathleen Siminyu, Julia Kreutzer, Jason Webster, Jamiil Toure Ali, Jade Z. Abbott, Iroro Orife, Ignatius Ezeani, Idris Abdulkabir Dangana, Herman Kamper, Hady ElSahar, Goodness Duru, Ghollah Kioko, Espoir Murhabazi, Elan Van Biljon, Daniel Whitenack, Christopher Onyefuluchi, Chris Chinenye Emezue, Bonaventure F. P. Dossou, Blessing Sibanda, Blessing Itoro Bassey, Ayodele Olabiyi, Arshath Ramkilowan, Alp Öktem, Adewale Akinfaderin, Abdallah Bashir. 2144-2160 [doi]
- ConveRT: Efficient and Accurate Conversational Representations from TransformersMatthew Henderson, Iñigo Casanueva, Nikola Mrksic, Pei-hao Su, Tsung-Hsien Wen, Ivan Vulic. 2161-2174 [doi]
- Computer Assisted Translation with Neural Quality Estimation and Auotmatic Post-EditingKe Wang, Jiayi Wang, Niyu Ge, Yangbin Shi, Yu Zhao, Kai Fan. 2175-2186 [doi]
- Zero-Shot Rationalization by Multi-Task Transfer Learning from Question AnsweringPo-Nien Kung, Tse-Hsuan Yang, Yi-Cheng Chen, Sheng-Siang Yin, Yun-Nung Chen. 2187-2197 [doi]
- The Role of Reentrancies in Abstract Meaning Representation ParsingMarco Damonte, Ida Szubert, Shay B. Cohen, Mark Steedman. 2198-2207 [doi]
- Cross-Lingual Suicidal-Oriented Word Embedding toward Suicide PreventionDaeun Lee, Soyoung Park, Jiwon Kang, Daejin Choi, Jinyoung Han. 2208-2217 [doi]
- Service-oriented Text-to-SQL ParsingWangsu Hu, Jilei Tian. 2218-2222 [doi]
- Reinforcement Learning with Imbalanced Dataset for Data-to-Text Medical Report GenerationToru Nishino, Ryota Ozaki, Yohei Momoki, Tomoki Taniguchi, Ryuji Kano, Norihisa Nakano, Yuki Tagawa, Motoki Taniguchi, Tomoko Ohkuma, Keigo Nakamura. 2223-2236 [doi]
- Reducing the Frequency of Hallucinated Quantities in Abstractive SummariesZheng Zhao, Shay B. Cohen, Bonnie Webber. 2237-2249 [doi]
- Rethinking Topic Modelling: From Document-Space to Term-SpaceMagnus Sahlgren. 2250-2259 [doi]
- Sparse and Decorrelated Representations for Stable Zero-shot NMTBokyung Son, Sungwon Lyu. 2260-2266 [doi]
- A Semi-supervised Approach to Generate the Code-Mixed Text using Pre-trained Encoder and Transfer LearningDeepak Gupta, Asif Ekbal, Pushpak Bhattacharyya. 2267-2280 [doi]
- Integrating Graph Contextualized Knowledge into Pre-trained Language ModelsBin He, Di Zhou, JingHui Xiao, Xin Jiang, Qun Liu, Nicholas Jing Yuan, Tong Xu. 2281-2290 [doi]
- Recursive Top-Down Production for Sentence Generation with Latent TreesShawn Tan, Yikang Shen, Alessandro Sordoni, Aaron C. Courville, Timothy J. O'Donnell. 2291-2307 [doi]
- Guided Dialogue Policy Learning without Adversarial Learning in the LoopZiming Li 0001, Sungjin Lee, Baolin Peng, Jinchao Li, Julia Kiseleva, Maarten de Rijke, Shahin Shayandeh, Jianfeng Gao. 2308-2317 [doi]
- MultiDM-GCN: Aspect-Guided Response Generation in Multi-Domain Multi-Modal Dialogue System using Graph Convolution NetworkMauajama Firdaus, Nidhi Thakur, Asif Ekbal. 2318-2328 [doi]
- Edge-Enhanced Graph Convolution Networks for Event Detection with Syntactic RelationShiyao Cui, Bowen Yu 0002, Tingwen Liu, Zhenyu Zhang 0006, Xuebin Wang, Jinqiao Shi. 2329-2339 [doi]
- Semi-supervised Formality Style Transfer using Language Model Discriminator and Mutual Information MaximizationKunal Chawla, Diyi Yang. 2340-2354 [doi]
- Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and FairnessLingjuan Lyu, Xuanli He, Yitong Li. 2355-2365 [doi]
- Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on SuccessFarzana Rashid, Tommaso Fornaciari, Dirk Hovy, Eduardo Blanco 0002, Fernando Vega-Redondo. 2366-2371 [doi]
- Learning Knowledge Bases with Parameters for Task-Oriented Dialogue SystemsAndrea Madotto, Samuel Cahyawijaya, Genta Indra Winata, Yan Xu 0012, Zihan Liu, Zhaojiang Lin, Pascale Fung. 2372-2394 [doi]
- Generalizing Open Domain Fact Extraction and Verification to COVID-FACT thorough In-Domain Language ModelingZhenghao Liu, Chenyan Xiong, Zhuyun Dai, Si Sun, Maosong Sun, Zhiyuan Liu 0001. 2395-2400 [doi]
- ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-trainingWeizhen Qi, Yu Yan, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang, Ming Zhou 0001. 2401-2410 [doi]
- DivGAN: Towards Diverse Paraphrase Generation via Diversified Generative Adversarial NetworkYue Cao, Xiaojun Wan 0001. 2411-2421 [doi]
- Plug-and-Play Conversational ModelsAndrea Madotto, Etsuko Ishii, Zhaojiang Lin, Sumanth Dathathri, Pascale Fung. 2422-2433 [doi]
- Event-Driven Learning of Systematic Behaviours in Stock MarketsXianchao Wu. 2434-2444 [doi]
- You could have said that instead: Improving Chatbots with Natural Language FeedbackMakesh Narsimhan Sreedhar, Kun Ni, Siva Reddy. 2445-2453 [doi]
- Adapting Coreference Resolution to Twitter ConversationsBerfin Aktas, Veronika Solopova, Annalena Kohnert, Manfred Stede. 2454-2460 [doi]
- On Romanization for Model Transfer Between Scripts in Neural Machine TranslationChantal Amrhein, Rico Sennrich. 2461-2469 [doi]
- COSMIC: COmmonSense knowledge for eMotion Identification in ConversationsDeepanway Ghosal, Navonil Majumder, Alexander F. Gelbukh, Rada Mihalcea, Soujanya Poria. 2470-2481 [doi]
- Improving Compositional Generalization in Semantic ParsingInbar Oren, Jonathan Herzig, Nitish Gupta, Matt Gardner 0001, Jonathan Berant. 2482-2495 [doi]
- Answer Span Correction in Machine Reading ComprehensionRevanth Gangi Reddy, Md. Arafat Sultan, Efsun Sarioglu Kayi, Rong Zhang, Vittorio Castelli, Avirup Sil. 2496-2501 [doi]
- On the Interplay Between Fine-tuning and Sentence-Level Probing for Linguistic Knowledge in Pre-Trained TransformersMarius Mosbach, Anna Khokhlova, Michael A. Hedderich, Dietrich Klakow. 2502-2516 [doi]
- Zero-shot Entity Linking with Efficient Long Range Sequence ModelingZonghai Yao, Liangliang Cao, Huapu Pan. 2517-2522 [doi]
- How Does Context Matter? On the Robustness of Event Detection with Context-Selective Mask GeneralizationJian Liu, Yubo Chen 0001, Kang Liu 0001, Yantao Jia, Zhicheng Sheng. 2523-2532 [doi]
- Adaptive Feature Selection for End-to-End Speech TranslationBiao Zhang, Ivan Titov, Barry Haddow, Rico Sennrich. 2533-2544 [doi]
- Abstractive Multi-Document Summarization via Joint Learning with Single-Document SummarizationHanqi Jin, Xiaojun Wan 0001. 2545-2554 [doi]
- Blockwise Self-Attention for Long Document UnderstandingJiezhong Qiu, Hao Ma, Omer Levy, Wen-tau Yih, Sinong Wang, Jie Tang 0001. 2555-2565 [doi]
- Unsupervised Few-Bits Semantic Hashing with Implicit Topics ModelingFanghua Ye, Jarana Manotumruksa, Emine Yilmaz. 2566-2575 [doi]
- Grid Tagging Scheme for End-to-End Fine-grained Opinion ExtractionZhen Wu, Chengcan Ying, Fei Zhao, Zhifang Fan, Xinyu Dai, Rui Xia. 2576-2585 [doi]
- Learning Numeral EmbeddingChengyue Jiang, Zhonglin Nian, Kaihao Guo, Shanbo Chu, Yinggong Zhao, Libin Shen, Kewei Tu. 2586-2599 [doi]
- An Investigation of Potential Function Designs for Neural CRFZechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu. 2600-2609 [doi]
- Fast End-to-end Coreference Resolution for KoreanCheoneum Park, Jamin Shin, Sungjoon Park, Joonho Lim, Changki Lee. 2610-2624 [doi]
- Toward Stance-based Personas for Opinionated DialoguesThomas Scialom, Serra Sinem Tekiroglu, Jacopo Staiano, Marco Guerini. 2625-2635 [doi]
- Hierarchical Pre-training for Sequence Labelling in Spoken DialogEmile Chapuis, Pierre Colombo, Matteo Manica, Matthieu Labeau, Chloé Clavel. 2636-2648 [doi]
- Extending Multilingual BERT to Low-Resource LanguagesZihan Wang, Karthikeyan K, Stephen Mayhew 0001, Dan Roth. 2649-2656 [doi]
- Out-of-Sample Representation Learning for Knowledge GraphsMarjan Albooyeh, Rishab Goel, Seyed Mehran Kazemi. 2657-2666 [doi]
- Fine-Grained Grounding for Multimodal Speech RecognitionTejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott. 2667-2677 [doi]
- Unsupervised Expressive Rules Provide Explainability and Assist Human Experts Grasping New DomainsEyal Shnarch, Leshem Choshen, Guy Moshkowich, Ranit Aharonov, Noam Slonim. 2678-2697 [doi]
- Textual supervision for visually grounded spoken language understandingBertrand Higy, Desmond Elliott, Grzegorz Chrupala. 2698-2709 [doi]
- Universal Dependencies according to BERT: both more specific and more generalTomasz Limisiewicz, David Marecek, Rudolf Rosa. 2710-2722 [doi]
- Visual Objects As Context: Exploiting Visual Objects for Lexical EntailmentMasayasu Muraoka, Tetsuya Nasukawa, Bishwaranjan Bhattacharjee. 2723-2735 [doi]
- Learning to Plan and Realize Separately for Open-Ended Dialogue SystemsSashank Santhanam, Zhuo Cheng, Brodie Mather, Bonnie J. Dorr, Archna Bhatia, Bryanna Hebenstreit, Alan Zemel, Adam Dalton 0001, Tomek Strzalkowski, Samira Shaikh. 2736-2750 [doi]
- Be Different to Be Better! A Benchmark to Leverage the Complementarity of Language and VisionSandro Pezzelle, Claudio Greco 0002, Greta Gandolfi, Eleonora Gualdoni, Raffaella Bernardi. 2751-2767 [doi]
- Cross-Lingual Training of Neural Models for Document RankingPeng Shi, He Bai, Jimmy Lin. 2768-2773 [doi]
- Improving Word Embedding Factorization for Compression using Distilled Nonlinear Neural DecompositionVasileios Lioutas, Ahmad Rashid, Krtin Kumar, Md. Akmal Haidar, Mehdi Rezagholizadeh. 2774-2784 [doi]
- PharmMT: A Neural Machine Translation Approach to Simplify Prescription DirectionsJiazhao Li, Corey Lester, Xinyan Zhao, Yuting Ding, Yun Jiang, V. G. Vinod Vydiswaran. 2785-2796 [doi]
- LSTMS Compose - and Learn - Bottom-UpNaomi Saphra, Adam Lopez. 2797-2809 [doi]
- Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense GraphsAna Marasovic, Chandra Bhagavatula, Jae Sung Park, Ronan Le Bras, Noah A. Smith, Yejin Choi. 2810-2829 [doi]
- Corpora Evaluation and System Bias detection in Multi Document SummarizationAlvin Dey, Tanya Chowdhury, Yash Kumar Atri, Tanmoy Chakraborty 0002. 2830-2840 [doi]
- Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word ProblemShucheng Li, Lingfei Wu, Shiwei Feng, Fangli Xu, Fengyuan Xu, Sheng Zhong. 2841-2852 [doi]
- Target Conditioning for One-to-Many GenerationMarie-Anne Lachaux, Armand Joulin, Guillaume Lample. 2853-2862 [doi]
- Can Pre-training help VQA with Lexical Variations?Shailza Jolly, Shubham Kapoor. 2863-2868 [doi]
- FENAS: Flexible and Expressive Neural Architecture SearchRamakanth Pasunuru, Mohit Bansal. 2869-2876 [doi]
- Inferring symmetry in natural languageChelsea Tanchip, Lei Yu, Aotao Xu, Yang Xu. 2877-2886 [doi]
- A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer EncoderXipeng Qiu, Hengzhi Pei, Hang Yan, Xuanjing Huang. 2887-2897 [doi]
- LEGAL-BERT: "Preparing the Muppets for Court'"Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos. 2898-2904 [doi]
- Enhancing Content Planning for Table-to-Text Generation with Data Understanding and VerificationHeng Gong, Wei Bi, Xiaocheng Feng, Bing Qin 0001, Xiaojiang Liu, Ting Liu 0001. 2905-2914 [doi]
- Contextual Text Style TransferYu Cheng, Zhe Gan, Yizhe Zhang, Oussama Elachqar, Dianqi Li, Jingjing Liu 0001. 2915-2924 [doi]
- DiPair: Fast and Accurate Distillation for Trillion-ScaleText Matching and Pair ModelingJiecao Chen, Liu Yang, Karthik Raman, Michael Bendersky, Jung-Jung Yeh, Yun Zhou, Marc Najork, Danyang Cai, Ehsan Emadzadeh. 2925-2937 [doi]
- Cross-Lingual Dependency Parsing by POS-Guided Word ReorderingLu Liu, Yi Zhou, Jianhan Xu, Xiaoqing Zheng, Kai-Wei Chang, Xuanjing Huang. 2938-2948 [doi]
- Assessing Robustness of Text Classification through Maximal Safe Radius ComputationEmanuele La Malfa, Min Wu, Luca Laurenti, Benjie Wang, Anthony Hartshorn, Marta Kwiatkowska. 2949-2968 [doi]
- Social Commonsense Reasoning with Multi-Head Knowledge AttentionDebjit Paul, Anette Frank. 2969-2980 [doi]
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken DialogErik Ekstedt, Gabriel Skantze. 2981-2990 [doi]
- A little goes a long way: Improving toxic language classification despite data scarcityMika Juuti, Tommi Gröndahl, Adrian Flanagan, N. Asokan. 2991-3009 [doi]
- An Instance Level Approach for Shallow Semantic Parsing in Scientific Procedural TextDaivik Swarup, Ahsaas Bajaj, Sheshera Mysore, Tim O'Gorman, Rajarshi Das, Andrew McCallum. 3010-3017 [doi]
- General Purpose Text Embeddings from Pre-trained Language Models for Scalable InferenceJingfei Du, Myle Ott, Haoran Li, Xing Zhou, Veselin Stoyanov. 3018-3030 [doi]
- Learning to Model and Ignore Dataset Bias with Mixed Capacity EnsemblesChristopher Clark, Mark Yatskar, Luke Zettlemoyer. 3031-3045 [doi]
- Learning to Generalize for Sequential Decision MakingXusen Yin, Ralph M. Weischedel, Jonathan May. 3046-3063 [doi]
- Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial ReportsMarkus Zlabinger, Marta Sabou, Sebastian Hofstätter, Allan Hanbury. 3064-3074 [doi]
- Adversarial Grammatical Error CorrectionVipul Raheja, Dimitris Alikaniotis. 3075-3087 [doi]
- On Long-Tailed Phenomena in Neural Machine TranslationVikas Raunak, Siddharth Dalmia, Vivek Gupta 0001, Florian Metze. 3088-3095 [doi]
- Knowing What You Know: Calibrating Dialogue Belief State Distributions via EnsemblesCarel van Niekerk, Michael Heck, Christian Geishauser, Hsien-Chin Lin, Nurul Lubis, Marco Moresi, Milica Gasic. 3096-3102 [doi]
- Domain Adversarial Fine-Tuning as an Effective RegularizerGiorgos Vernikos, Katerina Margatina, Alexandra Chronopoulou, Ion Androutsopoulos. 3103-3112 [doi]
- CLAR: A Cross-Lingual Argument Regularizer for Semantic Role LabelingIshan Jindal, Yunyao Li 0001, Siddhartha Brahma, Huaiyu Zhu 0001. 3113-3125 [doi]
- Neutralizing Gender Bias in Word Embedding with Latent Disentanglement and Counterfactual GenerationSeungJae Shin, Kyungwoo Song, JoonHo Jang, Hyemi Kim, Weonyoung Joo, Il-Chul Moon. 3126-3140 [doi]
- Towards Domain-Independent Text Structuring Trainable on Large Discourse TreebanksGrigorii Guz, Giuseppe Carenini. 3141-3152 [doi]
- Data Annealing for Informal Language Understanding TasksJing Gu, Zhou Yu. 3153-3159 [doi]
- A Multilingual View of Unsupervised Machine TranslationXavier Garcia, Pierre Foret, Thibault Sellam, Ankur P. Parikh. 3160-3170 [doi]
- An Evaluation Method for Diachronic Word Sense InductionAshjan Alsulaimani, Erwan Moreau, Carl Vogel. 3171-3180 [doi]
- Integrating Task Specific Information into Pretrained Language Models for Low Resource Fine TuningRui Wang, Shijing Si, Guoyin Wang 0002, Lei Zhang, Lawrence Carin, Ricardo Henao. 3181-3186 [doi]
- Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured PruningBingbing Li, Zhenglun Kong, Tianyun Zhang, Ji Li 0006, Zhengang Li, Hang Liu, Caiwen Ding. 3187-3199 [doi]
- KoBE: Knowledge-Based Machine Translation EvaluationZorik Gekhman, Roee Aharoni, Genady Beryozkin, Markus Freitag, Wolfgang Macherey. 3200-3207 [doi]
- Pushing the Limits of AMR Parsing with Self-LearningYoung-Suk Lee, Ramón Fernandez Astudillo, Tahira Naseem, Revanth Gangi Reddy, Radu Florian, Salim Roukos. 3208-3214 [doi]
- Towards Zero Shot Conditional Summarization with Adaptive Multi-task Fine-TuningTravis Goodwin, Max E. Savery, Dina Demner-Fushman. 3215-3226 [doi]
- Multilingual Knowledge Graph Completion via Ensemble Knowledge TransferXuelu Chen, Muhao Chen, Changjun Fan, Ankith Uppunda, Yizhou Sun, Carlo Zaniolo. 3227-3238 [doi]
- Towards Controllable Biases in Language GenerationEmily Sheng, Kai-Wei Chang, Prem Natarajan, Nanyun Peng. 3239-3254 [doi]
- RobBERT: a Dutch RoBERTa-based Language ModelPieter Delobelle, Thomas Winters, Bettina Berendt. 3255-3265 [doi]
- Regularization of Distinct Strategies for Unsupervised Question GenerationJunmo Kang, Giwon Hong, Haritz Puerto San Roman, Sung-Hyon Myaeng. 3266-3277 [doi]
- Graph-to-Graph Transformer for Transition-based Dependency ParsingAlireza Mohammadshahi, James Henderson. 3278-3289 [doi]
- WER we are and WER we think we arePiotr Szymanski, Piotr Zelasko, Mikolaj Morzy, Adrian Szymczak, Marzena Zyla-Hoppe, Joanna Banaszczak, Lukasz Augustyniak, Jan Mizgajski, Yishay Carmiel. 3290-3295 [doi]
- DeSMOG: Detecting Stance in Media On Global WarmingYiwei Luo, Dallas Card, Dan Jurafsky. 3296-3315 [doi]
- A Novel Challenge Set for Hebrew Morphological Disambiguation and Diacritics RestorationAvi Shmidman, Joshua Guedalia, Shaltiel Shmidman, Moshe Koppel, Reut Tsarfaty. 3316-3326 [doi]
- Improve Transformer Models with Better Relative Position EmbeddingsZhiheng Huang, Davis Liang, Peng Xu, Bing Xiang. 3327-3335 [doi]
- A Sentiment-Controllable Topic-to-Essay Generator with Topic Knowledge GraphLin Qiao, Jianhao Yan, Fandong Meng, Zhendong Yang, Jie Zhou 0016. 3336-3344 [doi]
- What-if I ask you to explain: Explaining the effects of perturbations in procedural textDheeraj Rajagopal, Niket Tandon, Peter Clark, Bhavana Dalvi, Eduard H. Hovy. 3345-3355 [doi]
- RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language ModelsSamuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi, Noah A. Smith. 3356-3369 [doi]
- Improving Event Duration Prediction via Time-aware Pre-trainingZonglin Yang, Xinya Du, Alexander M. Rush, Claire Cardie. 3370-3378 [doi]
- Composed Variational Natural Language Generation for Few-shot IntentsCongying Xia, Caiming Xiong, Philip S. Yu, Richard Socher. 3379-3388 [doi]
- Document Reranking for Precision Medicine with Neural Matching and Faceted SummarizationJiho Noh, Ramakanth Kavuluru. 3389-3399 [doi]
- On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise TasksStephen Mussmann, Robin Jia, Percy Liang. 3400-3413 [doi]
- A Dual-Attention Network for Joint Named Entity Recognition and Sentence Classification of Adverse Drug EventsSusmitha Wunnava, Xiao Qin 0003, Tabassum Kakar, Xiangnan Kong, Elke A. Rundensteiner. 3414-3423 [doi]
- BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QANora Kassner, Hinrich Schütze. 3424-3430 [doi]
- Identifying spurious correlations for robust text classificationZhao Wang, Aron Culotta. 3431-3440 [doi]
- HoVer: A Dataset for Many-Hop Fact Extraction And Claim VerificationYichen Jiang, Shikha Bordia, Zheng Zhong, Charles Dognin, Maneesh Kumar Singh 0001, Mohit Bansal. 3441-3460 [doi]
- Continual Learning for Natural Language Generation in Task-oriented Dialog SystemsFei Mi, Liangwei Chen, Mengjie Zhao, Minlie Huang, Boi Faltings. 3461-3474 [doi]
- UNQOVERing Stereotypical Biases via Underspecified QuestionsTao Li, Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Vivek Srikumar. 3475-3489 [doi]
- A Semantics-based Approach to Disclosure Classification in User-Generated Online ContentChandan Akiti, Anna Cinzia Squicciarini, Sarah Michele Rajtmajer. 3490-3499 [doi]
- Mining Knowledge for Natural Language Inference from Wikipedia CategoriesMingda Chen, Zewei Chu, Karl Stratos, Kevin Gimpel. 3500-3511 [doi]
- OCNLI: Original Chinese Natural Language InferenceHai Hu, Kyle Richardson, Liang Xu, Lu Li, Sandra Kübler, Lawrence S. Moss. 3512-3526 [doi]
- Unsupervised Domain Adaptation for Cross-lingual Text LabelingDejiao Zhang, Ramesh Nallapati, Henghui Zhu, Feng Nan, Cícero Nogueira dos Santos, Kathleen McKeown, Bing Xiang. 3527-3536 [doi]
- Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue SystemsZiming Li 0001, Julia Kiseleva, Maarten de Rijke. 3537-3546 [doi]
- What do we expect from Multiple-choice QA Systems?Krunal Shah, Nitish Gupta, Dan Roth. 3547-3553 [doi]
- Resource-Enhanced Neural Model for Event Argument ExtractionJie Ma, Shuai Wang, Rishita Anubhai, Miguel Ballesteros, Yaser Al-Onaizan. 3554-3559 [doi]
- Improving Target-side Lexical Transfer in Multilingual Neural Machine TranslationLuyu Gao, Xinyi Wang, Graham Neubig. 3560-3566 [doi]
- Accurate Polyglot Semantic Parsing With DAG GrammarsFederico Fancellu, Ákos Kádár, Ran Zhang, Afsaneh Fazly. 3567-3580 [doi]
- Approximation of Response Knowledge Retrieval in Knowledge-grounded Dialogue GenerationWen Zheng, Natasa Milic-Frayling, Ke Zhou. 3581-3591 [doi]
- Evaluating Factuality in Generation with Dependency-level EntailmentTanya Goyal, Greg Durrett. 3592-3603 [doi]
- Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse TeacherGiannis Karamanolakis, Daniel Hsu 0001, Luis Gravano. 3604-3622 [doi]
- A Multi-Persona Chatbot for Hotline Counselor TrainingOrianna DeMasi, Yu Li, Zhou Yu. 3623-3636 [doi]
- Narrative Text Generation with a Latent Discrete PlanHarsh Jhamtani, Taylor Berg-Kirkpatrick. 3637-3650 [doi]
- Graph Transformer Networks with Syntactic and Semantic Structures for Event Argument ExtractionAmir Pouran Ben Veyseh, Tuan Ngo Nguyen, Thien Huu Nguyen. 3651-3661 [doi]
- The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine TranslationJie He, Tao Wang, Deyi Xiong, Qun Liu. 3662-3672 [doi]
- Using Visual Feature Space as a Pivot Across LanguagesZiyan Yang, Leticia Pinto-Alva, Franck Dernoncourt, Vicente Ordonez. 3673-3678 [doi]
- An Empirical Study of Cross-Dataset Evaluation for Neural Summarization SystemsYiran Chen, Pengfei Liu, Ming Zhong, Zi-Yi Dou, Danqing Wang, Xipeng Qiu, Xuanjing Huang. 3679-3691 [doi]
- Attending to Long-Distance Document Context for Sequence LabelingMatthew Jörke, Jon Gillick, Matthew Sims, David Bamman. 3692-3704 [doi]
- Global Bootstrapping Neural Network for Entity Set ExpansionLingyong Yan, Xianpei Han, Ben He, Le Sun 0001. 3705-3714 [doi]
- Document Classification for COVID-19 LiteratureBernal Jimenez Gutierrez, Jucheng Zeng, DongDong Zhang, Ping Zhang, Yu Su. 3715-3722 [doi]
- Adversarial Augmentation Policy Search for Domain and Cross-Lingual Generalization in Reading ComprehensionAdyasha Maharana, Mohit Bansal. 3723-3738 [doi]
- Denoising Multi-Source Weak Supervision for Neural Text ClassificationWendi Ren, Yinghao Li, Hanting Su, David Kartchner, Cassie Mitchell, Chao Zhang. 3739-3754 [doi]
- Dr. Summarize: Global Summarization of Medical Dialogue by Exploiting Local StructuresAnirudh Joshi, Namit Katariya, Xavier Amatriain, Anitha Kannan. 3755-3763 [doi]
- Generating Accurate EHR Assessment from Medical GraphZhichao Yang, Hong Yu 0001. 3764-3773 [doi]
- Do Models of Mental Health Based on Social Media Data Generalize?Keith Harrigian, Carlos Aguirre, Mark Dredze. 3774-3788 [doi]
- Context Analysis for Pre-trained Masked Language ModelsYi-An Lai, Garima Lalwani, Yi Zhang. 3789-3804 [doi]
- Controllable Text Generation with Focused VariationLei Shu 0004, Alexandros Papangelis, Yi-Chia Wang, Gökhan Tür, Hu Xu, Zhaleh Feizollahi, Bing Liu 0001, Piero Molino. 3805-3817 [doi]
- Modeling Preconditions in Text with a Crowd-sourced DatasetHeeyoung Kwon, Mahnaz Koupaee, Pratyush Singh, Gargi Sawhney, Anmol Shukla, Keerthi Kumar Kallur, Nathanael Chambers, Niranjan Balasubramanian. 3818-3828 [doi]
- Reevaluating Adversarial Examples in Natural LanguageJohn X. Morris, Eli Lifland, Jack Lanchantin, Yangfeng Ji, Yanjun Qi. 3829-3839 [doi]
- Question Answering with Long Multiple-Span AnswersMing Zhu, Aman Ahuja, Da-Cheng Juan, Wei Wei 0019, Chandan K. Reddy. 3840-3849 [doi]
- Inserting Information Bottleneck for Attribution in TransformersZhiying Jiang, Raphael Tang, Ji Xin, Jimmy Lin. 3850-3857 [doi]
- Event-Related Bias Removal for Real-time Disaster EventsSalvador Medina Maza, Evangelia Spiliopoulou, Eduard H. Hovy, Alexander G. Hauptmann. 3858-3868 [doi]
- It's not a Non-Issue: Negation as a Source of Error in Machine TranslationMd Mosharaf Hossain, Antonios Anastasopoulos, Eduardo Blanco 0002, Alexis Palmer. 3869-3885 [doi]
- Incremental Text-to-Speech Synthesis with Prefix-to-Prefix FrameworkMingbo Ma, Baigong Zheng, Kaibo Liu, Renjie Zheng, Hairong Liu, Kainan Peng, Kenneth Church 0001, Liang Huang 0001. 3886-3896 [doi]
- Joint Turn and Dialogue level User Satisfaction Estimation on Mulit-Domain ConversationsPraveen Kumar Bodigutla, Aditya Tiwari, Spyros Matsoukas, Josep Valls-Vargas, Lazaros Polymenakos. 3897-3909 [doi]
- ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic EnvironmentsHyounghun Kim, Abhaysinh Zala, Graham Burri, Hao Tan, Mohit Bansal. 3910-3927 [doi]
- Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive TrainingRenjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu, Jiahong Yuan, Kenneth Church 0001, Liang Huang 0001. 3928-3937 [doi]
- Towards Context-Aware Code Comment GenerationXiaohan Yu, Quzhe Huang, Zheng Wang, Yansong Feng, Dongyan Zhao 0001. 3938-3947 [doi]
- MCMH: Learning Multi-Chain Multi-Hop Rules for Knowledge Graph ReasoningLu Zhang, Mo Yu, Tian Gao, Yue Yu. 3948-3954 [doi]
- Finding the Optimal Vocabulary Size for Neural Machine TranslationThamme Gowda, Jonathan May. 3955-3964 [doi]
- Weakly- and Semi-supervised Evidence ExtractionDanish Pruthi, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton. 3965-3970 [doi]
- Making Information Seeking Easier: An Improved Pipeline for Conversational SearchVaibhav Kumar, Jamie Callan. 3971-3980 [doi]
- Generalizable and Explainable Dialogue Generation via Explicit Action LearningXinting Huang, Jianzhong Qi 0001, Yu Sun 0021, Rui Zhang 0003. 3981-3991 [doi]
- More Embeddings, Better Sequence Labelers?Xinyu Wang 0013, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu. 3992-4006 [doi]
- NLP Service APIs and Models for Efficient Registration of New ClientsSahil Shah, Vihari Piratla, Soumen Chakrabarti, Sunita Sarawagi. 4007-4012 [doi]
- Effects of Naturalistic Variation in Goal-Oriented DialogJatin Ganhotra, Robert Moore, Sachindra Joshi, Kahini Wadhawan. 4013-4020 [doi]
- Determining Event Outcomes: The Case of #failSrikala Murugan, Dhivya Chinnappa, Eduardo Blanco 0002. 4021-4033 [doi]
- WikiLingua: A New Benchmark Dataset for Multilingual Abstractive SummarizationFaisal Ladhak, Esin Durmus, Claire Cardie, Kathleen R. McKeown. 4034-4048 [doi]
- Adversarial Training for Code Retrieval with Question-Description Relevance RegularizationJie Zhao, Huan Sun. 4049-4059 [doi]
- Large Product Key Memory for Pre-trained Language ModelsGyuwan Kim, Tae-Hwan Jung. 4060-4069 [doi]
- Temporal Reasoning in Natural Language InferenceSiddharth Vashishtha, Adam Poliak, Yash Kumar Lal, Benjamin Van Durme, Aaron Steven White. 4070-4078 [doi]
- A Pilot Study of Text-to-SQL Semantic Parsing for VietnameseAnh Tuan Nguyen, Mai Hoang Dao, Dat Quoc Nguyen. 4079-4085 [doi]
- STANDER: An Expert-Annotated Dataset for News Stance Detection and Evidence RetrievalCostanza Conforti, Jakob Berndt, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd, Nigel Collier. 4086-4101 [doi]
- An Empirical Methodology for Detecting and Prioritizing Needs during Crisis EventsMaria Janina Sarol, Ly Dinh, Rezvaneh Rezapour, Chieh-Li Chin, Pingjing Yang, Jana Diesner. 4102-4107 [doi]
- SupMMD: A Sentence Importance Model for Extractive Summarisation using Maximum Mean DiscrepancyUmanga Bista, Alexander Patrick Mathews, Aditya Krishna Menon, Lexing Xie. 4108-4122 [doi]
- Towards Low-Resource Semi-Supervised Dialogue Generation with Meta-LearningYi Huang, Junlan Feng, Shuo Ma, Xiaoyu Du, Xiaoting Wu. 4123-4128 [doi]
- Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question AnsweringPeiFeng Wang, Nanyun Peng, Filip Ilievski, Pedro A. Szekely, Xiang Ren. 4129-4140 [doi]
- No Answer is Better Than Wrong Answer: A Reflection Model for Document Level Machine Reading ComprehensionXuguang Wang, Linjun Shou, Ming Gong, Nan Duan, Daxin Jiang. 4141-4150 [doi]
- Reference Language based Unsupervised Neural Machine TranslationZuchao Li, Hai Zhao, Rui Wang, Masao Utiyama, Eiichiro Sumita. 4151-4162 [doi]
- TinyBERT: Distilling BERT for Natural Language UnderstandingXiaoqi Jiao, Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Linlin Li, Fang Wang 0001, Qun Liu. 4163-4174 [doi]
- Poison Attacks against Text Datasets with Conditional Adversarially Regularized AutoencoderAlvin Chan, Yi Tay, Yew-Soon Ong, Aston Zhang. 4175-4189 [doi]
- #Turki$hTweets: A Benchmark Dataset for Turkish Text CorrectionAsiye Tuba Koksal, Ozge Bozal, Emre Yürekli, Gizem Gezici. 4190-4198 [doi]
- Assessing Human-Parity in Machine Translation on the Segment LevelYvette Graham, Christian Federmann, Maria Eskevich, Barry Haddow. 4199-4207 [doi]
- Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across ChannelsHarris Chan, Jamie Kiros, William Chan. 4208-4220 [doi]
- Factorized Transformer for Multi-Domain Neural Machine TranslationYongchao Deng, Hongfei Yu, Heng Yu, Xiangyu Duan, Weihua Luo. 4221-4230 [doi]
- Improving Named Entity Recognition with Attentive Ensemble of Syntactic InformationYuyang Nie, Yuanhe Tian, Yan Song, Xiang Ao, Xiang Wan. 4231-4245 [doi]
- Query-Key Normalization for TransformersAlex Henry, Prudhvi Raj Dachapally, Shubham Shantaram Pawar, Yuxuan Chen. 4246-4253 [doi]
- Contract Discovery: Dataset and a Few-shot Semantic Retrieval Challenge with Competitive BaselinesLukasz Borchmann, Dawid Wisniewski, Andrzej Gretkowski, Izabela Kosmala, Dawid Jurkiewicz, Lukasz Szalkiewicz, Gabriela Palka, Karol Kaczmarek, Agnieszka Kaliska, Filip Gralinski. 4254-4268 [doi]
- Vocabulary Adaptation for Domain Adaptation in Neural Machine TranslationShoetsu Sato, Jin Sakuma, Naoki Yoshinaga 0001, Masashi Toyoda, Masaru Kitsuregawa. 4269-4279 [doi]
- A Shared-Private Representation Model with Coarse-to-Fine Extraction for Target Sentiment AnalysisPeiqin Lin, Meng Yang. 4280-4289 [doi]
- Detecting Media Bias in News Articles using Gaussian Bias DistributionsWei-Fan Chen, Khalid Al Khatib, Benno Stein 0001, Henning Wachsmuth. 4290-4300 [doi]
- How Can Self-Attention Networks Recognize Dyck-n Languages?Javid Ebrahimi, Dhruv Gelda, Wei Zhang. 4301-4306 [doi]
- Training Flexible Depth Model by Multi-Task Learning for Neural Machine TranslationQiang Wang, Tong Xiao, Jingbo Zhu. 4307-4312 [doi]
- Looking inside Noun Compounds: Unsupervised Prepositional and Free Paraphrasing using Language ModelsGirishkumar Ponkiya, V. Rudra Murthy, Pushpak Bhattacharyya, Girish Keshav Palshikar. 4313-4323 [doi]
- The birth of Romanian BERTStefan Daniel Dumitrescu, Andrei-Marius Avram, Sampo Pyysalo. 4324-4328 [doi]
- BERT for Monolingual and Cross-Lingual Reverse DictionaryHang Yan, Xiaonan Li, Xipeng Qiu, Bocao Deng. 4329-4338 [doi]
- What's so special about BERT's layers? A closer look at the NLP pipeline in monolingual and multilingual modelsWietse de Vries, Andreas van Cranenburgh, Malvina Nissim. 4339-4350 [doi]
- Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?Peter Hase, Shiyue Zhang, Harry Xie, Mohit Bansal. 4351-4367 [doi]
- A Pointer Network Architecture for Joint Morphological Segmentation and TaggingAmit Seker, Reut Tsarfaty. 4368-4378 [doi]
- Beyond Language: Learning Commonsense from Images for ReasoningWanqing Cui, Yanyan Lan, Liang Pang, Jiafeng Guo, Xueqi Cheng. 4379-4389 [doi]
- A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training StrategiesHo-Lam Chung, Ying-Hong Chan, Yao-Chung Fan. 4390-4400 [doi]
- How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?Shayne Longpre, Yu Wang, Chris DuBois. 4401-4411 [doi]
- Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level InstructionsPeter Jansen. 4412-4417 [doi]
- Consistent Response Generation with Controlled SpecificityJunya Takayama, Yuki Arase. 4418-4427 [doi]
- Internal and External Pressures on Language Emergence: Least Effort, Object Constancy and FrequencyDiana Rodríguez Luna, Edoardo Maria Ponti, Dieuwke Hupkes, Elia Bruni. 4428-4437 [doi]
- Parsing All: Syntax and Semantics, Dependencies and SpansJunru Zhou, Zuchao Li, Hai Zhao. 4438-4449 [doi]
- LIMIT-BERT : Linguistics Informed Multi-Task BERTJunru Zhou, Zhuosheng Zhang 0001, Hai Zhao, Shuailiang Zhang. 4450-4461 [doi]
- Improving Limited Labeled Dialogue State Tracking with Self-SupervisionChien-Sheng Wu, Steven C. H. Hoi, Caiming Xiong. 4462-4472 [doi]
- On the Branching Bias of Syntax Extracted from Pre-trained Language ModelsHuayang Li, Lemao Liu, Guoping Huang, Shuming Shi 0001. 4473-4478 [doi]
- The Pragmatics behind Politics: Modelling Metaphor, Framing and Emotion in Political DiscoursePere-Lluís Huguet Cabot, Verna Dankers, David Abadi, Agneta Fischer, Ekaterina Shutova. 4479-4488 [doi]
- SMRTer Chatbots: Improving Non-Task-Oriented Dialog with Simulated Multi-Reference TrainingHuda Khayrallah, João Sedoc. 4489-4505 [doi]
- PrivNet: Safeguarding Private Attributes in Transfer Learning for RecommendationGuangneng Hu, Qiang Yang 0001. 4506-4516 [doi]
- Learning to Learn to Disambiguate: Meta-Learning for Few-Shot Word Sense DisambiguationNithin Holla, Pushkar Mishra, Helen Yannakoudakis, Ekaterina Shutova. 4517-4533 [doi]
- An Empirical Investigation of Beam-Aware Training in SupertaggingRenato Negrinho, Matthew R. Gormley, Geoffrey J. Gordon. 4534-4542 [doi]
- Improving Aspect-based Sentiment Analysis with Gated Graph Convolutional Networks and Syntax-based RegulationAmir Pouran Ben Veyseh, Nasim Nouri, Franck Dernoncourt, Quan Hung Tran, Dejing Dou, Thien Huu Nguyen. 4543-4548 [doi]
- Decoding language spatial relations to 2D spatial arrangementsGorjan Radevski, Guillem Collell, Marie-Francine Moens, Tinne Tuytelaars. 4549-4560 [doi]
- The Dots Have Their Values: Exploiting the Node-Edge Connections in Graph-based Neural Models for Document-level Relation ExtractionHieu Minh Tran, Trung Minh Nguyen, Thien Huu Nguyen. 4561-4567 [doi]
- Why and when should you pool? Analyzing Pooling in Recurrent ArchitecturesPratyush Maini, Keshav Kolluru, Danish Pruthi, Mausam. 4568-4586 [doi]
- Structural and Functional Decomposition for Personality Image Captioning in a Communication GameMinh Thu Nguyen, Duy Phung, Minh Hoai, Thien Huu Nguyen. 4587-4593 [doi]
- Long Document Ranking with Query-Directed Sparse TransformerJyun-Yu Jiang, Chenyan Xiong, Chia-Jung Lee, Wei Wang 0010. 4594-4605 [doi]
- Visuo-Lingustic Question Answering (VLQA) ChallengeShailaja Keyur Sampat, Yezhou Yang, Chitta Baral. 4606-4616 [doi]
- Byte Pair Encoding is Suboptimal for Language Model PretrainingKaj Bostrom, Greg Durrett. 4617-4624 [doi]
- Exploring BERT's sensitivity to lexical cues using tests from semantic primingKanishka Misra, Allyson Ettinger, Julia Rayz. 4625-4635 [doi]
- Multi-hop Question Generation with Graph Convolutional NetworkDan Su, Yan Xu 0012, Wenliang Dai, Ziwei Ji, Tiezheng Yu, Pascale Fung. 4636-4647 [doi]
- MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question AnsweringAisha Urooj Khan, Amir Mazaheri, Niels da Vitoria Lobo, Mubarak Shah. 4648-4660 [doi]
- Thinking Like a Skeptic: Defeasible Inference in Natural LanguageRachel Rudinger, Vered Shwartz, Jena D. Hwang, Chandra Bhagavatula, Maxwell Forbes, Ronan Le Bras, Noah A. Smith, Yejin Choi. 4661-4675 [doi]
- Guiding Attention for Self-Supervised Learning with TransformersAmeet Deshpande, Karthik Narasimhan. 4676-4686 [doi]
- Language-Conditioned Feature Pyramids for Visual Selection TasksTaichi Iki, Akiko Aizawa. 4687-4697 [doi]
- Learning to Classify Human Needs of Events from Category Definitions with Prototypical InstantiationHaibo Ding, Zhe Feng. 4698-4704 [doi]
- Automatic Term Name Generation for Gene Ontology: Task and DatasetYanjian Zhang, Qin Chen, Yiteng Zhang, Zhongyu Wei, Yixu Gao, Jiajie Peng, Zengfeng Huang, Weijian Sun, Xuanjing Huang. 4705-4710 [doi]
- Compressing Transformer-Based Semantic Parsing Models using Compositional Code EmbeddingsPrafull Prakash, Saurabh Kumar Shashidhar, Wenlong Zhao, Subendhu Rongali, Haidar Khan, Michael Kayser. 4711-4717 [doi]
- BERT-QE: Contextualized Query Expansion for Document Re-rankingZhi Zheng, Kai Hui, Ben He, Xianpei Han, Le Sun 0001, Andrew Yates. 4718-4728 [doi]
- ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram RepresentationsShizhe Diao, Jiaxin Bai, Yan Song, Tong Zhang, Yonggang Wang. 4729-4740 [doi]
- Finding Friends and Flipping Frenemies: Automatic Paraphrase Dataset Augmentation Using Graph TheoryHannah Chen, Yangfeng Ji, David Evans. 4741-4751 [doi]
- Probabilistic Case-based Reasoning in Knowledge BasesRajarshi Das, Ameya Godbole, Nicholas Monath, Manzil Zaheer, Andrew McCallum. 4752-4765 [doi]
- TLDR: Extreme Summarization of Scientific DocumentsIsabel Cachola, Kyle Lo, Arman Cohan, Daniel S. Weld. 4766-4777 [doi]
- Tri-Train: Automatic Pre-Fine Tuning between Pre-Training and Fine-Tuning for SciNERQingkai Zeng, Wenhao Yu, Mengxia Yu, Tianwen Jiang, Tim Weninger, Meng Jiang 0001. 4778-4787 [doi]
- Hierarchical Region Learning for Nested Named Entity RecognitionXinwei Long, Shuzi Niu, Yucheng Li. 4788-4793 [doi]
- Understanding User Resistance Strategies in Persuasive ConversationsYouzhi Tian, Weiyan Shi, Chen Li, Zhou Yu. 4794-4798 [doi]
- On the Sub-Layer Functionalities of Transformer DecoderYilin Yang, Longyue Wang, Shuming Shi 0001, Prasad Tadepalli, Stefan Lee, Zhaopeng Tu. 4799-4811 [doi]
- Extremely Low Bit Transformer Quantization for On-Device Neural Machine TranslationInsoo Chung, Byeongwook Kim, Yoonjung Choi, Se Jung Kwon, Yongkweon Jeon, Baeseong Park, Sangha Kim 0002, Dongsoo Lee. 4812-4826 [doi]
- Robust Backed-off Estimation of Out-of-Vocabulary EmbeddingsNobukazu Fukuda, Naoki Yoshinaga 0001, Masaru Kitsuregawa. 4827-4838 [doi]
- Exploiting Unsupervised Data for Emotion Recognition in ConversationsWenxiang Jiao, Michael R. Lyu, Irwin King. 4839-4846 [doi]
- Tensorized Embedding LayersOleksii Hrinchuk, Valentin Khrulkov, Leyla Mirvakhabova, Elena Orlova, Ivan V. Oseledets. 4847-4860 [doi]
- Speaker or Listener? The Role of a Dialogue AgentYa-Fei Liu, Hongjin Qian, Hengpeng Xu, Jinmao Wei. 4861-4869 [doi]
- Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic ParsingXi Victoria Lin, Richard Socher, Caiming Xiong. 4870-4888 [doi]
- Do Language Embeddings capture Scales?Xikun Zhang 0001, Deepak Ramachandran, Ian Tenney, Yanai Elazar, Dan Roth. 4889-4896 [doi]
- Paraphrasing vs Coreferring: Two Sides of the Same CoinYehudit Meged, Avi Caciularu, Vered Shwartz, Ido Dagan. 4897-4907 [doi]
- Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete SpaceDongyu Ru, JiangTao Feng, Lin Qiu, Hao Zhou, Mingxuan Wang, Weinan Zhang 0001, Yong Yu 0001, Lei Li 0005. 4908-4917 [doi]
- Coming to Terms: Automatic Formation of Neologisms in HebrewMoran Mizrahi 0001, Stav Yardeni Seelig, Dafna Shahaf. 4918-4929 [doi]
- Dual Inference for Improving Language Understanding and GenerationShang-Yu Su, Yung-Sung Chuang, Yun-Nung Chen. 4930-4936 [doi]
- Joint Intent Detection and Entity Linking on Spatial Domain QueriesLei Zhang, Runze Wang, Jingbo Zhou, Jingsong Yu, Zhenhua Ling, Hui Xiong 0001. 4937-4947 [doi]
- iNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian LanguagesDivyanshu Kakwani, Anoop Kunchukuttan, Satish Golla, Gokul N. C., Avik Bhattacharyya, Mitesh M. Khapra, Pratyush Kumar. 4948-4961 [doi]
- Weakly-Supervised Modeling of Contextualized Event Embedding for Discourse RelationsI-Ta Lee, Maria Leonor Pacheco, Dan Goldwasser. 4962-4972 [doi]
- Enhancing Generalization in Natural Language Inference by SyntaxQi He, Han Wang, Yue Zhang. 4973-4978 [doi]