Abstract is missing.
- Explainable Inference Over Grounding-Abstract Chains for Science QuestionsMokanarangan Thayaparan, Marco Valentino, André Freitas. 1-12 [doi]
- LV-BERT: Exploiting Layer Variety for BERTWeihao Yu, Zihang Jiang, Fei Chen, Qibin Hou, Jiashi Feng. 13-27 [doi]
- Few-Shot Event Detection with Prototypical Amortized Conditional Random FieldXin Cong, Shiyao Cui, Bowen Yu 0002, Tingwen Liu, Yubin Wang, Bin Wang 0004. 28-40 [doi]
- LUX (Linguistic aspects Under eXamination): Discourse Analysis for Automatic Fake News ClassificationLucas Azevedo, Mathieu d'Aquin, Brian Davis 0001, Manel Zarrouk. 41-56 [doi]
- Diagnosing Transformers in Task-Oriented Semantic ParsingShrey Desai, Ahmed Aly. 57-62 [doi]
- Semantic Relation-aware Difference Representation Learning for Change CaptioningYunbin Tu, Tingting Yao, Liang Li, Jiedong Lou, Shengxiang Gao, Zhengtao Yu 0001, Chenggang Yan. 63-73 [doi]
- The Authors Matter: Understanding and Mitigating Implicit Bias in Deep Text ClassificationHaochen Liu, Wei Jin, Hamid Karimi, Zitao Liu, Jiliang Tang. 74-85 [doi]
- From What to Why: Improving Relation Extraction with Rationale GraphZhenyu Zhang 0006, Bowen Yu 0002, Xiaobo Shu, Mengge Xue, Tingwen Liu, Li Guo 0001. 86-95 [doi]
- More Parameters? No Thanks!Zeeshan Khan, Kartheek Akella, Vinay Namboodiri, C. V. Jawahar. 96-102 [doi]
- SyGNS: A Systematic Generalization Testbed Based on Natural Language SemanticsHitomi Yanaka, Koji Mineshima, Kentaro Inui. 103-119 [doi]
- Fully Non-autoregressive Neural Machine Translation: Tricks of the TradeJiatao Gu, Xiang Kong. 120-133 [doi]
- Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate SpeechWanzheng Zhu, Suma Bhat. 134-149 [doi]
- REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-trainingFangkai Jiao, Yangyang Guo, Yilin Niu, Feng Ji, Feng-Lin Li, Liqiang Nie. 150-163 [doi]
- CasEE: A Joint Learning Framework with Cascade Decoding for Overlapping Event ExtractionJiawei Sheng, Shu Guo, Bowen Yu 0002, Qian Li, Yiming Hei, Lihong Wang, Tingwen Liu, Hongbo Xu. 164-174 [doi]
- Discovering Topics in Long-tailed Corpora with Causal InterventionXiaobao Wu, Chunping Li, Yishu Miao. 175-185 [doi]
- More than just Frequency? Demasking Unsupervised Hypernymy Prediction MethodsThomas Bott, Dominik Schlechtweg, Sabine Schulte im Walde. 186-192 [doi]
- WikiTableT: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article SectionsMingda Chen, Sam Wiseman, Kevin Gimpel. 193-209 [doi]
- CoDesc: A Large Code-Description Parallel DatasetMasum Hasan, Tanveer Muttaqueen, Abdullah Al Ishtiaq, Kazi Sajeed Mehrab, Md. Mahim Anjum Haque, Tahmid Hasan, Wasi Uddin Ahmad, Anindya Iqbal, Rifat Shahriyar. 210-218 [doi]
- Deep Cognitive Reasoning Network for Multi-hop Question Answering over Knowledge GraphsJianyu Cai, Zhanqiu Zhang, Feng Wu, Jie Wang 0005. 219-229 [doi]
- GoG: Relation-aware Graph-over-Graph Network for Visual DialogFeilong Chen, Xiuyi Chen, Fandong Meng, Peng Li 0030, Jie Zhou 0016. 230-243 [doi]
- Joint Optimization of Tokenization and Downstream ModelTatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki. 244-255 [doi]
- How does Attention Affect the Model?Cheng Zhang, Qiuchi Li, Lingyu Hua, Dawei Song 0001. 256-268 [doi]
- Contrastive Attention for Automatic Chest X-ray Report GenerationFenglin Liu, Changchang Yin, Xian Wu, Shen Ge, Ping Zhang, Xu Sun 0001. 269-280 [doi]
- O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video CaptioningFenglin Liu, Xuancheng Ren, Xian Wu, Bang Yang, Shen Ge, Xu Sun 0001. 281-292 [doi]
- Better Chinese Sentence Segmentation with Reinforcement LearningSrivatsan Srinivasan, Chris Dyer. 293-302 [doi]
- Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-TuningBenjamin Minixhofer, Milan Gritta, Ignacio Iacobacci. 303-313 [doi]
- Empirical Error Modeling Improves Robustness of Noisy Neural Sequence LabelingMarcin Namysl, Sven Behnke, Joachim Köhler. 314-329 [doi]
- Spatial Dependency Parsing for Semi-Structured Document Information ExtractionWonseok Hwang, Jinyeong Yim, Seunghyun Park, Sohee Yang, Minjoon Seo. 330-343 [doi]
- Reader-Guided Passage Reranking for Open-Domain Question AnsweringYuning Mao, Pengcheng He, Xiaodong Liu, Yelong Shen, Jianfeng Gao, Jiawei Han 0001, Weizhu Chen. 344-350 [doi]
- Entity-Aware Abstractive Multi-Document SummarizationHao Zhou, Weidong Ren, Gongshen Liu, Bo Su, Wei Lu. 351-362 [doi]
- LenAtten: An Effective Length Controlling Unit For Text SummarizationZhongyi Yu, Zhenghao Wu, Hao Zheng, Zhe XuanYuan, Jefferson Fong, Weifeng Su. 363-370 [doi]
- XeroAlign: Zero-shot cross-lingual transformer alignmentMilan Gritta, Ignacio Iacobacci. 371-381 [doi]
- Using Word Embeddings to Analyze Teacher Evaluations: An Application to a Filipino Education Non-Profit OrganizationFrancesca Vera. 382-389 [doi]
- Relation Classification with Entity Type RestrictionShengfei Lyu, Huanhuan Chen. 390-395 [doi]
- Link Prediction on N-ary Relational Facts: A Graph-based ApproachQuan Wang, Haifeng Wang 0001, Yajuan Lyu, Yong Zhu. 396-407 [doi]
- GLGE: A New General Language Generation Evaluation BenchmarkDayiheng Liu, Yu Yan, Yeyun Gong, Weizhen Qi, Hang Zhang, Jian Jiao 0007, Weizhu Chen, Jie Fu, Linjun Shou, Ming Gong, Pengcheng Wang, Jiusheng Chen, Daxin Jiang, Jiancheng Lv 0001, Ruofei Zhang, Winnie Wu, Ming Zhou 0001, Nan Duan. 408-420 [doi]
- AMBERT: A Pre-trained Language Model with Multi-Grained TokenizationXinsong Zhang, Pengshuai Li, Hang Li. 421-435 [doi]
- Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue GenerationFeilong Chen, Fandong Meng, Xiuyi Chen, Peng Li 0030, Jie Zhou 0016. 436-446 [doi]
- Retrieve & Memorize: Dialog Policy Learning with Multi-Action MemoryYunhao Li, Yunyi Yang, Xiaojun Quan, Jianxing Yu. 447-459 [doi]
- Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for DomainsYunzhi Yao, Shaohan Huang, Wenhui Wang, Li Dong 0004, Furu Wei. 460-470 [doi]
- Decoupling Adversarial Training for Fair NLPXudong Han, Timothy Baldwin, Trevor Cohn. 471-477 [doi]
- GO FIGURE: A Meta Evaluation of Factuality in SummarizationSaadia Gabriel, Asli Celikyilmaz, Rahul Jha, Yejin Choi, Jianfeng Gao. 478-487 [doi]
- DNN-driven Gradual Machine Learning for Aspect-term Sentiment AnalysisMurtadha H. M. Ahmed, Qun Chen 0001, Yanyan Wang, Youcef Nafa, Zhanhuai Li, Tianyi Duan. 488-497 [doi]
- Error Detection in Large-Scale Natural Language Understanding Systems Using Transformer ModelsRakesh Chada, Pradeep Natarajan, Darshan Fofadiya, Prathap Ramachandra. 498-503 [doi]
- OutFlip: Generating Examples for Unknown Intent Detection with Natural Language AttackDonghyun Choi, Myeongcheol Shin, EungGyun Kim, Dong Ryeol Shin. 504-512 [doi]
- GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical ReasoningJiaqi Chen, Jianheng Tang, Jinghui Qin, Xiaodan Liang, Lingbo Liu, Eric P. Xing, Liang Lin. 513-523 [doi]
- SIRE: Separate Intra- and Inter-sentential Reasoning for Document-level Relation ExtractionShuang Zeng, Yuting Wu, Baobao Chang. 524-534 [doi]
- KGPool: Dynamic Knowledge Graph Context Selection for Relation ExtractionAbhishek Nadgeri, Anson Bastos, Kuldeep Singh 0001, Isaiah Onando Mulang, Johannes Hoffart, Saeedeh Shekarpour, Vijay Saraswat. 535-548 [doi]
- Better Combine Them Together! Integrating Syntactic Constituency and Dependency Representations for Semantic Role LabelingHao Fei 0001, Shengqiong Wu, Yafeng Ren, Fei Li, Donghong Ji. 549-559 [doi]
- Keep the Primary, Rewrite the Secondary: A Two-Stage Approach for Paraphrase GenerationYixuan Su, David Vandyke, Simon Baker, Yan Wang 0060, Nigel Collier. 560-569 [doi]
- Contrastive Fine-tuning Improves Robustness for Neural RankersXiaofei Ma, Cícero Nogueira dos Santos, Andrew O. Arnold. 570-582 [doi]
- Cross-Lingual Transfer in Zero-Shot Cross-Language Entity LinkingElliot Schumacher, James Mayfield, Mark Dredze. 583-595 [doi]
- TellMeWhy: A Dataset for Answering Why-Questions in NarrativesYash Kumar Lal, Nathanael Chambers, Raymond Mooney, Niranjan Balasubramanian. 596-610 [doi]
- Dialogue in the Wild: Learning from a Deployed Role-Playing Game with Humans and BotsKurt Shuster 0001, Jack Urbanek, Emily Dinan, Arthur Szlam, Jason Weston. 611-624 [doi]
- Deep Learning against COVID-19: Respiratory Insufficiency Detection in Brazilian Portuguese SpeechEdresson Casanova, Lucas Gris, Augusto Camargo, Daniel Da Silva, Murilo Gazzola, Ester C. Sabino, Anna Levin, Arnaldo Candido Jr., Sandra M. Aluísio, Marcelo Finger. 625-633 [doi]
- Benchmarking Robustness of Machine Reading Comprehension ModelsChenglei Si, Ziqing Yang, Yiming Cui, Wentao Ma, Ting Liu 0001, Shijin Wang 0001. 634-644 [doi]
- Improving BERT with Syntax-aware Local AttentionZhongli Li, Qingyu Zhou, Chao Li, Ke Xu, Yunbo Cao. 645-653 [doi]
- A Dialogue-based Information Extraction System for Medical Insurance AssessmentShuang Peng 0009, Mengdi Zhou, Minghui Yang, Haitao Mi, Shaosheng Cao, Zujie Wen, Teng Xu 0007, Hongbin Wang, Lei Liu. 654-663 [doi]
- Prediction or Comparison: Toward Interpretable Qualitative ReasoningMucheng Ren, Heyan Huang, Yang Gao 0016. 664-675 [doi]
- Boundary Detection with BERT for Span-level Emotion Cause AnalysisXiangju Li, Wei Gao 0001, Shi Feng, Yifei Zhang 0003, Daling Wang. 676-682 [doi]
- On Commonsense Cues in BERT for Solving Commonsense TasksLeyang Cui, Sijie Cheng, Yu Wu 0012, Yue Zhang 0004. 683-693 [doi]
- Weakly Supervised Pre-Training for Multi-Hop RetrieverYeon Seonwoo, Sang-Woo Lee, Ji-Hoon Kim, Jung-Woo Ha, Alice Oh. 694-704 [doi]
- Meet The Truth: Leverage Objective Facts and Subjective Views for Interpretable Rumor DetectionJiawen Li, Shiwen Ni, Hung-Yu Kao. 705-715 [doi]
- Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell CheckingHeng-Da Xu, Zhongli Li, Qingyu Zhou, Chao Li, Zizhen Wang, Yunbo Cao, Heyan Huang, Xian-Ling Mao. 716-728 [doi]
- TransSum: Translating Aspect and Sentiment Embeddings for Self-Supervised Opinion SummarizationKe Wang, Xiaojun Wan 0001. 729-742 [doi]
- Hashing based Efficient Inference for Image-Text MatchingRong-Cheng Tu, Lei Ji, Huaishao Luo, Botian Shi, Heyan Huang, Nan Duan, Xian-Ling Mao. 743-752 [doi]
- Can the Transformer Learn Nested Recursion with Symbol Masking?Jean-Philippe Bernardy, Adam Ek, Vladislav Maraev. 753-760 [doi]
- Rationalization through ConceptsDiego Antognini, Boi Faltings. 761-775 [doi]
- Parallel Attention Network with Sequence Matching for Video GroundingHao Zhang 0048, Aixin Sun, Wei Jing, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh. 776-790 [doi]
- MusicBERT: Symbolic Music Understanding with Large-Scale Pre-TrainingMingliang Zeng, Xu Tan 0003, Rui Wang, Zeqian Ju, Tao Qin, Tie-Yan Liu. 791-800 [doi]
- Evaluating the Efficacy of Summarization Evaluation across LanguagesFajri Koto, Jey Han Lau, Timothy Baldwin. 801-812 [doi]
- CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response GenerationChujie Zheng, Yong Liu, Wei Chen, Yongcai Leng, Minlie Huang. 813-824 [doi]
- UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase PredictionHuanqin Wu, Wei Liu, Lei Li, Dan Nie, Tao Chen, Feng Zhang, Di Wang. 825-835 [doi]
- As Good as New. How to Successfully Recycle English GPT-2 to Make Models for Other LanguagesWietse de Vries, Malvina Nissim. 836-846 [doi]
- Can Cognate Prediction Be Modelled as a Low-Resource Machine Translation Task?Clémentine Fourrier, Rachel Bawden, Benoît Sagot. 847-861 [doi]
- What if This Modified That? Syntactic Interventions with Counterfactual EmbeddingsMycal Tucker, Peng Qian, Roger Levy. 862-875 [doi]
- Investigating Text Simplification EvaluationLaura Vásquez-Rodríguez, Matthew Shardlow, Piotr Przybyla, Sophia Ananiadou. 876-882 [doi]
- COM2SENSE: A Commonsense Reasoning Benchmark with Complementary SentencesShikhar Singh, Nuan Wen, Yu Hou, Pegah Alipoormolabashi, Te-Lin Wu, Xuezhe Ma, Nanyun Peng. 883-898 [doi]
- Towards Knowledge-Grounded Counter Narrative Generation for Hate SpeechYi-Ling Chung, Serra Sinem Tekiroglu, Marco Guerini. 899-914 [doi]
- SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language IdentificationSara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Marcos Zampieri, Preslav Nakov. 915-928 [doi]
- RealFormer: Transformer Likes Residual AttentionRuining He, Anirudh Ravula, Bhargav Kanagal, Joshua Ainslie. 929-943 [doi]
- Promoting Graph Awareness in Linearized Graph-to-Text GenerationAlexander Miserlis Hoyle, Ana Marasovic, Noah A. Smith. 944-956 [doi]
- Predicting cross-linguistic adjective order with information gainWilliam Dyer, Richard Futrell, Zoey Liu, Gregory Scontras. 957-967 [doi]
- A Survey of Data Augmentation Approaches for NLPSteven Y. Feng, Varun Gangal, Jason Wei, Sarath Chandar, Soroush Vosoughi, Teruko Mitamura, Eduard H. Hovy. 968-988 [doi]
- Why Machine Reading Comprehension Models Learn Shortcuts?Yuxuan Lai, Chen Zhang, Yansong Feng, Quzhe Huang, Dongyan Zhao 0001. 989-1002 [doi]
- Handling Cross- and Out-of-Domain Samples in Thai Word SegmentationPeerat Limkonchotiwat, Wannaphong Phatthiyaphaibun, Raheem Sarwar, Ekapol Chuangsuwanich, Sarana Nutanong. 1003-1016 [doi]
- Sensei: Self-Supervised Sensor Name SegmentationJiaman Wu, Dezhi Hong, Rajesh E. Gupta, Jingbo Shang. 1017-1027 [doi]
- Frustratingly Simple Few-Shot Slot TaggingJianqiang Ma, Zeyu Yan, Chang Li, Yang Zhang. 1028-1033 [doi]
- Medical Code Assignment with Gated Convolution and Note-Code InteractionShaoxiong Ji, Shirui Pan, Pekka Marttinen. 1034-1043 [doi]
- Dynamic Semantic Graph Construction and Reasoning for Explainable Multi-hop Science Question AnsweringWeiwen Xu, HuiHui Zhang, Deng Cai 0002, Wai Lam. 1044-1056 [doi]
- Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot ConsistencyZekang Li, Jinchao Zhang, Zhengcong Fei, Yang Feng 0004, Jie Zhou 0016. 1057-1067 [doi]
- Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech TranslationShun-Po Chuang, Yung-Sung Chuang, Chih-Chiang Chang, Hung-yi Lee. 1068-1077 [doi]
- Code Summarization with Structure-induced TransformerHongqiu Wu, Hai Zhao, Min Zhang. 1078-1090 [doi]
- Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog SystemSihong Liu, Jinchao Zhang, Keqing He, Weiran Xu, Jie Zhou 0016. 1091-1102 [doi]
- Do Explanations Help Users Detect Errors in Open-Domain QA? An Evaluation of Spoken vs. Visual ExplanationsAna Valeria Gonzalez, Gagan Bansal, Angela Fan, Yashar Mehdad, Robin Jia, Srinivasan Iyer. 1103-1116 [doi]
- OntoEA: Ontology-guided Entity Alignment via Joint Knowledge Graph EmbeddingYuejia Xiang, Ziheng Zhang, Jiaoyan Chen, Xi Chen, Zhenxi Lin, Yefeng Zheng. 1117-1128 [doi]
- Learning Algebraic Recombination for Compositional GeneralizationChenyao Liu, Shengnan An, Zeqi Lin, Qian Liu, Bei Chen, Jian-Guang Lou, Lijie Wen, Nanning Zheng 0001, Dongmei Zhang. 1129-1144 [doi]
- Out of Order: How important is the sequential order of words in a sentence in Natural Language Understanding tasks?Thang M. Pham, Trung Bui, Long Mai, Anh Nguyen. 1145-1160 [doi]
- RevCore: Review-Augmented Conversational RecommendationYu Lu, Junwei Bao 0001, Yan Song, Zichen Ma, Shuguang Cui, Youzheng Wu, Xiaodong He 0002. 1161-1173 [doi]
- Awakening Latent Grounding from Pretrained Language Models for Semantic ParsingQian Liu, Dejian Yang, Jiahui Zhang, Jiaqi Guo, Bin Zhou, Jian-Guang Lou. 1174-1189 [doi]
- Enhancing Label Correlation Feedback in Multi-Label Text Classification via Multi-Task LearningXiming Zhang, Qian-Wen Zhang, Zhao Yan, Ruifang Liu, Yunbo Cao. 1190-1200 [doi]
- Fusing Context Into Knowledge Graph for Commonsense Question AnsweringYichong Xu, Chenguang Zhu, Ruochen Xu, Yang Liu, Michael Zeng, Xuedong Huang 0001. 1201-1207 [doi]
- Unsupervised Energy-based Adversarial Domain Adaptation for Cross-domain Text ClassificationHan Zou, Jianfei Yang, XiaoJian Wu. 1208-1218 [doi]
- Survival text regression for time-to-event prediction in conversationsChristine de Kock, Andreas Vlachos 0001. 1219-1229 [doi]
- Unsupervised Knowledge Selection for Dialogue GenerationXiuyi Chen, Feilong Chen, Fandong Meng, Peng Li 0030, Jie Zhou 0016. 1230-1244 [doi]
- Minimax and Neyman-Pearson Meta-Learning for Outlier LanguagesEdoardo Maria Ponti, Rahul Aralikatte, Disha Shrivastava, Siva Reddy, Anders Søgaard. 1245-1260 [doi]
- On-the-Fly Attention Modulation for Neural GenerationYue Dong, Chandra Bhagavatula, Ximing Lu, Jena D. Hwang, Antoine Bosselut, Jackie Chi Kit Cheung, Yejin Choi. 1261-1274 [doi]
- Grammar-Constrained Neural Semantic Parsing with LR ParsersArtur Baranowski, Nico Hochgeschwender. 1275-1279 [doi]
- Enhanced Metaphor Detection via Incorporation of External Knowledge Based on Linguistic TheoriesChang Su, Kechun Wu, Yijiang Chen. 1280-1287 [doi]
- Controlling Text Edition by Changing Answers of Specific QuestionsLei Sha, Patrick Hohenecker, Thomas Lukasiewicz. 1288-1299 [doi]
- Grammar-Based Patches Generation for Automated Program RepairYu Tang, Long Zhou, Ambrosio Blanco, Shujie Liu 0001, Furu Wei, Ming Zhou 0001, Muyun Yang. 1300-1305 [doi]
- Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation ExtractionTianyu Gao, Xu Han 0007, Yuzhuo Bai, Keyue Qiu, Zhiyu Xie, Yankai Lin, Zhiyuan Liu 0001, Peng Li 0030, Maosong Sun, Jie Zhou 0016. 1306-1318 [doi]
- GCRC: A New Challenging MRC Dataset from Gaokao Chinese for Explainable EvaluationHongye Tan, Xiaoyue Wang, Yu Ji, Ru Li, Xiaoli Li 0001, Zhiwei Hu, Yunxiao Zhao, Xiaoqi Han. 1319-1330 [doi]
- Zero-shot Label-Aware Event Trigger and Argument ClassificationHongming Zhang, Haoyu Wang, Dan Roth. 1331-1340 [doi]
- Incorporating Global Information in Local Attention for Knowledge Representation LearningYu Zhao 0019, Han Zhou, Ruobing Xie, Fuzhen Zhuang, Qing Li, Ji Liu. 1341-1351 [doi]
- Exploiting Position Bias for Robust Aspect Sentiment ClassificationFang Ma, Chen Zhang, Dawei Song 0001. 1352-1358 [doi]
- MRN: A Locally and Globally Mention-Based Reasoning Network for Document-Level Relation ExtractionJingye Li, Kang Xu, Fei Li, Hao Fei 0001, Yafeng Ren, Donghong Ji. 1359-1370 [doi]
- Adversary-Aware Rumor DetectionYun-Zhu Song, Yi-Syuan Chen, Yi-Ting Chang, Shao-Yu Weng, Hong-Han Shuai. 1371-1382 [doi]
- LICHEE: Improving Language Model Pre-training with Multi-grained TokenizationWeidong Guo, Mingjun Zhao, Lusheng Zhang, Di Niu, Jinwen Luo, Zhenhua Liu, Zhenyang Li, Jianbo Tang. 1383-1392 [doi]
- Detecting Hallucinated Content in Conditional Neural Sequence GenerationChunting Zhou, Graham Neubig, Jiatao Gu, Mona Diab, Francisco Guzmán, Luke Zettlemoyer, Marjan Ghazvininejad. 1393-1404 [doi]
- K-Adapter: Infusing Knowledge into Pre-Trained Models with AdaptersRuize Wang, Duyu Tang, Nan Duan, Zhongyu Wei, Xuanjing Huang, Jianshu Ji, Guihong Cao, Daxin Jiang, Ming Zhou 0001. 1405-1418 [doi]
- Global Attention Decoder for Chinese Spelling Error CorrectionZhao Guo, Yuan Ni, Keqiang Wang, Wei Zhu, Guotong Xie. 1419-1428 [doi]
- Jointly Identifying Rhetoric and Implicit Emotions via Multi-Task LearningXin Chen, Zhen Hai, Deyu Li, Suge Wang, Dian Wang. 1429-1434 [doi]
- Exploring the Role of Context in Utterance-level Emotion, Act and Intent Classification in Conversations: An Empirical StudyDeepanway Ghosal, Navonil Majumder, Rada Mihalcea, Soujanya Poria. 1435-1449 [doi]
- Encouraging Neural Machine Translation to Satisfy Terminology ConstraintsMelissa Ailem, Jingshu Liu, Raheel Qader. 1450-1455 [doi]
- BertGCN: Transductive Text Classification by Combining GNN and BERTYuxiao Lin, Yuxian Meng, Xiaofei Sun, Qinghong Han, Kun Kuang, Jiwei Li, Fei Wu. 1456-1462 [doi]
- Putting words into the system's mouth: A targeted attack on neural machine translation using monolingual data poisoningJun Wang, Chang Xu, Francisco Guzmán, Ahmed El-Kishky, Yuqing Tang, Benjamin I. P. Rubinstein, Trevor Cohn. 1463-1473 [doi]
- Semantic and Syntactic Enhanced Aspect Sentiment Triplet ExtractionZhexue Chen, Hong Huang 0001, Bang Liu, Xuanhua Shi, Hai Jin 0001. 1474-1483 [doi]
- UserAdapter: Few-Shot User Learning in Sentiment AnalysisWanjun Zhong, Duyu Tang, Jiahai Wang, Jian Yin 0001, Nan Duan. 1484-1488 [doi]
- PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health SupportHao Sun, Zhenru Lin, Chujie Zheng, Siyang Liu, Minlie Huang. 1489-1503 [doi]
- RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense KnowledgeBill Yuchen Lin, Ziyi Wu, Yichi Yang, Dong-Ho Lee, Xiang Ren 0001. 1504-1515 [doi]
- Learning to Generate Questions by Learning to Recover Answer-containing SentencesSeohyun Back, Akhil Kedia, Sai Chetan Chinthakindi, Haejun Lee, Jaegul Choo. 1516-1529 [doi]
- Learning Slice-Aware Representations with Mixture of AttentionsCheng Wang, Sungjin Lee, Sunghyun Park, Han Li, Young-Bum Kim, Ruhi Sarikaya. 1530-1536 [doi]
- Making Better Use of Bilingual Information for Cross-Lingual AMR ParsingYitao Cai, Zhe Lin, Xiaojun Wan 0001. 1537-1547 [doi]
- Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation ApproachZhe Lin, Xiaojun Wan 0001. 1548-1557 [doi]
- Few-shot Knowledge Graph-to-Text Generation with Pretrained Language ModelsJunyi Li, Tianyi Tang, Wayne Xin Zhao, Zhicheng Wei, Nicholas Jing Yuan, Ji-Rong Wen. 1558-1568 [doi]
- Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust FinetuningChenglei Si, Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu 0001, Yasheng Wang, Qun Liu, Maosong Sun. 1569-1576 [doi]
- NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style TransferFei Huang, Zikai Chen, Chen Henry Wu, Qihan Guo, Xiaoyan Zhu 0001, Minlie Huang. 1577-1590 [doi]
- HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge ManagementSilin Gao, Ryuichi Takanobu, Wei Peng, Qun Liu, Minlie Huang. 1591-1602 [doi]
- Target-oriented Fine-tuning for Zero-Resource Named Entity RecognitionYing Zhang, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou 0016. 1603-1615 [doi]
- BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial AttacksYannik Keller, Jan Mackensen, Steffen Eger. 1616-1629 [doi]
- Event Detection as Graph ParsingJianye Xie, Haotong Sun, Junsheng Zhou, Weiguang Qu, Xinyu Dai. 1630-1640 [doi]
- Toward Fully Exploiting Heterogeneous Corpus: A Decoupled Named Entity Recognition Model with Two-stage TrainingYun Hu, Yeshuang Zhu, Jinchao Zhang, Changwen Zheng, Jie Zhou 0016. 1641-1652 [doi]
- Discriminative Reasoning for Document-level Relation ExtractionWang Xu, Kehai Chen, Tiejun Zhao. 1653-1663 [doi]
- Meta-Learning Adversarial Domain Adaptation Network for Few-Shot Text ClassificationChengcheng Han, Zeqiu Fan, Dongxiang Zhang, Minghui Qiu, Ming Gao 0001, Aoying Zhou. 1664-1673 [doi]
- Documents Representation via Generalized Coupled Tensor Chain with the Rotation Group constraintIgor Vorona, Anh Huy Phan, Alexander Panchenko, Andrzej Cichocki. 1674-1684 [doi]
- Improving Unsupervised Extractive Summarization with Facet-Aware ModelingXinnian Liang, Shuangzhi Wu, Mu Li 0001, Zhoujun Li. 1685-1697 [doi]
- Improving Gradient-based Adversarial Training for Text Classification by Contrastive Learning and Auto-EncoderYao Qiu, Jinchao Zhang, Jie Zhou 0016. 1698-1707 [doi]
- Multi-Granularity Contrasting for Cross-Lingual Pre-TrainingShicheng Li, Pengcheng Yang, Fuli Luo, Jun Xie. 1708-1717 [doi]
- A Comparison between Pre-training and Large-scale Back-translation for Neural Machine TranslationDandan Huang, Kun Wang, Yue Zhang 0004. 1718-1732 [doi]
- Bi-Granularity Contrastive Learning for Post-Training in Few-Shot SceneRuikun Luo, Guanhuan Huang, Xiaojun Quan. 1733-1742 [doi]
- Fusing Label Embedding into BERT: An Efficient Improvement for Text ClassificationYijin Xiong, Yukun Feng, Hao Wu, Hidetaka Kamigaito, Manabu Okumura. 1743-1750 [doi]
- KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and CompletionJie Zhou 0016, Shengding Hu, Xin Lv, Cheng Yang 0002, Zhiyuan Liu 0001, Wei Xu, Jie Jiang, Juanzi Li, Maosong Sun. 1751-1763 [doi]
- A Query-Driven Topic ModelZheng Fang, Yulan He, Rob Procter. 1764-1777 [doi]
- How Reliable are Model Diagnostics?Vamsi Aribandi, Yi Tay, Donald Metzler. 1778-1785 [doi]
- Gaussian Process based Deep Dyna-Q approach for Dialogue Policy LearningGuanlin Wu, Wenqi Fang, Ji Wang, Jiang Cao, Weidong Bao, Yang Ping, Xiaomin Zhu, Zheng Wang. 1786-1795 [doi]
- CiteWorth: Cite-Worthiness Detection for Improved Scientific Document UnderstandingDustin Wright 0001, Isabelle Augenstein. 1796-1807 [doi]
- Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web TextsBarbara Plank. 1808-1815 [doi]
- Counter-Argument Generation by Attacking Weak PremisesMilad Alshomary, Shahbaz Syed, Arkajit Dhar, Martin Potthast, Henning Wachsmuth. 1816-1827 [doi]
- Alternated Training with Synthetic and Authentic Data for Neural Machine TranslationRui Jiao, Zonghan Yang, Maosong Sun, Yang Liu 0005. 1828-1834 [doi]
- Template-Based Named Entity Recognition Using BARTLeyang Cui, Yu Wu, Jian Liu, Sen Yang, Yue Zhang 0004. 1835-1845 [doi]
- "Does it Matter When I Think You Are Lying?" Improving Deception Detection by Integrating Interlocutor's Judgements in ConversationsHuang-Cheng Chou, Woan-Shiuan Chien, Da-Cheng Juan, Chi-Chun Lee. 1846-1860 [doi]
- High-Quality Dialogue Diversification by Intermittent Short Extension EnsemblesZhiwen Tang, Hrishikesh Kulkarni, Grace Hui Yang. 1861-1872 [doi]
- Structured Refinement for Sequential LabelingYiran Wang, Hiroyuki Shindo, Yuji Matsumoto 0001, Taro Watanabe. 1873-1884 [doi]
- End-to-End Construction of NLP Knowledge GraphIshani Mondal, Yufang Hou 0001, Charles Jochim. 1885-1895 [doi]
- Deciphering Implicit Hate: Evaluating Automated Detection Algorithms for Multimodal HateAustin Botelho, Scott A. Hale, Bertie Vidgen. 1896-1907 [doi]
- Studying the Evolution of Scientific Topics and their RelationshipsAna Sabina Uban, Cornelia Caragea, Liviu P. Dinu. 1908-1922 [doi]
- End-to-End Self-Debiasing Framework for Robust NLU TrainingAbbas Ghaddar, Phillippe Langlais, Mehdi Rezagholizadeh, Ahmad Rashid. 1923-1929 [doi]
- A Mixed-Method Design Approach for Empirically Based Selection of Unbiased Data AnnotatorsGautam Thakur, Janna Caspersen, Drahomira Herrmannova, Bryan Eaton, Jordan Burdette. 1930-1938 [doi]
- An Evaluation of Disentangled Representation Learning for TextsKrishnapriya Vishnubhotla, Graeme Hirst, Frank Rudzicz. 1939-1951 [doi]
- Injecting Knowledge Base Information into End-to-End Joint Entity and Relation Extraction and Coreference ResolutionSeverine Verlinden, Klim Zaporojets, Johannes Deleu, Thomas Demeester, Chris Develder. 1952-1957 [doi]
- Knowing More About Questions Can Help: Improving Calibration in Question AnsweringShujian Zhang, ChengYue Gong, Eunsol Choi. 1958-1970 [doi]
- Enhancing Metaphor Detection by Gloss-based InterpretationsHai Wan, Jinxia Lin, Jianfeng Du, Dawei Shen, Manrong Zhang. 1971-1981 [doi]
- Evaluating Word Embeddings with Categorical ModularitySílvia Casacuberta, Karina Halevy, Damián E. Blasi. 1982-1993 [doi]
- Attention-based Contextual Language Model Adaptation for Speech RecognitionRichard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe. 1994-2003 [doi]
- Annotation and Evaluation of Coreference Resolution in ScreenplaysSabyasachee Baruah, Sandeep Nallan Chakravarthula, Shrikanth Narayanan. 2004-2010 [doi]
- Exploring Cross-Lingual Transfer Learning with Unsupervised Machine TranslationChao Wang, Judith Gaspers, Quynh Ngoc Thi Do, Hui Jiang. 2011-2020 [doi]
- Pipeline Signed Japanese Translation Focusing on a Post-positional Particle Complement and Conjugation in a Low-resource SettingKen Yano, Akira Utsumi. 2021-2032 [doi]
- Language-Mediated, Object-Centric Representation LearningRuocheng Wang, Jiayuan Mao, Samuel Gershman, Jiajun Wu 0001. 2033-2046 [doi]
- Entheos: A Multimodal Dataset for Studying EnthusiasmCarla Viegas, Malihe Alikhani. 2047-2060 [doi]
- Are Rotten Apples Edible? Challenging Commonsense Inference Ability with ExceptionsNam Do, Ellie Pavlick. 2061-2073 [doi]
- GRICE: A Grammar-based Dataset for Recovering Implicature and Conversational rEasoningZilong Zheng, Shuwen Qiu, Lifeng Fan, Yixin Zhu, Song Chun Zhu. 2074-2085 [doi]
- RetroGAN: A Cyclic Post-Specialization System for Improving Out-of-Knowledge and Rare Word RepresentationsPedro Colon-Hernandez, Yida Xin, Henry Lieberman, Catherine Havasi, Cynthia Breazeal, Peter Chin. 2086-2095 [doi]
- Fusion: Towards Automated ICD Coding via Feature CompressionJunyu Luo, Cao Xiao, Lucas Glass, Jimeng Sun, Fenglong Ma. 2096-2101 [doi]
- Automatic Document Sketching: Generating Drafts from Analogous TextsZeqiu Wu, Michel Galley, Chris Brockett, Yizhe Zhang, Bill Dolan. 2102-2113 [doi]
- Trade the Event: Corporate Events Detection for News-Based Event-Driven TradingZhihan Zhou, Liqian Ma, Han Liu. 2114-2124 [doi]
- Language-based General Action Template for Reinforcement Learning AgentsRyosuke Kohita, Akifumi Wachi, Daiki Kimura, Subhajit Chaudhury, Michiaki Tatsubori, Asim Munawar. 2125-2139 [doi]
- MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained TransformersWenhui Wang, Hangbo Bao, Shaohan Huang, Li Dong 0004, Furu Wei. 2140-2151 [doi]
- Attending via both Fine-tuning and CompressingJie Zhou 0015, Yuanbin Wu, Qin Chen, Xuanjing Huang, Liang He 0001. 2152-2161 [doi]
- Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal StatementXinyu Zuo, Pengfei Cao, Yubo Chen 0001, Kang Liu 0001, Jun Zhao 0001, Weihua Peng, Yuguang Chen. 2162-2172 [doi]
- PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage RetrievalRuiyang Ren, Shangwen Lv, Yingqi Qu, Jing Liu 0022, Wayne Xin Zhao, Qiaoqiao She, Hua Wu 0003, Haifeng Wang 0001, Ji-Rong Wen. 2173-2183 [doi]
- Is Human Scoring the Best Criteria for Summary Evaluation?Oleg V. Vasilyev, John Bohannon. 2184-2191 [doi]
- Assessing Dialogue Systems with Distribution DistancesJiannan Xiang, Yahui Liu, Deng Cai 0002, Huayang Li, Defu Lian, Lemao Liu. 2192-2198 [doi]
- Neural Combinatory Constituency ParsingZhousi Chen, Longtu Zhang, Aizhan Imankulova, Mamoru Komachi. 2199-2213 [doi]
- Learning Shared Semantic Space for Speech-to-Text TranslationChi Han, Mingxuan Wang, Heng Ji, Lei Li 0005. 2214-2225 [doi]
- Empowering Language Understanding with Counterfactual ReasoningFuli Feng, Jizhi Zhang, Xiangnan He 0001, Hanwang Zhang, Tat-Seng Chua. 2226-2236 [doi]
- Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and ResourcesTaolin Zhang, Chengyu Wang 0001, Minghui Qiu, Bite Yang, Zerui Cai, Xiaofeng He, Jun Huang 0007. 2237-2249 [doi]
- Correcting Chinese Spelling Errors with Phonetic Pre-trainingRuiqing Zhang, Chao Pang, Chuanqiang Zhang, Shuohuan Wang, Zhongjun He, Yu Sun, Hua Wu 0003, Haifeng Wang 0001. 2250-2261 [doi]
- Multi-Lingual Question Generation with Language Agnostic Language ModelBingning Wang, Ting Yao, Weipeng Chen, Jingfang Xu, Xiaochuan Wang. 2262-2272 [doi]
- Structure-Aware Pre-Training for Table-to-Text GenerationXinyu Xing, Xiaojun Wan 0001. 2273-2278 [doi]
- On the Interplay Between Fine-tuning and Composition in TransformersLang Yu, Allyson Ettinger. 2279-2293 [doi]
- Lifelong Learning of Topics and Domain-Specific Word EmbeddingsXiaorui Qin, Yuyin Lu, Yufu Chen, Yanghui Rao. 2294-2309 [doi]
- Leveraging Argumentation Knowledge Graph for Interactive Argument Pair IdentificationJian Yuan, Zhongyu Wei, Donghua Zhao, Qi Zhang, Changjian Jiang. 2310-2319 [doi]
- A Multi-Task Learning Framework for Multi-Target Stance DetectionYingjie Li, Cornelia Caragea. 2320-2326 [doi]
- Confidence-Aware Scheduled Sampling for Neural Machine TranslationYijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou 0016. 2327-2337 [doi]
- MA-BERT: Learning Representation by Incorporating Multi-Attribute Knowledge in TransformersYou Zhang, Jin Wang, Liang-Chih Yu, Xuejie Zhang 0002. 2338-2343 [doi]
- A Closer Look into the Robustness of Neural Dependency Parsers Using Better Adversarial ExamplesYuxuan Wang 0001, Wanxiang Che, Ivan Titov, Shay B. Cohen, Zhi-Lei Zhao, Ting Liu 0001. 2344-2354 [doi]
- P-Stance: A Large Dataset for Stance Detection in Political DomainYingjie Li, Tiberiu Sosea, Aditya Sawant, Ajith Jayaraman Nair, Diana Inkpen, Cornelia Caragea. 2355-2365 [doi]
- WIND: Weighting Instances Differentially for Model-Agnostic Domain AdaptationXiang Chen, Yue Cao 0006, Xiaojun Wan 0001. 2366-2376 [doi]
- DocOIE: A Document-level Context-Aware Dataset for OpenIEKuicai Dong, Yilin Zhao, Aixin Sun, Jung-Jae Kim 0001, Xiaoli Li 0001. 2377-2389 [doi]
- Event Extraction from Historical Texts: A New Dataset for Black RebellionsViet Lai, Minh Van Nguyen, Heidi Kaufman, Thien Huu Nguyen. 2390-2400 [doi]
- Zero-shot Medical Entity Retrieval without Annotation: Learning From Rich Knowledge Graph SemanticsLuyang Kong, Christopher Winestock, Parminder Bhatia. 2401-2405 [doi]
- CONDA: a CONtextual Dual-Annotated dataset for in-game toxicity understanding and detectionHenry Weld, Guanghao Huang, Jean Lee, Tongshu Zhang, Kunze Wang, Xinghong Guo, Siqu Long, Josiah Poon, Soyeon Caren Han. 2406-2416 [doi]
- Adaptive Knowledge-Enhanced Bayesian Meta-Learning for Few-shot Event DetectionShirong Shen, Tongtong Wu, Guilin Qi, Yuan-Fang Li, Gholamreza Haffari, Sheng Bi. 2417-2429 [doi]
- Stylized Story Generation with Style-Guided PlanningXiangzhe Kong, Jialiang Huang, Ziquan Tung, Jian Guan, Minlie Huang. 2430-2436 [doi]
- Dynamic Connected Networks for Chinese Spelling CheckBaoxin Wang, Wanxiang Che, Dayong Wu, Shijin Wang 0001, Guoping Hu, Ting Liu 0001. 2437-2446 [doi]
- A Multi-Level Attention Model for Evidence-Based Fact CheckingCanasai Kruengkrai, Junichi Yamagishi, Xin Wang. 2447-2460 [doi]
- RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking TransformerXingshan Zeng, Liangyou Li, Qun Liu. 2461-2474 [doi]
- Training ELECTRA Augmented with Multi-word SelectionJiaming Shen, Jialu Liu, Tianqi Liu, Cong Yu 0001, Jiawei Han 0001. 2475-2486 [doi]
- REAM$\sharp$: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog GenerationJun Gao, Wei Bi, Ruifeng Xu, Shuming Shi 0001. 2487-2500 [doi]
- Relation Extraction with Type-aware Map Memories of Word DependenciesGuimin Chen, Yuanhe Tian, Yan Song, Xiang Wan. 2501-2512 [doi]
- PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum LearningSiqi Bao, Huang He, Fan Wang, Hua Wu 0003, Haifeng Wang 0001, WenQuan Wu, Zhen Guo, Zhibin Liu, Xinchao Xu. 2513-2525 [doi]
- JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge GraphsPei Ke, Haozhe Ji, Yu Ran, Xin Cui, Liwei Wang 0009, Linfeng Song, Xiaoyan Zhu 0001, Minlie Huang. 2526-2538 [doi]
- AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text TranslationWuwei Huang, Dexin Wang, Deyi Xiong. 2539-2545 [doi]
- OKGIT: Open Knowledge Graph Link Prediction with Implicit TypesChandrahas, Partha P. Talukdar. 2546-2559 [doi]
- Multimodal Fusion with Co-Attention Networks for Fake News DetectionYang Wu, Pengwei Zhan, Yunjian Zhang, Liming Wang, Zhen Xu. 2560-2569 [doi]
- Joint Multi-Decoder Framework with Hierarchical Pointer Network for Frame Semantic ParsingXudong Chen, Ce Zheng, Baobao Chang. 2570-2578 [doi]
- H-FND: Hierarchical False-Negative Denoising for Distant Supervision Relation ExtractionJhih-wei Chen, Tsu-Jui Fu, Chen-Kang Lee, Wei-Yun Ma. 2579-2593 [doi]
- GEM: A General Evaluation Benchmark for Multimodal TasksLin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti, Arun Sacheti. 2594-2603 [doi]
- Graph Relational Topic Model with Higher-order Graph Attention Auto-encodersQianqian Xie, Jimin Huang, Pan Du, Min Peng. 2604-2613 [doi]
- Paths to Relation Extraction through Semantic StructureJonathan Yellin, Omri Abend. 2614-2626 [doi]
- Dynamic and Multi-Channel Graph Convolutional Networks for Aspect-Based Sentiment AnalysisShiguan Pang, Yun Xue, Zehao Yan, Weihao Huang, Jinhui Feng. 2627-2636 [doi]
- Automatic Text Simplification for Social Good: Progress and ChallengesSanja Stajner. 2637-2652 [doi]
- A Neural Edge-Editing Approach for Document-Level Relation Graph ExtractionKohei Makino, Makoto Miwa, Yutaka Sasaki. 2653-2662 [doi]
- Dialogue-oriented Pre-trainingYi Xu, Hai Zhao. 2663-2673 [doi]
- GrantRel: Grant Information Extraction via Joint Entity and Relation ExtractionJunyi Bian, Li Huang, Xiaodi Huang, Hong Zhou, Shanfeng Zhu. 2674-2685 [doi]
- Enhancing Language Generation with Effective Checkpoints of Pre-trained Language ModelJeonghyeok Park, Hai Zhao. 2686-2694 [doi]
- Making Flexible Use of Subtasks: A Multiplex Interaction Network for Unified Aspect-based Sentiment AnalysisGuoxin Yu, Xiang Ao, Ling Luo, Min Yang, Xiaofei Sun, Jiwei Li, Qing He 0003. 2695-2705 [doi]
- Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine TranslationZihan Liu, Genta Indra Winata, Pascale Fung. 2706-2718 [doi]
- Transformer-Exclusive Cross-Modal Representation for Vision and LanguageAndrew Shin, Takuya Narihira. 2719-2725 [doi]
- Two Parents, One Child: Dual Transfer for Low-Resource Neural Machine TranslationMeng Zhang, Liangyou Li, Qun Liu. 2726-2738 [doi]
- Contrastive Aligned Joint Learning for Multilingual SummarizationDanqing Wang, Jiaze Chen, Hao Zhou 0012, Xipeng Qiu, Lei Li 0005. 2739-2750 [doi]
- When Time Makes Sense: A Historically-Aware Approach to Targeted Sense DisambiguationKaspar Beelen, Federico Nanni, Mariona Coll Ardanuy, Kasra Hosseini, Giorgia Tolfo, Barbara McGillivray. 2751-2761 [doi]
- Understanding Feature Focus in Multitask Settings for Lexico-semantic Relation IdentificationHoussam Akhmouch, Gaël Dias, Jose G. Moreno. 2762-2772 [doi]
- Don't Miss the Labels: Label-semantic Augmented Meta-Learner for Few-Shot Text ClassificationQiaoyang Luo, Lingqiao Liu, Yuhao Lin, Wei Zhang. 2773-2782 [doi]
- Detecting Harmful Memes and Their TargetsShraman Pramanick, Dimitar Dimitrov, Rituparna Mukherjee, Shivam Sharma, Md. Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty 0002. 2783-2796 [doi]
- Progressive Multi-Granularity Training for Non-Autoregressive TranslationLiang Ding, Longyue Wang, Xuebo Liu 0002, Derek F. Wong, Dacheng Tao, Zhaopeng Tu. 2797-2803 [doi]
- ZmBART: An Unsupervised Cross-lingual Transfer Framework for Language GenerationKaushal Kumar Maurya, Maunendra Sankar Desarkar, Yoshinobu Kano, Kumari Deepshikha. 2804-2818 [doi]
- HacRED: A Large-Scale Relation Extraction Dataset Toward Hard Cases in Practical ApplicationsQiao Cheng 0005, Juntao Liu, Xiaoye Qu, Jin Zhao, Jiaqing Liang, Zhefeng Wang, Baoxing Huai, Nicholas Jing Yuan, Yanghua Xiao. 2819-2831 [doi]
- Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?Zae Myung Kim, Laurent Besacier, Vassilina Nikoulina, Didier Schwab. 2832-2841 [doi]
- Learning Sequential and Structural Information for Source Code SummarizationYunseok Choi, JinYeong Bak, CheolWon Na, Jee-Hyong Lee. 2842-2851 [doi]
- Energy-based Unknown Intent Detection with Data ManipulationYawen Ouyang, Jiasheng Ye, Yu Chen, Xinyu Dai, Shujian Huang, Jiajun Chen. 2852-2861 [doi]
- Automatic Rephrasing of Transcripts-based Action ItemsAmir Cohen, Amir Kantor, Sagi Hilleli, Eyal Kolman. 2862-2873 [doi]
- MergeDistill: Merging Language Models using Pre-trained DistillationSimran Khanuja, Melvin Johnson, Partha Talukdar. 2874-2887 [doi]
- On Sparsifying Encoder Outputs in Sequence-to-Sequence ModelsBiao Zhang, Ivan Titov, Rico Sennrich. 2888-2900 [doi]
- FrameNet-assisted Noun Compound InterpretationGirishkumar Ponkiya, Diptesh Kanojia, Pushpak Bhattacharyya, Girish Keshav Palshikar. 2901-2911 [doi]
- Hypernym Discovery via a Recurrent Mapping ModelYuhang Bai, Richong Zhang, Fanshuang Kong, Junfan Chen, Yongyi Mao. 2912-2921 [doi]
- Modeling the Influence of Verb Aspect on the Activation of Typical Event Locations with BERTWon-Ik Cho, Emmanuele Chersoni, Yu-Yin Hsu, Chu-Ren Huang. 2922-2929 [doi]
- On the Interaction of Belief Bias and ExplanationsAna Valeria Gonzalez, Anna Rogers, Anders Søgaard. 2930-2942 [doi]
- Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon InductionJinpeng Zhang, Baijun Ji, Nini Xiao, Xiangyu Duan, Min Zhang 0005, Yangbin Shi, Weihua Luo. 2943-2955 [doi]
- Exploring Unsupervised Pretraining Objectives for Machine TranslationChristos Baziotis, Ivan Titov, Alexandra Birch, Barry Haddow. 2956-2971 [doi]
- Knowledge-Grounded Dialogue Generation with Term-level De-noisingWen Zheng, Natasa Milic-Frayling, Ke Zhou. 2972-2983 [doi]
- Inspecting the concept knowledge graph encoded by modern language modelsCarlos Aspillaga, Marcelo Mendoza, Alvaro Soto. 2984-3000 [doi]
- Language Tags Matter for Zero-Shot Neural Machine TranslationLiwei Wu, Shanbo Cheng, Mingxuan Wang, Lei Li 0005. 3001-3007 [doi]
- Latent Reasoning for Low-Resource Question GenerationXinting Huang, Jianzhong Qi 0001, Yu Sun 0021, Rui Zhang 0003. 3008-3022 [doi]
- Probing Pre-Trained Language Models for Disease KnowledgeIsraa Alghanmi, Luis Espinosa Anke, Steven Schockaert. 3023-3033 [doi]
- AugVic: Exploiting BiText Vicinity for Low-Resource NMTTasnim Mohiuddin, M. Saiful Bari, Shafiq R. Joty. 3034-3045 [doi]
- Provably Secure Generative Linguistic SteganographySi-yu Zhang, Zhongliang Yang, Jinshuai Yang, Yongfeng Huang 0001. 3046-3055 [doi]
- Retrieval Enhanced Model for Commonsense GenerationHan Wang, Yang Liu, Chenguang Zhu, Linjun Shou, Ming Gong, Yichong Xu, Michael Zeng. 3056-3062 [doi]
- Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQLZhi Chen 0006, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu, Kai Yu 0004. 3063-3074 [doi]
- Adjacency List Oriented Relational Fact Extraction via Adaptive Multi-task LearningFubang Zhao, Zhuoren Jiang, Yangyang Kang, Changlong Sun, Xiaozhong Liu. 3075-3087 [doi]
- Self-Supervised Document Similarity Ranking via Contextualized Language Models and Hierarchical InferenceDvir Ginzburg, Itzik Malkiel, Oren Barkan, Avi Caciularu, Noam Koenigstein. 3088-3098 [doi]
- How Good Is NLP? A Sober Look at NLP Tasks through the Lens of Social ImpactZhijing Jin, Geeticka Chauhan, Brian Tse, Mrinmaya Sachan, Rada Mihalcea. 3099-3113 [doi]
- IgSEG: Image-guided Story Ending GenerationQingbao Huang, Chuan Huang, Linzhang Mo, Jielong Wei, Yi Cai, Ho-Fung Leung, Qing Li 0001. 3114-3123 [doi]
- Improve Query Focused Abstractive Summarization by Incorporating Answer RelevanceDan Su, Tiezheng Yu, Pascale Fung. 3124-3131 [doi]
- Learning a Reversible Embedding Mapping using Bi-Directional Manifold AlignmentAshwinkumar Ganesan, Francis Ferraro, Tim Oates. 3132-3139 [doi]
- Probabilistic Graph Reasoning for Natural Proof GenerationChangzhi Sun, Xinbo Zhang, Jiangjie Chen, Chun Gan, Yuanbin Wu, Jiaze Chen, Hao Zhou 0012, Lei Li 0005. 3140-3151 [doi]
- Enhancing Zero-shot and Few-shot Stance Detection with Commonsense Knowledge GraphRui Liu, Zheng Lin, Yutong Tan, Weiping Wang. 3152-3157 [doi]
- Dialogue Graph Modeling for Conversational Machine ReadingSiru Ouyang, Zhuosheng Zhang 0001, Hai Zhao. 3158-3169 [doi]
- IndoCollex: A Testbed for Morphological Transformation of Indonesian Word ColloquialismHaryo Akbarianto Wibowo, Made Nindyatama Nityasya, Afra Feyza Akyürek, Suci Fitriany, Alham Fikri Aji, Radityo Eko Prasojo, Derry Tanti Wijaya. 3170-3183 [doi]
- Manifold Adversarial Augmentation for Neural Machine TranslationGuandan Chen, Kai Fan, Kaibo Zhang, Boxing Chen, Zhongqiang Huang. 3184-3189 [doi]
- Learning to Bridge Metric Spaces: Few-shot Joint Learning of Intent Detection and Slot FillingYutai Hou, Yongkui Lai, Cheng Chen, Wanxiang Che, Ting Liu 0001. 3190-3200 [doi]
- Insertion-based Tree DecodingDenis Lukovnikov, Asja Fischer. 3201-3213 [doi]
- Is the Lottery Fair? Evaluating Winning Tickets Across DemographicsVictor Petrén Bach Hansen, Anders Søgaard. 3214-3224 [doi]
- SSMix: Saliency-Based Span Mixup for Text ClassificationSoyoung Yoon, Gyuwan Kim, Kyumin Park. 3225-3234 [doi]
- Detecting Bot-Generated Text by Characterizing Linguistic Accommodation in Human-Bot InteractionsParas Bhatt, Anthony Rios. 3235-3247 [doi]
- Defending Pre-trained Language Models from Adversarial Word Substitution Without Performance SacrificeRongzhou Bao, Jiayi Wang, Hai Zhao. 3248-3258 [doi]
- BERT-Proof Syntactic Structures: Investigating Errors in Discontinuous Constituency ParsingMaximin Coavoux. 3259-3272 [doi]
- DoT: An efficient Double Transformer for NLP tasks with tablesSyrine Krichene, Thomas Müller 0009, Julian Eisenschlos. 3273-3283 [doi]
- Grammatical Error Correction as GAN-like Sequence LabelingKevin Parnow, Zuchao Li, Hai Zhao. 3284-3290 [doi]
- Neural Entity Recognition with Gazetteer based FusionQing Sun, Parminder Bhatia. 3291-3295 [doi]
- Hyperbolic Temporal Knowledge Graph Embeddings with Relational and Time CurvaturesSebastien Montella, Lina Maria Rojas-Barahona, Johannes Heinecke. 3296-3308 [doi]
- Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question AnsweringAditya Gupta, Jiacheng Xu, Shyam Upadhyay, Diyi Yang, Manaal Faruqui. 3309-3319 [doi]
- Does Robustness Improve Fairness? Approaching Fairness with Word Substitution Robustness Methods for Text ClassificationYada Pruksachatkun, Satyapriya Krishna, Jwala Dhamala, Rahul Gupta, Kai-Wei Chang. 3320-3331 [doi]
- A Joint Model for Structure-based News Genre Classification with Application to Text SummarizationZeyu Dai, Ruihong Huang. 3332-3342 [doi]
- Representing Syntax and Composition with Geometric TransformationsLorenzo Bertolini, Julie Weeds, David J. Weir, Qiwei Peng. 3343-3353 [doi]
- Figurative Language in Recognizing Textual EntailmentTuhin Chakrabarty, Debanjan Ghosh, Adam Poliak, Smaranda Muresan. 3354-3361 [doi]
- To Point or Not to Point: Understanding How Abstractive Summarizers Paraphrase TextMatt Wilber, William Timkey, Marten Van Schijndel. 3362-3376 [doi]
- AgreeSum: Agreement-Oriented Multi-Document SummarizationRichard Yuanzhe Pang, Ádám Dániel Lelkes, Vinh Q. Tran 0002, Cong Yu 0001. 3377-3391 [doi]
- BERT Busters: Outlier Dimensions that Disrupt TransformersOlga Kovaleva, Saurabh Kulshreshtha, Anna Rogers, Anna Rumshisky. 3392-3405 [doi]
- "We will Reduce Taxes" - Identifying Election Pledges with Language ModelsTommaso Fornaciari, Dirk Hovy, Elin Naurin, Julia Runeson, Robert Thomson, Pankaj Adhikari. 3406-3419 [doi]
- WeaQA: Weak Supervision via Captions for Visual Question AnsweringPratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral. 3420-3435 [doi]
- How well do you know your summarization datasets?Priyam Tejaswin, Dhruv Naik, Pengfei Liu. 3436-3449 [doi]
- Multilingual Translation from Denoising Pre-TrainingYuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan. 3450-3466 [doi]
- Annotations Matter: Leveraging Multi-task Learning to Parse UD and SUDZeeshan Ali Sayyed, Daniel Dakota. 3467-3481 [doi]
- Generating Informative Conclusions for Argumentative TextsShahbaz Syed, Khalid Al Khatib, Milad Alshomary, Henning Wachsmuth, Martin Potthast. 3482-3493 [doi]
- Substructure Substitution: Structured Data Augmentation for NLPHaoyue Shi, Karen Livescu, Kevin Gimpel. 3494-3508 [doi]
- Towards Protecting Vital Healthcare Programs by Extracting Actionable Knowledge from PolicyVanessa López, Nagesh Yadav, Gabriele Picco, Inge Vejsbjerg, Eoin Carrol, Seamus Brady, Marco Luca Sbodio, Lam Thanh Hoang, Miao Wei, John Segrave-Daly. 3509-3521 [doi]
- Not Far Away, Not So Close: Sample Efficient Nearest Neighbour Data Augmentation via MiniMaxEhsan Kamalloo, Mehdi Rezagholizadeh, Peyman Passban, Ali Ghodsi 0001. 3522-3533 [doi]
- It's All in the Heads: Using Attention Heads as a Baseline for Cross-Lingual Transfer in Commonsense ReasoningAlexey Tikhonov, Max Ryabinin. 3534-3546 [doi]
- Biomedical Interpretable Entity RepresentationsDiego García-Olano, Yasumasa Onoe, Ioana Baldini, Joydeep Ghosh, Byron C. Wallace, Kush R. Varshney. 3547-3561 [doi]
- Learning Robust Latent Representations for Controllable Speech SynthesisShakti Kumar, Jithin Pradeep, Hussain Zaidi. 3562-3575 [doi]
- How to Split: the Effect of Word Segmentation on Gender Bias in Speech TranslationMarco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi. 3576-3589 [doi]
- On the Ethical Limits of Natural Language Processing on Legal TextDimitrios Tsarapatsanis, Nikolaos Aletras. 3590-3599 [doi]
- An Exploratory Analysis of the Relation between Offensive Language and Mental HealthAna-Maria Bucur, Marcos Zampieri, Liviu P. Dinu. 3600-3606 [doi]
- Transforming Term Extraction: Transformer-Based Approaches to Multilingual Term Extraction Across DomainsChristian Lang, Lennart Wachowiak, Barbara Heinisch, Dagmar Gromann. 3607-3620 [doi]
- ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural LanguageOyvind Tafjord, Bhavana Dalvi, Peter Clark. 3621-3634 [doi]
- Probing Image-Language Transformers for Verb UnderstandingLisa Anne Hendricks, Aida Nematzadeh. 3635-3644 [doi]
- Implications of Using Internet Sting Corpora to Approximate Underage VictimsTatiana R. Ringenberg, Kathryn C. Seigfried-Spellar, Julia Rayz. 3645-3656 [doi]
- Detecting Domain Polarity-Changes of Words in a Sentiment LexiconShuai Wang 0020, Guangyi Lv, Sahisnu Mazumder, Bing Liu 0001. 3657-3668 [doi]
- Analyzing Online Political AdvertisementsDanae Sanchez Villegas, Saeid Mokaram, Nikolaos Aletras. 3669-3680 [doi]
- Do Language Models Perform Generalizable Commonsense Inference?PeiFeng Wang, Filip Ilievski, Muhao Chen, Xiang Ren 0001. 3681-3688 [doi]
- Probing Multi-modal Machine Translation with Pre-trained Language ModelYawei Kong, Kai Fan. 3689-3699 [doi]
- The interplay between language similarity and script on a novel multi-layer Algerian dialect corpusSamia Touileb, Jeremy Barnes. 3700-3712 [doi]
- Few-Shot Upsampling for Protest Size DetectionAndrew Halterman, Benjamin J. Radford. 3713-3720 [doi]
- Modeling the Unigram DistributionIrene Nikkarinen, Tiago Pimentel, Damián E. Blasi, Ryan Cotterell. 3721-3729 [doi]
- On the Lack of Robust Interpretability of Neural Text ClassifiersMuhammad Bilal Zafar, Michele Donini, Dylan Slack, Cédric Archambeau, Sanjiv Das, Krishnaram Kenthapadi. 3730-3740 [doi]
- Multimodal Graph-based Transformer Framework for Biomedical Relation ExtractionSriram Pingali, Shweta Yadav, Pratik Dutta, Sriparna Saha 0001. 3741-3747 [doi]
- Summary Grounded Conversation GenerationR. Chulaka Gunasekara, Guy Feigenblat, Benjamin Sznajder, Sachindra Joshi, David Konopnicki. 3748-3756 [doi]
- A Non-Autoregressive Edit-Based Approach to Controllable Text SimplificationSweta Agrawal, Weijia Xu, Marine Carpuat. 3757-3769 [doi]
- Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language InferenceHai Hu, He Zhou, Zuoyu Tian, Yiwen Zhang, Yina Patterson, Yanting Li, Yixin Nie, Kyle Richardson 0001. 3770-3785 [doi]
- Using surprisal and fMRI to map the neural bases of broad and local contextual prediction during natural language comprehensionShohini Bhattasali, Philip Resnik. 3786-3798 [doi]
- Assessing the Syntactic Capabilities of Transformer-based Multilingual Language ModelsLaura Pérez-Mayos, Alba Táboas García, Simon Mille, Leo Wanner. 3799-3812 [doi]
- Are Larger Pretrained Language Models Uniformly Better? Comparing Performance at the Instance LevelRuiqi Zhong, Dhruba Ghosh, Dan Klein, Jacob Steinhardt. 3813-3827 [doi]
- Named Entity Recognition through Deep Representation Learning and Weak SupervisionJerrod Parker, Shi Yu. 3828-3839 [doi]
- Explaining NLP Models via Minimal Contrastive Editing (MiCE)Alexis Ross, Ana Marasovic, Matthew E. Peters. 3840-3852 [doi]
- Differential Privacy for Text Analytics via Natural Text SanitizationXiang Yue, Minxin Du, Tianhao Wang 0016, Yaliang Li, Huan Sun, Sherman S. M. Chow. 3853-3866 [doi]
- Synthesizing Adversarial Negative Responses for Robust Response Ranking and EvaluationPrakhar Gupta, Yulia Tsvetkov, Jeffrey P. Bigham. 3867-3883 [doi]
- Leveraging Abstract Meaning Representation for Knowledge Base Question AnsweringPavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander G. Gray, Ramón Fernandez Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li 0001, Francois P. S. Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa 0001, Revanth Gangi Reddy, Ryan Riegel, Gaetano Rossiello, Udit Sharma, G. P. Shrivatsa Bhargav, Mo Yu. 3884-3894 [doi]
- On the Gap between Adoption and Understanding in NLPFederico Bianchi, Dirk Hovy. 3895-3901 [doi]
- Learning Disentangled Latent Topics for Twitter Rumour Veracity ClassificationJohn Dougrez-Lewis, Maria Liakata, Elena Kochkina, Yulan He. 3902-3908 [doi]
- Perceptual Models of Machine-Edited TextElizabeth M. Merkhofer, Monica-Ann Mendoza, Rebecca Marvin, John C. Henderson. 3909-3920 [doi]
- Scaling Within Document Coreference to Long TextsRaghuveer Thirukovalluru, Nicholas Monath, Kumar Shridhar, Manzil Zaheer, Mrinmaya Sachan, Andrew McCallum. 3921-3931 [doi]
- LEWIS: Levenshtein Editing for Unsupervised Text Style TransferMachel Reid, Victor Zhong. 3932-3944 [doi]
- Constructing Flow Graphs from Procedural Cybersecurity TextsKuntal Kumar Pal, Kazuaki Kashihara, Pratyay Banerjee, Swaroop Mishra, Ruoyu Wang 0001, Chitta Baral. 3945-3957 [doi]
- Cluster-Former: Clustering-based Sparse Transformer for Question AnsweringShuohang Wang, Luowei Zhou, Zhe Gan, Yen-Chun Chen 0001, Yuwei Fang, Siqi Sun, Yu Cheng 0001, Jingjing Liu 0001. 3958-3968 [doi]
- Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic PriorsRamy Eskander, Cass Lowry, Sujay Khandagale, Francesca Callejas, Judith Klavans, Maria Polinsky, Smaranda Muresan. 3969-3974 [doi]
- Multi-Task Learning and Adapted Knowledge Models for Emotion-Cause ExtractionElsbeth Turcan, Shuai Wang, Rishita Anubhai, Kasturi Bhattacharjee, Yaser Al-Onaizan, Smaranda Muresan. 3975-3989 [doi]
- The Utility and Interplay of Gazetteers and Entity Segmentation for Named Entity Recognition in EnglishOshin Agarwal, Ani Nenkova. 3990-4002 [doi]
- On the Cost-Effectiveness of Stacking of Neural and Non-Neural Methods for Text Classification: Scenarios and Performance PredictionChristian Gomes, Marcos André Gonçalves, Leonardo Rocha 0001, Sérgio D. Canuto. 4003-4014 [doi]
- Unsupervised Domain Adaptation for Event Detection using Domain-specific AdaptersNghia Trung Ngo, Duy Phung, Thien Huu Nguyen. 4015-4025 [doi]
- Predicting in-hospital mortality by combining clinical notes with time-series dataIman Deznabi, Mohit Iyyer, Madalina Fiterau. 4026-4031 [doi]
- Sequence Models for Computational Etymology of BorrowingsWinston Wu, Kevin Duh, David Yarowsky. 4032-4037 [doi]
- Learning Contextualized Knowledge Structures for Commonsense ReasoningJun Yan 0001, Mrigank Raman, Aaron Chan, Tianyu Zhang, Ryan A. Rossi, Handong Zhao, SungChul Kim, Nedim Lipka, Xiang Ren 0001. 4038-4051 [doi]
- Analyzing Stereotypes in Generative Text Inference TasksAnna Sotnikova, Yang Trista Cao, Hal Daumé III, Rachel Rudinger. 4052-4065 [doi]
- HySPA: Hybrid Span Generation for Scalable Text-to-Graph ExtractionLiliang Ren, Chenkai Sun, Heng Ji, Julia Hockenmaier. 4066-4078 [doi]
- Improving Automated Evaluation of Open Domain Dialog via Diverse Reference AugmentationVarun Gangal, Harsh Jhamtani, Eduard H. Hovy, Taylor Berg-Kirkpatrick. 4079-4090 [doi]
- Who Blames or Endorses Whom? Entity-to-Entity Directed Sentiment Extraction in News TextKunwoo Park, Zhufeng Pan, Jungseock Joo. 4091-4102 [doi]
- New Dataset and Strong Baselines for the Grammatical Error Correction of RussianViet Anh Trinh, Alla Rozovskaya. 4103-4111 [doi]
- A Formidable Ability: Detecting Adjectival Extremeness with DSMsFarhan Samir, Barend Beekhuizen, Suzanne Stevenson. 4112-4125 [doi]
- Effective Attention Sheds Light On InterpretabilityKaiser Sun, Ana Marasovic. 4126-4135 [doi]
- Compositionality of Complex Graphemes in the Undeciphered Proto-Elamite Script using Image and Text Embedding ModelsLogan Born, Kathryn Kelley, M. Willis Monroe, Anoop Sarkar. 4136-4146 [doi]
- On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in TransformersTianchu Ji, Shraddhan Jain, Michael Ferdman, Peter A. Milder, H. Andrew Schwartz, Niranjan Balasubramanian. 4147-4157 [doi]
- Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?Jieyu Zhao, Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Kai-Wei Chang. 4158-4164 [doi]
- Unsupervised Label Refinement Improves Dataless Text ClassificationZewei Chu, Karl Stratos, Kevin Gimpel. 4165-4178 [doi]
- Prompting Contrastive Explanations for Commonsense Reasoning TasksBhargavi Paranjape, Julian Michael, Marjan Ghazvininejad, Hannaneh Hajishirzi, Luke Zettlemoyer. 4179-4192 [doi]
- SMS Spam Detection Through Skip-gram Embeddings and Shallow NetworksGustavo José de Sousa, Daniel Carlos Guimarães Pedronette, João Paulo Papa, Ivan Rizzo Guilherme. 4193-4201 [doi]
- Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-MonitoringYichi Zhang, Joyce Chai. 4202-4213 [doi]
- Marked Attribute Bias in Natural Language InferenceHillary Dawkins. 4214-4226 [doi]
- VLM: Task-agnostic Video-Language Model Pre-training for Video UnderstandingHu Xu, Gargi Ghosh, Po-Yao Huang 0001, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer. 4227-4239 [doi]
- Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat ViolenceAndrew Halterman, Katherine A. Keith, Sheikh Muhammad Sarwar, Brendan O'Connor. 4240-4253 [doi]
- Memory-Efficient Differentiable Transformer Architecture SearchYuekai Zhao, Li Dong 0004, Yelong Shen, Zhihua Zhang, Furu Wei, Weizhu Chen. 4254-4264 [doi]
- On the Copying Behaviors of Pre-Training for Neural Machine TranslationXuebo Liu 0002, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Shuming Shi 0001, Zhaopeng Tu. 4265-4275 [doi]
- Answer Generation for Retrieval-based Question Answering SystemsChao-Chun Hsu, Eric Lind, Luca Soldaini, Alessandro Moschitti. 4276-4282 [doi]
- Grounding 'Grounding' in NLPKhyathi Raghavi Chandu, Yonatan Bisk, Alan W. Black. 4283-4305 [doi]
- Federated Chinese Word Segmentation with Global Character AssociationsYuanhe Tian, Guimin Chen, Han Qin, Yan Song. 4306-4313 [doi]
- PSED: A Dataset for Selecting Emphasis in Presentation SlidesAmirreza Shirani, Giai Tran, Hieu Trinh, Franck Dernoncourt, Nedim Lipka, Jose Echevarria, Thamar Solorio, Paul Asente. 4314-4320 [doi]
- MLMLM: Link Prediction with Mean Likelihood Masked Language ModelLouis Clouâtre, Philippe Trempe, Amal Zouaq, Sarath Chandar. 4321-4331 [doi]
- Modulating Language Models with EmotionsRuibo Liu, Jason Wei, Chenyan Jia, Soroush Vosoughi. 4332-4339 [doi]
- Effective Batching for Recurrent Neural Network GrammarsHiroshi Noji, Yohei Oseki. 4340-4352 [doi]
- Verb Sense Clustering using Contextualized Word Representations for Semantic Frame InductionKosuke Yamada, Ryohei Sasano, Koichi Takeda. 4353-4362 [doi]
- Benchmarking Neural Topic Models: An Empirical StudyThanh-Nam Doan, Tuan-Anh Hoang. 4363-4368 [doi]
- Enhancing Chinese Word Segmentation via Pseudo Labels for PracticabilityKaiyu Huang, Junpeng Liu, Degen Huang, Deyi Xiong, Zhuang Liu 0001, Jinsong Su. 4369-4381 [doi]
- Analysis of Tree-Structured Architectures for Code GenerationSamip Dahal, Adyasha Maharana, Mohit Bansal. 4382-4391 [doi]
- How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation?Weijia Xu, Shuming Ma, Dongdong Zhang 0001, Marine Carpuat. 4392-4400 [doi]
- Leveraging Topic Relatedness for Argument PersuasionXinran Zhao, Esin Durmus, Hongming Zhang, Claire Cardie. 4401-4407 [doi]
- One Teacher is Enough? Pre-trained Language Model Distillation from Multiple TeachersChuhan Wu, Fangzhao Wu, Yongfeng Huang 0001. 4408-4413 [doi]
- Logic-Consistency Text Generation from Semantic ParsesChang Shu, Yusen Zhang, Xiangyu Dong, Peng Shi, Tao Yu, Rui Zhang. 4414-4426 [doi]
- Inducing Semantic Roles Without SyntaxJulian Michael, Luke Zettlemoyer. 4427-4442 [doi]
- Plot and Rework: Modeling Storylines for Visual StorytellingChi-Yang Hsu, Yun-Wei Chu, Ting-Hao (Kenneth) Huang, Lun-Wei Ku. 4443-4453 [doi]
- Disentangled Code Representation Learning for Multiple Programming LanguagesJingfeng Zhang, Haiwen Hong, Yin Zhang, Yao Wan, Ye Liu, Yulei Sui. 4454-4466 [doi]
- Exploring Self-Identified Counseling Expertise in Online Support ForumsAllison Lahnala, Yuntian Zhao, Charles Welch, Jonathan K. Kummerfeld, Lawrence C. An, Kenneth Resnicow, Rada Mihalcea, Verónica Pérez-Rosas. 4467-4480 [doi]
- An Investigation of Suitability of Pre-Trained Language Models for Dialogue Generation - Avoiding DiscrepanciesYan Zeng, Jian-Yun Nie. 4481-4494 [doi]
- Learning to Sample Replacements for ELECTRA Pre-TrainingYaru Hao, Li Dong 0004, Hangbo Bao, Ke Xu 0001, Furu Wei. 4495-4506 [doi]
- Reordering Examples Helps during Priming-based Few-Shot LearningSawan Kumar, Partha P. Talukdar. 4507-4518 [doi]
- Constrained Labeled Data Generation for Low-Resource Named Entity RecognitionRuohao Guo, Dan Roth. 4519-4533 [doi]
- He is very intelligent, she is very beautiful? On Mitigating Social Biases in Language Modelling and GenerationAparna Garimella, Akhash Amarnath, Kiran Kumar, Akash Pramod Yalla, Anandhavelu Natarajan, Niyati Chhaya, Balaji Vasan Srinivasan. 4534-4545 [doi]
- Task-adaptive Pre-training of Language Models with Word Embedding RegularizationKosuke Nishida, Kyosuke Nishida, Sen Yoshida. 4546-4553 [doi]
- Do Grammatical Error Correction Models Realize Grammatical Generalization?Masato Mita, Hitomi Yanaka. 4554-4561 [doi]
- Domain-Aware Dependency Parsing for QuestionsAparna Garimella, Laura Chiticariu, Yunyao Li 0001. 4562-4568 [doi]
- Using Social and Linguistic Information to Adapt Pretrained Representations for Political Perspective IdentificationChang Li 0005, Dan Goldwasser. 4569-4579 [doi]
- Enhancing Dialogue-based Relation Extraction by Speaker and Trigger Words PredictionTianyang Zhao, Zhao Yan, Yunbo Cao, Zhoujun Li. 4580-4585 [doi]
- Modeling Event-Pair Relations in External Knowledge Graphs for Script ReasoningYucheng Zhou, Xiubo Geng, Tao Shen, Jian Pei, Wenqiang Zhang, Daxin Jiang. 4586-4596 [doi]
- PROST: Physical Reasoning about Objects through Space and TimeStephane Aroca-Ouellette, Cory Paik, Alessandro Roncone, Katharina Kann. 4597-4608 [doi]
- Revisiting the Evaluation of End-to-end Event ExtractionShun Zheng, Wei Cao, Wei Xu, Jiang Bian 0002. 4609-4617 [doi]
- Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASRJunkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang 0001. 4618-4624 [doi]
- HIT - A Hierarchically Fused Deep Attention Network for Robust Code-mixed Language RepresentationAyan Sengupta, Sourabh Kumar Bhattacharjee, Tanmoy Chakraborty 0002, Md. Shad Akhtar. 4625-4639 [doi]
- Semi-Supervised Data Programming with Subset SelectionAyush Maheshwari, Oishik Chatterjee, KrishnaTeja Killamsetty, Ganesh Ramakrishnan, Rishabh K. Iyer. 4640-4651 [doi]
- Fingerprinting Fine-tuned Language Models in the WildNirav Diwan, Tanmoy Chakraborty, Zubair Shafiq. 4652-4664 [doi]
- Analyzing Code Embeddings for Coding Clinical NarrativesWei Shi, Jiewen Wu, Xiwen Yang, Nancy Chen, Ivan Ho Mien, Jung-Jae Kim 0001, Pavitra Krishnaswamy. 4665-4672 [doi]
- Automatic Construction of Sememe Knowledge Bases via DictionariesFanchao Qi, Yangyi Chen, Fengyu Wang, Zhiyuan Liu 0001, Xiao Chen, Maosong Sun. 4673-4686 [doi]
- Rule-Aware Reinforcement Learning for Knowledge Graph ReasoningZhongni Hou, Xiaolong Jin, Zixuan Li, Long Bai. 4687-4692 [doi]
- XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 LanguagesTahmid Hasan, Abhik Bhattacharjee, Md. Saiful Islam, Kazi Mubasshir, Yuan-Fang Li, Yong-Bin Kang, M. Sohel Rahman, Rifat Shahriyar. 4693-4703 [doi]
- Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current PracticesSebastin Santy, Anku Rani, Monojit Choudhury. 4704-4710 [doi]
- As Easy as 1, 2, 3: Behavioural Testing of NMT Systems for Numerical TranslationJun Wang, Chang Xu, Francisco Guzmán, Ahmed El-Kishky, Benjamin I. P. Rubinstein, Trevor Cohn. 4711-4717 [doi]
- Investigating Memorization of Conspiracy Theories in Text GenerationSharon Levy, Michael Saxon, William Yang Wang. 4718-4729 [doi]
- A Text-Centered Shared-Private Framework via Cross-Modal Prediction for Multimodal Sentiment AnalysisYang Wu, Zijie Lin, Yanyan Zhao, Bing Qin 0001, Li-Nan Zhu. 4730-4738 [doi]
- What Would a Teacher Do? Predicting Future Talk MovesAnanya Ganesh, Martha Palmer, Katharina Kann. 4739-4751 [doi]
- BioGen: Generating Biography Summary under Table Guidance on WikipediaShen Gao, Xiuying Chen, Chang Liu, Dongyan Zhao 0001, Rui Yan 0001. 4752-4757 [doi]
- Multilingual Simultaneous Neural Machine TranslationPhilip Arthur, Dongwon Ryu, Gholamreza Haffari. 4758-4766 [doi]
- Cross-Domain Review Generation for Aspect-Based Sentiment AnalysisJianfei Yu, Chenggong Gong, Rui Xia. 4767-4777 [doi]
- On the Language Coverage Bias for Neural Machine TranslationShuo Wang, Zhaopeng Tu, Zhixing Tan, Shuming Shi 0001, Maosong Sun, Yang Liu 0005. 4778-4790 [doi]
- Named Entity Recognition via Noise Aware Training Mechanism with Data FilterXiusheng Huang, Yubo Chen 0001, Shun Wu, Jun Zhao 0001, Yuantao Xie, Weijian Sun. 4791-4803 [doi]
- A Multi-Task Approach for Improving Biomedical Named Entity Recognition by Incorporating Multi-Granularity informationYiqi Tong, Yidong Chen, Xiaodong Shi. 4804-4813 [doi]
- EBERT: Efficient BERT Inference with Dynamic Structured PruningZejian Liu, Fanrong Li, Gang Li 0015, Jian Cheng 0001. 4814-4823 [doi]
- Strong and Light Baseline Models for Fact-Checking Joint InferenceKateryna Tymoshenko, Alessandro Moschitti. 4824-4830 [doi]
- Sketch and Refine: Towards Faithful and Informative Table-to-Text GenerationPeng Wang 0028, Junyang Lin, an Yang, Chang Zhou, Yichang Zhang, Jingren Zhou, Hongxia Yang. 4831-4843 [doi]
- TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text GenerationShizhe Diao, Xinwei Shen, Kashun Shum, Yan Song, Tong Zhang. 4844-4858 [doi]
- John praised Mary because _he_? Implicit Causality Bias and Its Interaction with Explicit Cues in LMsYova Kementchedjhieva, Mark Anderson 0005, Anders Søgaard. 4859-4871 [doi]
- Do It Once: An Embarrassingly Simple Joint Matching Approach to Response SelectionLinhao Zhang, Dehong Ma, Sujian Li, Houfeng Wang. 4872-4877 [doi]
- Climbing the Tower of Treebanks: Improving Low-Resource Dependency Parsing via Hierarchical Source SelectionGoran Glavas, Ivan Vulic. 4878-4888 [doi]
- Enhancing the Open-Domain Dialogue Evaluation in Latent SpaceZhangming Chan, Lemao Liu, Juntao Li, Haisong Zhang, Dongyan Zhao 0001, Shuming Shi 0001, Rui Yan 0001. 4889-4900 [doi]
- Adapting Monolingual Models: Data can be Scarce when Language Similarity is HighWietse de Vries, Martijn Bartelds, Malvina Nissim, Martijn Wieling. 4901-4907 [doi]
- BatchMixup: Improving Training by Interpolating Hidden States of the Entire Mini-batchWenpeng Yin 0001, Huan Wang, Jin Qu, Caiming Xiong. 4908-4912 [doi]
- DocNLI: A Large-scale Dataset for Document-level Natural Language InferenceWenpeng Yin 0001, Dragomir R. Radev, Caiming Xiong. 4913-4922 [doi]
- Rule Augmented Unsupervised Constituency ParsingAtul Sahay, Anshul Nasery, Ayush Maheshwari, Ganesh Ramakrishnan, Rishabh K. Iyer. 4923-4932 [doi]
- Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for CatalanJordi Armengol-Estapé, Casimiro Pio Carrino, Carlos Rodríguez Penagos, Ona De Gibert Bonet, Carme Armentano-Oller, Aitor Gonzalez-Agirre, Maite Melero, Marta Villegas. 4933-4946 [doi]
- How transfer learning impacts linguistic knowledge in deep NLP models?Nadir Durrani, Hassan Sajjad, Fahim Dalvi. 4947-4957 [doi]
- Language Models Use Monotonicity to Assess NPI LicensingJaap Jumelet, Milica Denic, Jakub Szymanik, Dieuwke Hupkes, Shane Steinert-Threlkeld. 4958-4969 [doi]
- Slot Transferability for Cross-domain Slot FillingHengtong Lu, Zhuoxin Han, Caixia Yuan, Xiaojie Wang 0006, Shuyu Lei, Huixing Jiang, Wei Wu. 4970-4979 [doi]
- Word Graph Guided Summarization for Radiology FindingsJinpeng Hu, Jianling Li, Zhihong Chen, Yaling Shen, Yan Song, Xiang Wan, Tsung-Hui Chang. 4980-4990 [doi]
- Generalized Supervised Attention for Text GenerationYixian Liu, Liwen Zhang, Xinyu Zhang, Yong Jiang, Yue Zhang 0004, Kewei Tu. 4991-5003 [doi]
- Uncertainty Aware Review Hallucination for Science Article ClassificationKorbinian Friedl, Georgios Rizos, Lukas Stappen, Madina Hasan, Lucia Specia, Thomas Hain, Björn W. Schuller. 5004-5009 [doi]
- Automatically Select Emotion for Response via Personality-affected Emotion TransitionZhiyuan Wen, Jiannong Cao 0001, Ruosong Yang, Shuaiqi Liu, Jiaxing Shen. 5010-5020 [doi]
- Highlight-Transformer: Leveraging Key Phrase Aware Attention to Improve Abstractive Multi-Document SummarizationShuaiqi Liu, Jiannong Cao 0001, Ruosong Yang, Zhiyuan Wen. 5021-5027 [doi]
- Phrase-Level Action Reinforcement Learning for Neural Dialog Response GenerationTakato Yamazaki, Akiko Aizawa. 5028-5038 [doi]
- Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling InsightsDevaraja Adiga, Rishabh Kumar, Amrith Krishna, Preethi Jyothi, Ganesh Ramakrishnan, Pawan Goyal 0002. 5039-5050 [doi]
- Constraint based Knowledge Base Distillation in End-to-End Task Oriented DialogsAtishya Jain Dinesh Raghu, Sachindra Joshi Mausam. 5051-5061 [doi]
- DialogSum: A Real-Life Scenario Dialogue Summarization DatasetYulong Chen, Yang Liu, Liang Chen, Yue Zhang. 5062-5074 [doi]
- What Did You Refer to? Evaluating Co-References in DialogueWeinan Zhang 0003, Yue Zhang 0004, Hanlin Tang, Zhengyu Zhao 0003, Caihai Zhu, Ting Liu 0001. 5075-5084 [doi]
- Beyond Metadata: What Paper Authors Say About Corpora They UseNikolay Kolyada, Martin Potthast, Benno Stein 0001. 5085-5090 [doi]
- Knowledge Distillation for Quality EstimationAmit Gajbhiye, Marina Fomicheva, Fernando Alva-Manchego, Frédéric Blain, Abiola Obamuyide, Nikolaos Aletras, Lucia Specia. 5091-5099 [doi]
- Cross-document Coreference Resolution over Predicted MentionsArie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan. 5100-5107 [doi]
- Controllable Abstractive Dialogue Summarization with Sketch SupervisionChien-Sheng Wu, Linqing Liu, Wenhao Liu, Pontus Stenetorp, Caiming Xiong. 5108-5122 [doi]
- Elaborative Simplification: Content Addition and Explanation Generation in Text SimplificationNeha Srikanth, Junyi Jessy Li. 5123-5137 [doi]
- Could you give me a hint ? Generating inference graphs for defeasible reasoningAman Madaan, Dheeraj Rajagopal, Niket Tandon, Yiming Yang, Eduard H. Hovy. 5138-5147 [doi]
- Characterizing Social Spambots by their Human TraitsSalvatore Giorgi, Lyle Ungar, H. Andrew Schwartz. 5148-5158 [doi]