Abstract is missing.
- Frontmatter [doi]
- "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error CorrectionYong Dai, Linyang Li, Cong Zhou, Zhangyin Feng, Enbo Zhao, Xipeng Qiu, Piji Li, Duyu Tang. 1-8 [doi]
- Compilable Neural Code Generation with Compiler FeedbackXin Wang, Yasheng Wang, Yao Wan, Fei Mi, Yitong Li, Pingyi Zhou, Jin Liu, Hao Wu, Xin Jiang, Qun Liu. 9-19 [doi]
- Towards Unifying the Label Space for Aspect- and Sentence-based Sentiment AnalysisYiming Zhang, Min Zhang, Sai Wu, Junbo Zhao. 20-30 [doi]
- Input-specific Attention Subnetworks for Adversarial DetectionEmil Biju, Anirudh Sriram, Pratyush Kumar, Mitesh M. Khapra. 31-44 [doi]
- RelationPrompt: Leveraging Prompts to Generate Synthetic Data for Zero-Shot Relation Triplet ExtractionYew Ken Chia, Lidong Bing, Soujanya Poria, Luo Si. 45-57 [doi]
- Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?En-Shiun Annie Lee, Sarubi Thillainathan, Shravan Nayak, Surangika Ranathunga, David Ifeoluwa Adelani, Ruisi Su, Arya McCarthy. 58-67 [doi]
- Multi-Scale Distribution Deep Variational Autoencoder for Explanation GenerationZeFeng Cai, Linlin Wang, Gerard de Melo, Fei Sun, Liang He. 68-78 [doi]
- Dual Context-Guided Continuous Prompt Tuning for Few-Shot LearningJie Zhou 0016, Le Tian, Houjin Yu, Zhou Xiao, Hui Su, Jie Zhou. 79-84 [doi]
- Extract-Select: A Span Selection Framework for Nested Named Entity Recognition with Generative Adversarial TrainingPeixin Huang, Xiang Zhao 0002, Minghao Hu, Yang Fang 0001, Xinyi Li, Weidong Xiao. 85-96 [doi]
- Controlled Text Generation Using Dictionary Prior in Variational AutoencodersXianghong Fang, Jian Li, Lifeng Shang, Xin Jiang, Qun Liu, Dit-Yan Yeung. 97-111 [doi]
- Challenges to Open-Domain Constituency ParsingSen Yang, Leyang Cui, Ruoxi Ning, Di Wu, Yue Zhang 0004. 112-127 [doi]
- Going "Deeper": Structured Sememe Prediction via Transformer with Tree AttentionYining Ye, Fanchao Qi, Zhiyuan Liu, Maosong Sun. 128-138 [doi]
- Table-based Fact Verification with Self-adaptive Mixture of ExpertsYuxuan Zhou, Xien Liu, Kaiyin Zhou, Ji Wu. 139-149 [doi]
- Investigating Data Variance in Evaluations of Automatic Machine Translation MetricsJiannan Xiang, Huayang Li, Yahui Liu, Lemao Liu, Guoping Huang, Defu Lian, Shuming Shi 0001. 150-157 [doi]
- Sememe Prediction for BabelNet Synsets using Multilingual and Multimodal InformationFanchao Qi, Chuancheng Lv, Zhiyuan Liu, Xiaojun Meng, Maosong Sun, Hai-Tao Zheng. 158-168 [doi]
- Query and Extract: Refining Event Extraction as Type-oriented Binary DecodingSijia Wang, Mo Yu, Shiyu Chang, Lichao Sun, Lifu Huang. 169-182 [doi]
- LEVEN: A Large-Scale Chinese Legal Event Detection DatasetFeng Yao, Chaojun Xiao, Xiaozhi Wang, Zhiyuan Liu, Lei Hou 0001, Cunchao Tu, Juanzi Li, Yun Liu, Weixing Shen, Maosong Sun. 183-201 [doi]
- Analyzing Dynamic Adversarial Training Data in the LimitEric Wallace, Adina Williams, Robin Jia, Douwe Kiela. 202-217 [doi]
- AbductionRules: Training Transformers to Explain Unexpected InputsNathan Young, Qiming Bao 0001, Joshua Bensemann, Michael Witbrock. 218-227 [doi]
- On the Importance of Data Size in Probing Fine-tuned ModelsHouman Mehrafarin, Sara Rajaee, Mohammad Taher Pilehvar. 228-238 [doi]
- RuCCoN: Clinical Concept Normalization in RussianAlexandr Nesterov, Galina Zubkova, Zulfat Miftahutdinov, Vladimir Kokh, Elena Tutubalina, Artem Shelmanov, Anton Alekseev 0001, Manvel Avetisian, Andrey Chertok, Sergey I. Nikolenko. 239-245 [doi]
- A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence EmbeddingsHaochen Tan, Wei Shao, Han Wu, Ke Yang, Linqi Song. 246-256 [doi]
- Eider: Empowering Document-level Relation Extraction with Efficient Evidence Extraction and Inference-stage FusionYiqing Xie, Jiaming Shen, Sha Li, Yuning Mao, Jiawei Han 0001. 257-268 [doi]
- NLG: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and GenerationKaushal Maurya, Maunendra Desarkar. 269-284 [doi]
- MR-P: A Parallel Decoding Algorithm for Iterative Refinement Non-Autoregressive TranslationHao Cheng, Zhihua Zhang. 285-296 [doi]
- Open Relation Modeling: Learning to Define Relations between EntitiesJie Huang 0009, Kevin Chang, Jinjun Xiong, Wen-mei Hwu. 297-308 [doi]
- A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-SlotsSai Zhang, Yuwei Hu, Yuchuan Wu, Jiaman Wu, Yongbin Li, Jian Sun, Caixia Yuan, Xiaojie Wang. 309-321 [doi]
- Towards Transparent Interactive Semantic Parsing via Step-by-Step CorrectionLingbo Mo, Ashley Lewis, Huan Sun, Michael White. 322-342 [doi]
- MINER: Multi-Interest Matching Network for News RecommendationJian Li, Jieming Zhu, Qiwei Bi, Guohao Cai, Lifeng Shang, Zhenhua Dong, Xin Jiang, Qun Liu. 343-352 [doi]
- KSAM: Infusing Multi-Source Knowledge into Dialogue Generation via Knowledge Source Aware Multi-Head DecodingSixing Wu, Ying Li, Dawei Zhang, Zhonghai Wu. 353-363 [doi]
- Towards Responsible Natural Language Annotation for the Varieties of ArabicA. Stevie Bergman, Mona T. Diab. 364-371 [doi]
- Dynamically Refined Regularization for Improving Cross-corpora Hate Speech DetectionTulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr. 372-382 [doi]
- Towards Large-Scale Interpretable Knowledge Graph Reasoning for Dialogue SystemsYi-Lin Tuan, Sajjad Beygi, Maryam Fazel-Zarandi, Qiaozi Gao, Alessandra Cervone, William Yang Wang. 383-395 [doi]
- MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase ExtractionLinhan Zhang, Qian Chen, Wen Wang, Chong Deng, Shiliang Zhang, Bing Li, Wei Wang, Xin Cao 0001. 396-409 [doi]
- Visualizing the Relationship Between Encoded Linguistic Information and Task PerformanceJiannan Xiang, Huayang Li, Defu Lian, Guoping Huang, Taro Watanabe, Lemao Liu. 410-422 [doi]
- Efficient Argument Structure Extraction with Transfer Learning and Active LearningXinyu Hua, Lu Wang 0008. 423-437 [doi]
- Plug-and-Play Adaptation for Continuously-updated QAKyungjae Lee 0002, Wookje Han, Seung-won Hwang, Hwaran Lee, Joonsuk Park, Sang-Woo Lee. 438-447 [doi]
- Reinforced Cross-modal Alignment for Radiology Report GenerationHan Qin, Yan Song 0003. 448-458 [doi]
- What Works and Doesn't Work, A Deep Decoder for Neural Machine TranslationZuchao Li, Yiran Wang, Masao Utiyama, Eiichiro Sumita, Hai Zhao, Taro Watanabe. 459-471 [doi]
- SyMCoM - Syntactic Measure of Code Mixing A Study Of English-Hindi Code-MixingPrashant Kodali, Anmol Goel, Monojit Choudhury, Manish Shrivastava 0001, Ponnurangam Kumaraguru. 472-480 [doi]
- HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual DataKai Nakamura, Sharon Levy, Yi-Lin Tuan, Wenhu Chen, William Yang Wang. 481-492 [doi]
- NEWTS: A Corpus for News Topic-Focused SummarizationSeyed Ali Bahrainian, Sheridan Feucht, Carsten Eickhoff. 493-503 [doi]
- Classification without (Proper) Representation: Political Heterogeneity in Social Media and Its Implications for Classification and Behavioral AnalysisKenan Alkiek, Bohan Zhang, David Jurgens. 504-522 [doi]
- Toward More Meaningful Resources for Lower-resourced LanguagesConstantine Lignos, Nolan Holley, Chester Palen-Michel, Jonne Sälevä. 523-532 [doi]
- Better Quality Estimation for Low Resource Corpus MiningMuhammed Yusuf Kocyigit, Jiho Lee, Derry Wijaya. 533-543 [doi]
- End-to-End Segmentation-based News SummarizationYang Liu, Chenguang Zhu 0001, Michael Zeng 0001. 544-554 [doi]
- Fast Nearest Neighbor Machine TranslationYuxian Meng, Xiaoya Li, Xiayu Zheng, Fei Wu 0001, Xiaofei Sun, Tianwei Zhang 0004, Jiwei Li. 555-565 [doi]
- Extracting Latent Steering Vectors from Pretrained Language ModelsNishant Subramani, Nivedita Suresh, Matthew E. Peters. 566-581 [doi]
- Domain Generalisation of NMT: Fusing Adapters with Leave-One-Domain-Out TrainingThuy-Trang Vu, Shahram Khadivi, Dinh Q. Phung, Gholamreza Haffari. 582-588 [doi]
- Reframing Instructional Prompts to GPTk's LanguageDaniel Khashabi, Chitta Baral, Yejin Choi, Hannaneh Hajishirzi. 589-612 [doi]
- Read Top News First: A Document Reordering Approach for Multi-Document News SummarizationChao Zhao, Tenghao Huang, Somnath Basu Roy Chowdhury, Muthu Kumar Chandrasekaran, Kathleen R. McKeown, Snigdha Chaturvedi. 613-621 [doi]
- Human Language ModelingNikita Soni 0002, Matthew Matero, Niranjan Balasubramanian, H. Andrew Schwartz. 622-636 [doi]
- Inverse is Better! Fast and Accurate Prompt for Few-shot Slot TaggingYutai Hou, Cheng Chen, Xianzhen Luo, Bohan Li, Wanxiang Che. 637-647 [doi]
- Cross-Modal Cloze Task: A New Task to Brain-to-Word DecodingShuxian Zou, Shaonan Wang, Jiajun Zhang, Chengqing Zong. 648-657 [doi]
- Mitigating Gender Bias in Distilled Language Models via Counterfactual Role ReversalUmang Gupta, Jwala Dhamala, Varun Kumar, Apurv Verma, Yada Pruksachatkun, Satyapriya Krishna, Rahul Gupta, Kai-Wei Chang, Greg Ver Steeg, Aram Galstyan. 658-678 [doi]
- Domain Representative Keywords Selection: A Probabilistic ApproachPritom Saha Akash, Jie Huang 0009, Kevin Chen-Chuan Chang, Yunyao Li 0001, Lucian Popa 0001, ChengXiang Zhai. 679-692 [doi]
- Hierarchical Inductive Transfer for Continual Dialogue LearningShaoxiong Feng, Xuancheng Ren, Kan Li 0001, Xu Sun 0001. 693-699 [doi]
- Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language GenerationKushal Arora, Layla El Asri, Hareesh Bahuleyan, Jackie Chi Kit Cheung. 700-710 [doi]
- Question Answering Infused Pre-training of General-Purpose Contextualized RepresentationsRobin Jia, Mike Lewis, Luke Zettlemoyer. 711-728 [doi]
- Automatic Song Translation for Tonal LanguagesFenfei Guo, Chen Zhang, Zhirui Zhang, Qixin He, Kejun Zhang, Jun Xie, Jordan L. Boyd-Graber. 729-743 [doi]
- Read before Generate! Faithful Long Form Question Answering with Machine ReadingDan Su 0003, Xiaoguang Li, Jindi Zhang, Lifeng Shang, Xin Jiang, Qun Liu 0001, Pascale Fung. 744-756 [doi]
- A Simple yet Effective Relation Information Guided Approach for Few-Shot Relation ExtractionYang Liu, Jinpeng Hu, Xiang Wan, Tsung-Hui Chang. 757-763 [doi]
- MIMICause: Representation and automatic extraction of causal relation types from clinical notesVivek Khetan, Md Imbesat Hassan Rizvi, Jessica Huber, Paige Bartusiak, Bogdan Sacaleanu, Andrew E. Fano. 764-773 [doi]
- Compressing Sentence Representation for Semantic Retrieval via Homomorphic Projective DistillationXuandong Zhao, Zhiguo Yu, Ming Wu, Lei Li. 774-781 [doi]
- Debiasing Event Understanding for Visual Commonsense TasksMinji Seo, YeonJoon Jung, Seungtaek Choi, Seung-won Hwang, Bei Liu. 782-787 [doi]
- Fact-Tree Reasoning for N-ary Question Answering over Knowledge GraphsYao Zhang, Peiyao Li, Hongru Liang, Adam Jatowt, Zhenglu Yang. 788-802 [doi]
- DeepStruct: Pretraining of Language Models for Structure PredictionChenguang Wang, Xiao Liu, Zui Chen, Haoyun Hong, Jie Tang 0001, Dawn Song. 803-823 [doi]
- The Change that Matters in Discourse Parsing: Estimating the Impact of Domain Shift on Parser ErrorKatherine Atwell, Anthony Sicilia, Seong Jae Hwang, Malihe Alikhani. 824-845 [doi]
- Mukayese: Turkish NLP Strikes BackAli Safaya, Emirhan Kurtulus, Arda Göktogan, Deniz Yüret. 846-863 [doi]
- Virtual Augmentation Supported Contrastive Learning of Sentence RepresentationsDejiao Zhang, Wei Xiao, Henghui Zhu, Xiaofei Ma, Andrew O. Arnold. 864-876 [doi]
- MoEfication: Transformer Feed-forward Layers are Mixtures of ExpertsZhengyan Zhang, Yankai Lin, Zhiyuan Liu 0001, Peng Li 0030, Maosong Sun, Jie Zhou 0016. 877-890 [doi]
- DS-TOD: Efficient Domain Specialization for Task-Oriented DialogChia-Chien Hung, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavas. 891-904 [doi]
- Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language ModelJiayi Wang, Rongzhou Bao, Zhuosheng Zhang 0001, Hai Zhao. 905-915 [doi]
- Learning Adaptive Axis Attentions in Fine-tuning: Beyond Fixed Sparse Attention PatternsZihan Wang, Jiuxiang Gu, Jason Kuen, Handong Zhao, Vlad I. Morariu, Ruiyi Zhang, Ani Nenkova, Tong Sun, Jingbo Shang. 916-925 [doi]
- Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-DeploymentZichao Li, Prakhar Sharma, Xing Han Lu, Jackie Chi Kit Cheung, Siva Reddy. 926-937 [doi]
- To be or not to be an Integer? Encoding Variables for Mathematical TextDeborah Ferreira, Mokanarangan Thayaparan, Marco Valentino, Julia Rozanova, André Freitas. 938-948 [doi]
- GRS: Combining Generation and Revision in Unsupervised Sentence SimplificationMohammad Dehghan, Dhruv Kumar 0005, Lukasz Golab. 949-960 [doi]
- BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic LanguagesManuel Mager, Arturo Oncevay, Elisabeth Mager, Katharina Kann, Ngoc Thang Vu. 961-971 [doi]
- Distributed NLI: Learning to Predict Human Opinion Distributions for Language ReasoningXiang Zhou, Yixin Nie, Mohit Bansal. 972-987 [doi]
- Morphological Processing of Low-Resource Languages: Where We Are and What's NextAdam Wiemerslage, Miikka Silfverberg, Changbing Yang, Arya McCarthy, Garrett Nicolai, Eliana Colunga, Katharina Kann. 988-1007 [doi]
- Learning and Evaluating Character Representations in NovelsNaoya Inoue, Charuta Pethe, Allen Kim, Steven Skiena. 1008-1019 [doi]
- Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading ComprehensionVatsal Raina, Mark J. F. Gales. 1020-1034 [doi]
- Measuring the Language of Self-Disclosure across CorporaAnn-Katrin Reuel, Sebastian Peralta, João Sedoc, Garrick Sherman, Lyle Ungar. 1035-1047 [doi]
- When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data AugmentationEhsan Kamalloo, Mehdi Rezagholizadeh, Ali Ghodsi 0001. 1048-1062 [doi]
- Explaining Classes through Stable Word AttributionsSamuel Rönnqvist, Aki-Juhani Kyröläinen, Amanda Myntti, Filip Ginter, Veronika Laippala. 1063-1074 [doi]
- What to Learn, and How: Toward Effective Learning from RationalesSamuel Carton, Surya Kanoria, Chenhao Tan. 1075-1088 [doi]
- Listening to Affected Communities to Define Extreme Speech: Dataset and ExperimentsAntonis Maronikolakis, Axel Wisiorek, Leah Nann, Haris Jabbar, Sahana Udupa, Hinrich Schütze. 1089-1104 [doi]
- Entropy-based Attention Regularization Frees Unintended Bias Mitigation from ListsGiuseppe Attanasio, Debora Nozza, Dirk Hovy, Elena Baralis. 1105-1119 [doi]
- From BERT's Point of View: Revealing the Prevailing Contextual DifferencesCarolin Schuster, Simon Hegelich. 1120-1138 [doi]
- Learning Bias-reduced Word Embeddings Using Dictionary DefinitionsHaozhe An, Xiaojiang Liu, Donald Zhang. 1139-1152 [doi]
- Knowledge Graph Embedding by Adaptive Limit Scoring Loss Using Dynamic Weighting StrategyJinfa Yang, Xianghua Ying, Yongjie Shi, Xin Tong, Ruibin Wang, Taiyan Chen, Bowei Xing. 1153-1163 [doi]
- OCR Improves Machine Translation for Low-Resource LanguagesOana Ignat, Jean Maillard, Vishrav Chaudhary, Francisco Guzmán. 1164-1174 [doi]
- CoCoLM: Complex Commonsense Enhanced Language Model with Discourse RelationsChanglong Yu, Hongming Zhang, Yangqiu Song, Wilfred Ng. 1175-1187 [doi]
- Learning to Robustly Aggregate Labeling Functions for Semi-supervised Data ProgrammingAyush Maheshwari, KrishnaTeja Killamsetty, Ganesh Ramakrishnan, Rishabh K. Iyer, Marina Danilevsky, Lucian Popa 0001. 1188-1202 [doi]
- Multi-Granularity Semantic Aware Graph Model for Reducing Position Bias in Emotion Cause Pair ExtractionYinan Bao, Qianwen Ma, Lingwei Wei, Wei Zhou, Songlin Hu. 1203-1213 [doi]
- Cross-lingual Inference with A Chinese Entailment GraphTianyi Li, Sabine Weber, Mohammad Javad Hosseini, Liane Guillou, Mark Steedman. 1214-1233 [doi]
- Multi-task Learning for Paraphrase Generation With Keyword and Part-of-Speech ReconstructionXuhang Xie, Xuesong Lu, Bei Chen. 1234-1243 [doi]
- MDCSpell: A Multi-task Detector-Corrector Framework for Chinese Spelling CorrectionChenxi Zhu, Ziqiang Ying, Boyu Zhang, Feng Mao. 1244-1253 [doi]
- S²SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL ParsersBinyuan Hui, Ruiying Geng, Lihan Wang, Bowen Qin, Yanyang Li, Bowen Li, Jian Sun, Yongbin Li. 1254-1262 [doi]
- Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of TransformersMariano Felice, Shiva Taslimipoor, Paula Buttery. 1263-1273 [doi]
- Co-training an Unsupervised Constituency Parser with Weak SupervisionNickil Maveli, Shay B. Cohen. 1274-1291 [doi]
- HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure InformationQian Ruan, Malte Ostendorff, Georg Rehm. 1292-1308 [doi]
- An Isotropy Analysis in the Multilingual BERT Embedding SpaceSara Rajaee, Mohammad Taher Pilehvar. 1309-1316 [doi]
- Multi-Stage Prompting for Knowledgeable Dialogue GenerationZihan Liu, Mostofa Patwary, Ryan Prenger, Shrimai Prabhumoye, Wei Ping, Mohammad Shoeybi, Bryan Catanzaro. 1317-1337 [doi]
- vis: A Chinese Dataset for Open-domain Document Visual Question AnsweringLe Qi, Shangwen Lv, HongYu Li, Jing Liu, Yu Zhang, Qiaoqiao She, Hua Wu 0003, Haifeng Wang, Ting Liu. 1338-1351 [doi]
- Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence ModelsAaron Mueller, Robert Frank 0001, Tal Linzen, Luheng Wang, Sebastian Schuster. 1352-1368 [doi]
- C³KG: A Chinese Commonsense Conversation Knowledge GraphDawei Li, Yanran Li, Jiayi Zhang, Ke Li, Chen Wei, Jianwei Cui, Bin Wang. 1369-1383 [doi]
- Graph Neural Networks for Multiparallel Word AlignmentAyyoob Imani, Lütfi Kerem Senel, Masoud Jalili Sabet, François Yvon, Hinrich Schütze. 1384-1396 [doi]
- Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR ErrorsYang Wu, Yanyan Zhao, Hao Yang, Song Chen, Bing Qin 0001, Xiaohuan Cao, Wenting Zhao. 1397-1406 [doi]
- A Novel Framework Based on Medical Concept Driven Attention for Explainable Medical Code Prediction via External KnowledgeTao Wang, Linhai Zhang, Chenchen Ye, Junxi Liu, Deyu Zhou. 1407-1416 [doi]
- Effective Unsupervised Constrained Text Generation based on Perturbed MaskingYingwen Fu, Wenjie Ou, Zhou Yu, Yue Lin. 1417-1427 [doi]
- Combining (Second-Order) Graph-Based and Headed-Span-Based Projective Dependency ParsingSonglin Yang, Kewei Tu. 1428-1434 [doi]
- End-to-End Speech Translation for Code Switched SpeechOrion Weller, Matthias Sperber, Telmo Pires, Hendra Setiawan, Christian Gollan, Dominic Telaar, Matthias Paulik. 1435-1448 [doi]
- A Transformational Biencoder with In-Domain Negative Sampling for Zero-Shot Entity LinkingKai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu 0001. 1449-1458 [doi]
- Finding the Dominant Winning Ticket in Pre-Trained Language ModelsZhuocheng Gong, Di He, Yelong Shen, Tie-Yan Liu, Weizhu Chen, Dongyan Zhao 0001, Ji-Rong Wen, Rui Yan 0001. 1459-1472 [doi]
- Thai Nested Named Entity Recognition CorpusWeerayut Buaphet, Can Udomcharoenchaikit, Peerat Limkonchotiwat, Attapol Rutherford, Sarana Nutanong. 1473-1486 [doi]
- Two-Step Question Retrieval for Open-Domain QAYeon Seonwoo, Juhee Son, Jiho Jin, Sang-Woo Lee 0001, Ji-Hoon Kim, Jung-Woo Ha 0001, Alice Oh. 1487-1492 [doi]
- Semantically Distributed Robust Optimization for Vision-and-Language InferenceTejas Gokhale, Abhishek Chaudhary, Pratyay Banerjee, Chitta Baral, Yezhou Yang. 1493-1513 [doi]
- Learning from Missing Relations: Contrastive Learning with Commonsense Knowledge Graphs for Commonsense InferenceYong-Ho Jung, Jun-Hyung Park, Joon-Young Choi, Mingyu Lee, Junho Kim, Kang Min Kim, SangKeun Lee 0001. 1514-1523 [doi]
- Capture Human Disagreement Distributions by Calibrated Networks for Natural Language InferenceYuxia Wang, Minghan Wang, Yimeng Chen, Shimin Tao, Jiaxin Guo, Chang Su, Min Zhang, Hao Yang. 1524-1535 [doi]
- Efficient, Uncertainty-based Moderation of Neural Networks Text ClassifiersJakob Smedegaard Andersen, Walid Maalej. 1536-1546 [doi]
- Revisiting Automatic Evaluation of Extractive Summarization Task: Can We Do Better than ROUGE?Mousumi Akter, Naman Bansal, Shubhra Kanti Karmaker Santu. 1547-1560 [doi]
- Open Vocabulary Extreme Classification Using Generative ModelsDaniel Simig, Fabio Petroni, Pouya Yanki, Kashyap Popat, Christina Du, Sebastian Riedel 0001, Majid Yazdani. 1561-1583 [doi]
- Decomposed Meta-Learning for Few-Shot Named Entity RecognitionTingting Ma, Huiqiang Jiang, Qianhui Wu, Tiejun Zhao, Chin-Yew Lin. 1584-1596 [doi]
- TegTok: Augmenting Text Generation via Task-specific and Open-world KnowledgeChao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang. 1597-1609 [doi]
- EmoCaps: Emotion Capsule based Model for Conversational Emotion RecognitionZaijing Li, Fengxiao Tang, Ming Zhao, Yusen Zhu. 1610-1618 [doi]
- Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of TextSiyuan Wang, Wanjun Zhong, Duyu Tang, Zhongyu Wei, Zhihao Fan, Daxin Jiang, Ming Zhou 0001, Nan Duan. 1619-1629 [doi]
- Transfer Learning and Prediction Consistency for Detecting Offensive Spans of TextAmir Pouran Ben Veyseh, Ning Xu, Quan Hung Tran, Varun Manjunatha, Franck Dernoncourt, Thien Huu Nguyen. 1630-1637 [doi]
- Learning Reasoning Patterns for Relational Triple Extraction with Mutual Generation of Text and GraphYubo Chen 0002, Yunqi Zhang, Yongfeng Huang 0001. 1638-1647 [doi]
- Document-Level Event Argument Extraction via Optimal TransportAmir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Bonan Min, Thien Nguyen. 1648-1658 [doi]
- N-Shot Learning for Augmenting Task-Oriented Dialogue State TrackingIbrahim Taha Aksu, Zhengyuan Liu, Min-Yen Kan, Nancy F. Chen. 1659-1671 [doi]
- Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge DistillationQingyu Tan, Ruidan He, Lidong Bing, Hwee Tou Ng. 1672-1681 [doi]
- Calibration of Machine Reading Systems at ScaleShehzaad Dhuliawala, Leonard Adolphs, Rajarshi Das, Mrinmaya Sachan. 1682-1693 [doi]
- Towards Adversarially Robust Text Classifiers by Learning to Reweight Clean ExamplesJianhan Xu, Cenyuan Zhang, Xiaoqing Zheng, Linyang Li, Cho-Jui Hsieh, Kai-Wei Chang, Xuanjing Huang. 1694-1707 [doi]
- Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its DialectsGo Inoue, Salam Khalifa, Nizar Habash. 1708-1719 [doi]
- How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired AnalysisShaobo Li, Xiaoguang Li, Lifeng Shang, Zhenhua Dong, Chengjie Sun, Bingquan Liu, Zhenzhou Ji, Xin Jiang, Qun Liu. 1720-1732 [doi]
- Metadata Shaping: A Simple Approach for Knowledge-Enhanced Language ModelsSimran Arora, Sen Wu 0002, Enci Liu, Christopher Ré. 1733-1745 [doi]
- Enhancing Natural Language Representation with Large-Scale Out-of-Domain CommonsenseWanyun Cui, Xingran Chen. 1746-1756 [doi]
- Weighted self Distillation for Chinese word segmentationRian He, Shubin Cai, Zhong Ming 0001, Jialei Zhang. 1757-1770 [doi]
- Sibylvariant Transformations for Robust Text ClassificationFabrice Harel-Canada, Muhammad Ali Gulzar, Nanyun Peng, Miryung Kim. 1771-1788 [doi]
- DaLC: Domain Adaptation Learning Curve Prediction for Neural Machine TranslationCheonbok Park, Hantae Kim, Ioan Calapodescu, Hyunchang Cho, Vassilina Nikoulina. 1789-1807 [doi]
- Hey AI, Can You Solve Complex Tasks by Talking to Agents?Tushar Khot, Kyle Richardson 0001, Daniel Khashabi, Ashish Sabharwal. 1808-1823 [doi]
- Modality-specific Learning Rates for Effective Multimodal Additive Late-fusionYiqun Yao, Rada Mihalcea. 1824-1834 [doi]
- BiSyn-GAT+: Bi-Syntax Aware Graph Attention Network for Aspect-based Sentiment AnalysisShuo Liang, Wei Wei 0002, Xian-Ling Mao, Fei Wang, Zhiyong He. 1835-1848 [doi]
- IndicBART: A Pre-trained Model for Indic Natural Language GenerationRaj Dabre, Himani Shrotriya, Anoop Kunchukuttan, Ratish Puduppully, Mitesh Khapra, Pratyush Kumar. 1849-1863 [doi]
- Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text ModelsJianmo Ni, Gustavo Hernandez Ábrego, Noah Constant, Ji Ma, Keith B. Hall, Daniel Cer, Yinfei Yang. 1864-1874 [doi]
- Improving Relation Extraction through Syntax-induced Pre-training with Dependency MaskingYuanhe Tian, Yan Song 0003, Fei Xia. 1875-1886 [doi]
- Striking a Balance: Alleviating Inconsistency in Pre-trained Models for Symmetric Classification TasksAshutosh Kumar, Aditya Joshi. 1887-1895 [doi]
- Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph ExpertsWenhao Yu 0002, Chenguang Zhu 0001, Lianhui Qin, Zhihan Zhang, Tong Zhao 0003, Meng Jiang 0001. 1896-1906 [doi]
- Dict-BERT: Enhancing Language Model Pre-training with DictionaryWenhao Yu 0002, Chenguang Zhu 0001, Yuwei Fang, Donghan Yu, Shuohang Wang, Yichong Xu, Michael Zeng 0001, Meng Jiang 0001. 1907-1918 [doi]
- A Feasibility Study of Answer-Unaware Question Generation for EducationLiam Dugan, Eleni Miltsakaki, Shriyash Upadhyay, Etan Ginsberg, Hannah Gonzalez, DaHyeon Choi, Chuning Yuan, Chris Callison-Burch. 1919-1926 [doi]
- Relevant CommonSense Subgraphs for "What if..." Procedural ReasoningChen Zheng, Parisa KordJamshidi. 1927-1933 [doi]
- Combining Feature and Instance Attribution to Detect ArtifactsPouya Pezeshkpour, Sarthak Jain, Sameer Singh 0001, Byron C. Wallace. 1934-1946 [doi]
- Leveraging Expert Guided Adversarial Augmentation For Improving Generalization in Named Entity RecognitionAaron Reich, Jiaao Chen, Aastha Agrawal, Yanzhe Zhang, Diyi Yang. 1947-1955 [doi]
- Label Semantics for Few Shot Named Entity RecognitionJie Ma, Miguel Ballesteros, Srikanth Doss, Rishita Anubhai, Sunil Mallya, Yaser Al-Onaizan, Dan Roth. 1956-1971 [doi]
- Detection, Disambiguation, Re-ranking: Autoregressive Entity Linking as a Multi-Task ProblemKhalil Mrini, Shaoliang Nie, Jiatao Gu, Sinong Wang, Maziar Sanjabi, Hamed Firooz. 1972-1983 [doi]
- VISITRON: Visual Semantics-Aligned Interactively Trained Object-NavigatorAyush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gökhan Tür, Devi Parikh, Dilek Hakkani-Tur. 1984-1994 [doi]
- Investigating Selective Prediction Approaches Across Several Tasks in IID, OOD, and Adversarial SettingsNeeraj Varshney, Swaroop Mishra, Chitta Baral. 1995-2002 [doi]
- Unsupervised Natural Language Inference Using PHL Triplet GenerationNeeraj Varshney, Pratyay Banerjee, Tejas Gokhale, Chitta Baral. 2003-2016 [doi]
- Data Augmentation and Learned Layer Aggregation for Improved Multilingual Language Understanding in DialogueEvgeniia Razumovskaia, Ivan Vulic, Anna Korhonen. 2017-2033 [doi]
- Ranking-Constrained Learning with Rationales for Text ClassificationJuanyan Wang, Manali Sharma, Mustafa Bilgic 0001. 2034-2046 [doi]
- CaM-Gen: Causally Aware Metric-Guided Text GenerationNavita Goyal, Roodram Paneri, Ayush Agarwal, Udit Kalani, Abhilasha Sancheti, Niyati Chhaya. 2047-2060 [doi]
- Training Dynamics for Text Summarization ModelsTanya Goyal, Jiacheng Xu, Junyi Jessy Li, Greg Durrett. 2061-2073 [doi]
- Richer Countries and Richer RepresentationsKaitlyn Zhou, Kawin Ethayarajh, Dan Jurafsky. 2074-2085 [doi]
- BBQ: A hand-built bias benchmark for question answeringAlicia Parrish, Angelica Chen, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Jana Thompson, Phu Mon Htut, Samuel R. Bowman. 2086-2105 [doi]
- Zero-shot Learning for Grapheme to Phoneme Conversion with Language EnsembleXinjian Li, Florian Metze, David Mortensen, Shinji Watanabe 0001, Alan W. Black. 2106-2115 [doi]
- Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Under-Documented LanguagesClarissa Forbes, Farhan Samir, Bruce Harold Oliver, Changbing Yang, Edith Coates, Garrett Nicolai, Miikka Silfverberg. 2116-2130 [doi]
- Question Generation for Reading Comprehension Assessment by Modeling How and What to AskBilal Ghanem, Lauren Lutz Coleman, Julia Rivard Dexter, Spencer McIntosh von der Ohe, Alona Fyshe. 2131-2146 [doi]
- TABi: Type-Aware Bi-Encoders for Open-Domain Entity RetrievalMegan Leszczynski, Daniel Y. Fu, Mayee F. Chen, Christopher Ré. 2147-2166 [doi]
- Hierarchical Recurrent Aggregative Generation for Few-Shot NLGGiulio Zhou, Gerasimos Lampouras, Ignacio Iacobacci. 2167-2181 [doi]
- Training Text-to-Text Transformers with Privacy GuaranteesNatalia Ponomareva, Jasmijn Bastings, Sergei Vassilvitskii. 2182-2193 [doi]
- Revisiting Uncertainty-based Query Strategies for Active Learning with TransformersChristopher Schröder, Andreas Niekler, Martin Potthast. 2194-2203 [doi]
- The impact of lexical and grammatical processing on generating code from natural languageNathanaël Beau, Benoît Crabbé. 2204-2214 [doi]
- Seq2Path: Generating Sentiment Tuples as Paths of a TreeYue Mao, Yi Shen, Jingchao Yang, Xiaoying Zhu, Longjun Cai. 2215-2225 [doi]
- Mitigating the Inconsistency Between Word Saliency and Model Confidence with Pathological Contrastive TrainingPengwei Zhan, Yang Wu, Shaolei Zhou, Yunjian Zhang, Liming Wang. 2226-2244 [doi]
- Your fairness may vary: Pretrained language model fairness in toxic text classificationIoana Baldini, Dennis Wei, Karthikeyan Natesan Ramamurthy, Moninder Singh, Mikhail Yurochkin. 2245-2262 [doi]
- ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical ReasoningAhmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque. 2263-2279 [doi]
- A Novel Perspective to Look At Attention: Bi-level Attention-based Explainable Topic Modeling for News ClassificationDairui Liu, Derek Greene, Ruihai Dong. 2280-2290 [doi]
- Learn and Review: Enhancing Continual Named Entity Recognition via Reviewing Synthetic SamplesYu Xia, Quan Wang, Yajuan Lyu, Yong Zhu 0004, Wenhao Wu, Sujian Li, Dai Dai. 2291-2300 [doi]
- Phoneme transcription of endangered languages: an evaluation of recent ASR architectures in the single speaker scenarioGilles Boulianne. 2301-2308 [doi]
- Does BERT really agree ? Fine-grained Analysis of Lexical Dependence on a Syntactic TaskKarim Lasri, Alessandro Lenci, Thierry Poibeau. 2309-2315 [doi]
- Combining Static and Contextualised Multilingual EmbeddingsKatharina Hämmerl, Jindrich Libovický, Alexander Fraser 0001. 2316-2329 [doi]
- An Accurate Unsupervised Method for Joint Entity Alignment and Dangling Entity DetectionShengxuan Luo, Sheng Yu 0002. 2330-2339 [doi]
- Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research ManifoldSebastian Ruder, Ivan Vulic, Anders Søgaard. 2340-2354 [doi]
- Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing PerspectiveEdoardo Manino, Julia Rozanova, Danilo Carvalho, André Freitas, Lucas C. Cordeiro. 2355-2366 [doi]
- Improving Neural Political Statement Classification with Class Hierarchical InformationErenay Dayanik, André Blessing, Nico Blokker, Sebastian Haunss, Jonas Kuhn, Gabriella Lapesa, Sebastian Padó. 2367-2382 [doi]
- Enabling Multimodal Generation on CLIP via Vision-Language Knowledge DistillationWenliang Dai, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Pascale Fung. 2383-2395 [doi]
- Co-VQA : Answering by Interactive Sub Question SequenceRuonan Wang, Yuxi Qian, Fangxiang Feng, Xiaojie Wang, Huixing Jiang. 2396-2408 [doi]
- A Simple Hash-Based Early Exiting Approach For Language Understanding and GenerationTianxiang Sun, Xiangyang Liu, Wei Zhu, Zhichao Geng, Lingling Wu, Yilong He, Yuan Ni, Guotong Xie, Xuanjing Huang, Xipeng Qiu. 2409-2421 [doi]
- Auxiliary tasks to boost Biaffine Semantic Dependency ParsingMarie Candito. 2422-2429 [doi]
- Syntax-guided Contrastive Learning for Pre-trained Language ModelShuai Zhang, Lijie Wang, Xinyan Xiao, Hua Wu 0003. 2430-2440 [doi]
- Improved Multi-label Classification under Temporal Concept Drift: Rethinking Group-Robust Algorithms in a Label-Wise SettingIlias Chalkidis, Anders Søgaard. 2441-2454 [doi]
- ASCM: An Answer Space Clustered Prompting Method without Answer EngineeringZhen Wang, Yating Yang, Zhou Xi, Bo Ma 0004, Lei Wang, Rui Dong 0002, Azmat Anwar. 2455-2469 [doi]
- Why don't people use character-level machine translation?Jindrich Libovický, Helmut Schmid, Alexander Fraser 0001. 2470-2485 [doi]
- Seeking Patterns, Not just Memorizing Procedures: Contrastive Learning for Solving Math Word ProblemsZhongli Li, Wenxuan Zhang, Chao Yan, Qingyu Zhou, Chao Li, Hongzhi Liu, Yunbo Cao. 2486-2496 [doi]
- xGQA: Cross-Lingual Visual Question AnsweringJonas Pfeiffer, Gregor Geigle, Aishwarya Kamath, Jan-Martin O. Steitz, Stefan Roth 0001, Ivan Vulic, Iryna Gurevych. 2497-2511 [doi]
- Automatic Speech Recognition and Query By Example for Creole Languages DocumentationCécile Macaire, Didier Schwab, Benjamin Lecouteux, Emmanuel Schang. 2512-2520 [doi]
- MReD: A Meta-Review Dataset for Structure-Controllable Text GenerationChenhui Shen, LiYing Cheng, Ran Zhou, Lidong Bing, Yang You, Luo Si. 2521-2535 [doi]
- Single Model Ensemble for Subword Regularized Models in Low-Resource Machine TranslationSho Takase, Tatsuya Hiraoka, Naoaki Okazaki. 2536-2541 [doi]
- Detecting Various Types of Noise for Neural Machine TranslationChristian Herold, Jan Rosendahl, Joris Vanvinckenroye, Hermann Ney. 2542-2551 [doi]
- DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-trainingLuyang Huang, Guocheng Niu, Jiachen Liu, Xinyan Xiao, Hua Wu 0003. 2552-2566 [doi]
- HiCLRE: A Hierarchical Contrastive Learning Framework for Distantly Supervised Relation ExtractionDongyang Li, Taolin Zhang, Nan Hu, Chengyu Wang 0001, Xiaofeng He. 2567-2578 [doi]
- Prompt-Driven Neural Machine TranslationYafu Li, Yongjing Yin, Jing Li, Yue Zhang 0004. 2579-2590 [doi]
- On Controlling Fallback Responses for Grounded Dialogue GenerationHongyuan Lu, Wai Lam, Hong Cheng 0001, Helen Meng. 2591-2601 [doi]
- CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractionsTayfun Ates, Muhammed Samil Atesoglu, Cagatay Yigit, Ilker Kesen, Mert Kobas, Erkut Erdem, Aykut Erdem, Tilbe Göksun, Deniz Yuret. 2602-2627 [doi]
- A Graph Enhanced BERT Model for Event PredictionLi Du, Xiao Ding, Yue Zhang 0004, Ting Liu, Bing Qin 0001. 2628-2638 [doi]
- Long Time No See! Open-Domain Conversation with Long-Term Persona MemoryXinchao Xu, Zhibin Gou, WenQuan Wu, Zheng-Yu Niu, Hua Wu, Haifeng Wang 0001, Shihang Wang. 2639-2650 [doi]
- Lacking the Embedding of a Word? Look it up into a Traditional DictionaryElena Sofia Ruzzetti, Leonardo Ranaldi, Michele Mastromattei, Francesca Fallucchi, Noemi Scarpato, Fabio Massimo Zanzotto. 2651-2662 [doi]
- MTRec: Multi-Task Learning over BERT for News RecommendationQiwei Bi, Jian Li, Lifeng Shang, Xin Jiang, Qun Liu, Hanfang Yang. 2663-2669 [doi]
- Cross-domain Named Entity Recognition via Graph MatchingJunhao Zheng, Haibin Chen, Qianli Ma. 2670-2680 [doi]
- Assessing Multilingual Fairness in Pre-trained Multimodal RepresentationsJialu Wang, Yang Liu, Xin Wang. 2681-2695 [doi]
- More Than Words: Collocation Retokenization for Latent Dirichlet Allocation ModelsJin Cheevaprawatdomrong, Alexandra Schofield, Attapol Rutherford. 2696-2704 [doi]
- Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial RobustnessTejas Gokhale, Swaroop Mishra, Man Luo, Bhavdeep Singh Sachdeva, Chitta Baral. 2705-2718 [doi]
- ASSIST: Towards Label Noise-Robust Dialogue State TrackingFanghua Ye 0001, Yue Feng, Emine Yilmaz. 2719-2731 [doi]
- Graph Refinement for Coreference ResolutionLesly Miculicich, James Henderson 0001. 2732-2742 [doi]
- ECO v1: Towards Event-Centric Opinion MiningRuoxi Xu, Hongyu Lin, Meng Liao, Xianpei Han, Jin Xu, Wei Tan, Yingfei Sun, Le Sun 0001. 2743-2753 [doi]
- Deep Reinforcement Learning for Entity AlignmentLingbing Guo, Yuqiang Han, Qiang Zhang, Huajun Chen. 2754-2765 [doi]
- Breaking Down Multilingual Machine TranslationTing-Rui Chiang, Yi Pei Chen, Yi-Ting Yeh, Graham Neubig. 2766-2780 [doi]
- Mitigating Contradictions in Dialogue Based on Contrastive LearningWeizhao Li, Junsheng Kong, Ben Liao, Yi Cai 0001. 2781-2788 [doi]
- ELLE: Efficient Lifelong Pre-training for Emerging DataYujia Qin, Jiajie Zhang, Yankai Lin, Zhiyuan Liu, Peng Li 0030, Maosong Sun, Jie Zhou. 2789-2810 [doi]
- EnCBP: A New Benchmark Dataset for Finer-Grained Cultural Background Prediction in EnglishWeicheng Ma, Samiha Datta, Lili Wang, Soroush Vosoughi. 2811-2823 [doi]
- Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language ModelsRobert L. Logan IV, Ivana Balazevic, Eric Wallace, Fabio Petroni, Sameer Singh 0001, Sebastian Riedel 0001. 2824-2835 [doi]
- uFACT: Unfaithful Alien-Corpora Training for Semantically Consistent Data-to-Text GenerationTisha Anders, Alexandru Coca, Bill Byrne. 2836-2841 [doi]
- Good Night at 4 pm?! Time Expressions in Different CulturesVered Shwartz. 2842-2853 [doi]
- Extracting Person Names from User Generated Text: Named-Entity Recognition for Combating Human TraffickingYifei Li, Pratheeksha Nair, Kellin Pelrine, Reihaneh Rabbany. 2854-2868 [doi]
- OneAligner: Zero-shot Cross-lingual Transfer with One Rich-Resource Language Pair for Low-Resource Sentence RetrievalTong Niu, Kazuma Hashimoto, Yingbo Zhou, Caiming Xiong. 2869-2882 [doi]
- Suum Cuique: Studying Bias in Taboo Detection with a Community PerspectiveOsama Khalid, Jonathan Rusert, Padmini Srinivasan. 2883-2896 [doi]
- Modeling Intensification for Sign Language Generation: A Computational ApproachMert Inan, Yang Zhong, Sabit Hassan, Lorna C. Quandt, Malihe Alikhani. 2897-2911 [doi]
- Controllable Natural Language Generation with Contrastive PrefixesJing Qian, Li Dong 0004, Yelong Shen, Furu Wei, Weizhu Chen. 2912-2924 [doi]
- Revisiting the Effects of Leakage on Dependency ParsingNathaniel Krasner, Miriam Wanner, Antonios Anastasopoulos. 2925-2934 [doi]
- Learning to Describe Solutions for Bug Reports Based on Developer DiscussionsSheena Panthaplackel, Junyi Jessy Li, Milos Gligoric, Raymond J. Mooney. 2935-2952 [doi]
- Perturbations in the Wild: Leveraging Human-Written Text Perturbations for Realistic Adversarial Attack and DefenseThai Le, Jooyoung Lee, Kevin Yen, Yifan Hu, Dongwon Lee 0001. 2953-2965 [doi]
- Improving Chinese Grammatical Error Detection via Data augmentation by Conditional Error GenerationTianchi Yue, Shulin Liu, Huihui Cai, Tao Yang, Shengkang Song, Tinghao Yu. 2966-2975 [doi]
- Modular and Parameter-Efficient Multimodal Fusion with PromptingSheng Liang, Mengjie Zhao, Hinrich Schütze. 2976-2985 [doi]
- Synchronous Refinement for Neural Machine TranslationKehai Chen, Masao Utiyama, Eiichiro Sumita, Rui Wang, Min Zhang 0005. 2986-2996 [doi]
- HIE-SQL: History Information Enhanced Network for Context-Dependent Text-to-SQL Semantic ParsingYanzhao Zheng, Haibin Wang, Baohua Dong, Xingjun Wang, Changshan Li. 2997-3007 [doi]
- CRASpell: A Contextual Typo Robust Approach to Improve Chinese Spelling CorrectionShulin Liu, Shengkang Song, Tianchi Yue, Tao Yang, Huihui Cai, Tinghao Yu, Shengli Sun. 3008-3018 [doi]
- Gaussian Multi-head Attention for Simultaneous Machine TranslationShaolei Zhang, Yang Feng. 3019-3030 [doi]
- Composing Structure-Aware Batches for Pairwise Sentence ClassificationAndreas Waldis, Tilman Beck, Iryna Gurevych. 3031-3045 [doi]
- Factual Consistency of Multilingual Pretrained Language ModelsConstanza Fierro, Anders Søgaard. 3046-3052 [doi]
- Selecting Stickers in Open-Domain Dialogue through Multitask LearningZhexin Zhang, Yeshuang Zhu, Zhengcong Fei, Jinchao Zhang, Jie Zhou 0016. 3053-3060 [doi]
- ZiNet: Linking Chinese Characters Spanning Three Thousand YearsYang Chi, Fausto Giunchiglia, Daqian Shi, Xiaolei Diao, Chuntao Li, Hao Xu. 3061-3070 [doi]
- How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing?Hailong Jin, Tiansi Dong, Lei Hou 0001, Juanzi Li, Hui Chen, Zelin Dai, Yincen Qu. 3071-3081 [doi]
- AMR-DA: Data Augmentation by Abstract Meaning RepresentationZiyi Shou, Yuxin Jiang, Fangzhen Lin. 3082-3098 [doi]
- Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative StudySerra Sinem Tekiroglu, Helena Bonaldi, Margherita Fanton, Marco Guerini. 3099-3114 [doi]
- Improving Robustness of Language Models from a Geometry-aware PerspectiveBin Zhu, Zhaoquan Gu, Le Wang 0008, Jinyin Chen, Qi Xuan. 3115-3125 [doi]
- Task-guided Disentangled Tuning for Pretrained Language ModelsJiali Zeng, Yufan Jiang, Shuangzhi Wu, Yongjing Yin, Mu Li 0001. 3126-3137 [doi]
- Exploring the Impact of Negative Samples of Contrastive Learning: A Case Study of Sentence EmbeddingRui Cao, Yihao Wang, Yuxin Liang, Ling Gao, Jie Zheng 0005, Jie Ren, Zheng Wang 0001. 3138-3152 [doi]
- The Inefficiency of Language Models in Scholarly Retrieval: An Experimental Walk-throughShruti Singh, Mayank Singh 0001. 3153-3173 [doi]
- Fusing Heterogeneous Factors with Triaffine Mechanism for Nested Named Entity RecognitionZheng Yuan 0002, Chuanqi Tan, Songfang Huang, Fei Huang. 3174-3186 [doi]
- UNIMO-2: End-to-End Unified Vision-Language Grounded LearningWei Li, Can Gao, Guocheng Niu, Xinyan Xiao, Hao Liu, Jiachen Liu, Hua Wu, Haifeng Wang. 3187-3201 [doi]
- The Past Mistake is the Future Wisdom: Error-driven Contrastive Probability Optimization for Chinese Spell CheckingYinghui Li, Qingyu Zhou, Yangning Li, Zhongli Li, Ruiyang Liu, Rongyi Sun, Zizhen Wang, Chao Li, Yunbo Cao, Hai-Tao Zheng. 3202-3213 [doi]
- XFUND: A Benchmark Dataset for Multilingual Visually Rich Form UnderstandingYiheng Xu, Tengchao Lv, Lei Cui 0001, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Furu Wei. 3214-3224 [doi]
- Type-Driven Multi-Turn Corrections for Grammatical Error CorrectionShaopeng Lai, Qingyu Zhou, Jiali Zeng, Zhongli Li, Chao Li, Yunbo Cao, Jinsong Su. 3225-3236 [doi]
- Leveraging Knowledge in Multilingual Commonsense ReasoningYuwei Fang, Shuohang Wang, Yichong Xu, Ruochen Xu, Siqi Sun, Chenguang Zhu, Michael Zeng 0001. 3237-3246 [doi]
- Encoding and Fusing Semantic Connection and Linguistic Evidence for Implicit Discourse Relation RecognitionWei Xiang 0005, Bang Wang, Lu Dai, Yijun Mo. 3247-3257 [doi]
- One Agent To Rule Them All: Towards Multi-agent Conversational AIChristopher Clarke, Joseph Peper, Karthik Krishnamurthy, Walter Talamonti, Kevin Leach, Walter S. Lasecki, Yiping Kang, Lingjia Tang, Jason Mars. 3258-3267 [doi]
- Word-level Perturbation Considering Word Length and Compositional SubwordsTatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki. 3268-3275 [doi]
- Bridging Pre-trained Language Models and Hand-crafted Features for Unsupervised POS TaggingHouquan Zhou, Yang Li, Zhenghua Li, Min Zhang. 3276-3290 [doi]
- Controlling the Focus of Pretrained Language Generation ModelsJiabao Ji, Yoon Kim, James R. Glass, Tianxing He. 3291-3306 [doi]
- Comparative Opinion Summarization via Collaborative DecodingHayate Iso, Xiaolan Wang 0001, Stefanos Angelidis, Yoshihiko Suhara. 3307-3324 [doi]
- IsoScore: Measuring the Uniformity of Embedding Space UtilizationWilliam Rudman, Nate Gillman, Taylor Rayne, Carsten Eickhoff. 3325-3339 [doi]
- A Natural Diet: Towards Improving Naturalness of Machine Translation OutputMarkus Freitag, David Vilar, David Grangier, Colin Cherry, George F. Foster. 3340-3353 [doi]
- From Stance to Concern: Adaptation of Propositional Analysis to New Tasks and DomainsBrodie Mather, Bonnie J. Dorr, Adam Dalton 0001, William de Beaumont, Owen Rambow, Sonja Schmer-Galunder. 3354-3367 [doi]
- CUE Vectors: Modular Training of Language Models Conditioned on Diverse Contextual SignalsScott Novotney, Sreeparna Mukherjee, Zeeshan Ahmed, Andreas Stolcke. 3368-3379 [doi]
- Cross-Lingual UMLS Named Entity Linking using UMLS Dictionary Fine-TuningRina Galperin, Shachar Schnapp, Michael Elhadad. 3380-3390 [doi]
- Aligned Weight Regularizers for Pruning Pretrained Neural NetworksJames O' Neill, Sourav Dutta, Haytham Assem. 3391-3401 [doi]
- Consistent Representation Learning for Continual Relation ExtractionKang Zhao, Hua Xu, Jiangong Yang, Kai Gao. 3402-3411 [doi]
- Event Transition Planning for Open-ended Text GenerationQintong Li, Piji Li, Wei Bi, Zhaochun Ren, Yuxuan Lai, Lingpeng Kong. 3412-3426 [doi]
- Comprehensive Multi-Modal Interactions for Referring Image SegmentationKanishk Jain, Vineet Gandhi. 3427-3435 [doi]
- MetaWeighting: Learning to Weight Tasks in Multi-Task LearningYuren Mao, Zekai Wang, Weiwei Liu 0003, Xuemin Lin 0001, Pengtao Xie. 3436-3448 [doi]
- Improving Controllable Text Generation with Position-Aware Weighted DecodingYuxuan Gu, Xiaocheng Feng, Sicheng Ma, Jiaming Wu, Heng Gong, Bing Qin 0001. 3449-3467 [doi]
- Prompt Tuning for Discriminative Pre-trained Language ModelsYuan Yao, Bowen Dong, Ao Zhang, Zhengyan Zhang, Ruobing Xie, Zhiyuan Liu, Leyu Lin, Maosong Sun, Jianyong Wang 0001. 3468-3473 [doi]
- Two Birds with One Stone: Unified Model Learning for Both Recall and Ranking in News RecommendationChuhan Wu, Fangzhao Wu, Tao Qi, Yongfeng Huang. 3474-3480 [doi]
- What does it take to bake a cake? The RecipeRef corpus and anaphora resolution in procedural textBiaoyan Fang, Timothy Baldwin, Karin Verspoor. 3481-3495 [doi]
- MERIt: Meta-Path Guided Contrastive Learning for Logical ReasoningFangkai Jiao, Yangyang Guo, Xuemeng Song, Liqiang Nie. 3496-3509 [doi]
- THE-X: Privacy-Preserving Transformer Inference with Homomorphic EncryptionTianyu Chen, Hangbo Bao, Shaohan Huang, Li Dong, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei. 3510-3520 [doi]
- HLDC: Hindi Legal Documents CorpusArnav Kapoor, Mudit Dhawan, Anmol Goel, T. H. Arjun, Akshala Bhatnagar, Vibhu Agrawal, Amul Agrawal, Arnab Bhattacharya 0001, Ponnurangam Kumaraguru, Ashutosh Modi. 3521-3536 [doi]
- Rethinking Document-level Neural Machine TranslationZewei Sun, Mingxuan Wang, Hao Zhou, Chengqi Zhao, Shujian Huang, Jiajun Chen, Lei Li. 3537-3548 [doi]
- Incremental Intent Detection for Medical Domain with Contrast Replay NetworksGuirong Bai, Shizhu He, Kang Liu 0001, Jun Zhao 0001. 3549-3556 [doi]
- LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text RetrievalCanwen Xu, Daya Guo, Nan Duan, Julian McAuley. 3557-3569 [doi]
- Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable ApproachXin Lv, Yankai Lin, Yixin Cao 0002, Lei Hou 0001, Juanzi Li, Zhiyuan Liu, Peng Li, Jie Zhou. 3570-3581 [doi]
- EICO: Improving Few-Shot Text Classification via Explicit and Implicit Consistency RegularizationLei Zhao, Cheng Yao. 3582-3587 [doi]
- Improving the Adversarial Robustness of NLP Models by Information BottleneckCenyuan Zhang, Xiang Zhou, Yixin Wan, Xiaoqing Zheng, Kai-Wei Chang, Cho-Jui Hsieh. 3588-3598 [doi]
- Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment AnalysisKai Zhang, Kun Zhang, Mengdi Zhang, Hongke Zhao, Qi Liu, Wei Wu 0014, Enhong Chen. 3599-3610 [doi]
- DARER: Dual-task Temporal Relational Recurrent Reasoning Network for Joint Dialog Sentiment Classification and Act RecognitionBowen Xing, Ivor W. Tsang. 3611-3621 [doi]
- Divide and Conquer: Text Semantic Matching with Disentangled Keywords and IntentsYicheng Zou, Hongwei Liu, Tao Gui, Junzhe Wang, Qi Zhang, Meng Tang, Haixiang Li, Daniel Wang. 3622-3632 [doi]
- Modular Domain AdaptationJunshen K. Chen, Dallas Card, Dan Jurafsky. 3633-3655 [doi]
- Detection of Adversarial Examples in Text Classification: Benchmark and Baseline via Robust Density EstimationKiYoon Yoo, Jangho Kim, Jiho Jang, Nojun Kwak. 3656-3672 [doi]
- Platt-Bin: Efficient Posterior Calibrated Training for NLP ClassifiersRishabh Singh, Shirin Goshtasbpour. 3673-3684 [doi]
- Addressing Resource and Privacy Constraints in Semantic Parsing Through Data AugmentationKevin Yang, Olivia Deng, Charles Chen, Richard Shin, Subhro Roy, Benjamin Van Durme. 3685-3695 [doi]
- Improving Candidate Retrieval with Entity Profile Generation for Wikidata Entity LinkingTuan Lai, Heng Ji, ChengXiang Zhai. 3696-3711 [doi]
- Local Structure Matters Most: Perturbation Study in NLULouis Clouâtre, Prasanna Parthasarathi, Amal Zouaq, Sarath Chandar. 3712-3731 [doi]
- Probing Factually Grounded Content Transfer with Factual AblationPeter West, Chris Quirk, Michel Galley, Yejin Choi. 3732-3746 [doi]
- ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking InferenceKai Hui 0001, Honglei Zhuang, Tao Chen 0008, Zhen Qin 0001, Jing Lu, Dara Bahri, Ji Ma, Jai Prakash Gupta, Cícero Nogueira dos Santos, Yi Tay, Donald Metzler. 3747-3758 [doi]
- Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation MetricsDaniel Deutsch, Dan Roth. 3759-3765 [doi]
- Prior Knowledge and Memory Enriched Transformer for Sign Language TranslationTao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng. 3766-3775 [doi]
- Discontinuous Constituency and BERT: A Case Study of DutchKonstantinos Kogkalidis, Gijs Wijnholds. 3776-3785 [doi]
- Probing Multilingual Cognate Prediction ModelsClémentine Fourrier, Benoît Sagot. 3786-3801 [doi]
- A Neural Pairwise Ranking Model for Readability AssessmentJustin Lee, Sowmya Vajjala. 3802-3813 [doi]
- First the Worst: Finding Better Gender Translations During Beam SearchDanielle Saunders, Rosie Sallis, Bill Byrne. 3814-3823 [doi]
- Dialogue Summaries as Dialogue States (DS2), Template-Guided Summarization for Few-shot Dialogue State TrackingJamin Shin, Hangyeol Yu, Hyeongdon Moon, Andrea Madotto, Juneyoung Park. 3824-3846 [doi]
- Unsupervised Preference-Aware Language IdentificationXingzhang Ren, Baosong Yang, Dayiheng Liu, Haibo Zhang, Xiaoyu Lv, Liang Yao, Jun Xie. 3847-3852 [doi]
- Using NLP to quantify the environmental cost and diversity benefits of in-person NLP conferencesPiotr Przybyla, Matthew Shardlow. 3853-3863 [doi]
- Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence MaskingTianyi Luo, Rui Meng, Xin Wang, Yang Liu. 3864-3876 [doi]
- Chinese Synesthesia Detection: New Dataset and ModelsXiaotong Jiang, Qingqing Zhao, Yunfei Long, Zhongqing Wang. 3877-3887 [doi]
- Rethinking Offensive Text Detection as a Multi-Hop Reasoning ProblemQiang Zhang, Jason Naradowsky, Yusuke Miyao. 3888-3905 [doi]
- On the Safety of Conversational Models: Taxonomy, Dataset, and BenchmarkHao Sun 0012, Guangxuan Xu, Jiawen Deng, Jiale Cheng, Chujie Zheng, Hao Zhou, Nanyun Peng, Xiaoyan Zhu 0001, Minlie Huang. 3906-3923 [doi]
- Word Segmentation by Separation Inference for East Asian LanguagesYu Tong, Jingzhi Guo, Jizhe Zhou, Ge Chen, Guokai Zheng. 3924-3934 [doi]
- Unsupervised Chinese Word Segmentation with BERT Oriented Probing and TransformationWei Li, Yuhan Song, Qi Su, Yanqiu Shao. 3935-3940 [doi]
- E-KAR: A Benchmark for Rationalizing Natural Language Analogical ReasoningJiangjie Chen, Rui Xu, Ziquan Fu, Wei Shi, Zhongqiao Li, Xinbo Zhang, Changzhi Sun, Lei Li, Yanghua Xiao, Hao Zhou. 3941-3955 [doi]
- Implicit Relation Linking for Question Answering over Knowledge GraphYao Zhao, JiaCheng Huang, Wei Hu 0007, Qijin Chen, Xiaoxia Qiu, Chengfu Huo, Weijun Ren. 3956-3968 [doi]
- Attention Mechanism with Energy-Friendly OperationsYu Wan 0004, Baosong Yang, Dayiheng Liu, Rong Xiao, Derek F. Wong, Haibo Zhang, Boxing Chen, Lidia S. Chao. 3969-3976 [doi]
- Probing BERT's priors with serial reproduction chainsTakateru Yamakoshi, Thomas L. Griffiths, Robert D. Hawkins. 3977-3992 [doi]
- Interpreting the Robustness of Neural NLP Models to Textual PerturbationsYunxiang Zhang 0002, Liangming Pan, Samson Tan, Min-Yen Kan. 3993-4007 [doi]
- Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant RepresentationsJi Xin, Chenyan Xiong, Ashwin Srinivasan 0003, Ankita Sharma, Damien Jose, Paul Bennett 0001. 4008-4020 [doi]
- A Few-Shot Semantic Parser for Wizard-of-Oz Dialogues with the Precise ThingTalk RepresentationGiovanni Campagna, Sina J. Semnani, Ryan Kearns, Lucas Jun Koba Sato, Silei Xu, Monica S. Lam. 4021-4034 [doi]
- GCPG: A General Framework for Controllable Paraphrase GenerationKexin Yang 0002, Dayiheng Liu, Wenqiang Lei, Baosong Yang, Haibo Zhang, Xue Zhao, Wenqing Yao, Boxing Chen. 4035-4047 [doi]
- CrossAligner & Co: Zero-Shot Transfer Methods for Task-Oriented Cross-lingual Natural Language UnderstandingMilan Gritta, Ruoyu Hu, Ignacio Iacobacci. 4048-4061 [doi]
- Attention as Grounding: Exploring Textual and Cross-Modal Attention on Entities and Relations in Language-and-Vision TransformerNikolai Ilinykh, Simon Dobnik. 4062-4073 [doi]
- Improving Zero-Shot Cross-lingual Transfer Between Closely Related Languages by Injecting Character-Level NoiseNoëmi Aepli, Rico Sennrich. 4074-4083 [doi]
- Structural Supervision for Word Alignment and Machine TranslationLei Li, Kai Fan, Hongjia Li, Chun Yuan. 4084-4094 [doi]
- Focus on the Action: Learning to Highlight and Summarize Jointly for Email To-Do Items SummarizationKexun Zhang, Jiaao Chen, Diyi Yang. 4095-4106 [doi]
- Exploring the Capacity of a Large-scale Masked Language Model to Recognize Grammatical ErrorsRyo Nagata, Manabu Kimura, Kazuaki Hanawa. 4107-4118 [doi]
- Should We Trust This Summary? Bayesian Abstractive Summarization to The RescueAlexios Gidiotis, Grigorios Tsoumakas. 4119-4131 [doi]
- On the data requirements of probingZining Zhu, Jixuan Wang, Bai Li, Frank Rudzicz. 4132-4147 [doi]
- Translation Error Detection as Rationale ExtractionMarina Fomicheva, Lucia Specia, Nikolaos Aletras. 4148-4159 [doi]
- Towards Collaborative Neural-Symbolic Graph Semantic Parsing via UncertaintyZi Lin, Jeremiah Zhe Liu, Jingbo Shang. 4160-4173 [doi]
- Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence FrameworkZilong Wang 0002, Jingbo Shang. 4174-4186 [doi]
- On Length Divergence Bias in Textual Matching ModelsLan Jiang, Tianshu Lyu, Yankai Lin, Chong Meng, Xiaoyong Lyu, Dawei Yin. 4187-4193 [doi]
- What is wrong with you?: Leveraging User Sentiment for Automatic Dialog EvaluationSarik Ghazarian, Behnam Hedayatnia, Alexandros Papangelis, Yang Liu, Dilek Hakkani-Tur. 4194-4204 [doi]