- Frontmatter [doi]
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-CommerceSong Xu 0002, Haoran Li 0001, Peng Yuan 0002, Yujia Wang, Youzheng Wu, Xiaodong He 0002, Ying Liu, Bowen Zhou. 1-17 [doi]
- Extracting Topics with Simultaneous Word Co-occurrence and Semantic Correlation Graphs: Neural Topic Modeling for Short TextsYiming Wang, XiMing Li, Xiaotang Zhou, Jihong OuYang. 18-27 [doi]
- Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question AnsweringChenyu You, Nuo Chen, Yuexian Zou. 28-39 [doi]
- Language Clustering for Multilingual Named Entity RecognitionKyle Shaffer. 40-45 [doi]
- Neural News Recommendation with Collaborative News Encoding and Structural User EncodingZhiming Mao, Xingshan Zeng, Kam-Fai Wong. 46-55 [doi]
- Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering DataDian Yu, Kai Sun 0006, Dong Yu 0001, Claire Cardie. 56-68 [doi]
- A Web Scale Entity Extraction SystemXuanting Cai, Quanbin Ma, Jianyu Liu, Pan Li, Qi Zeng, Zhengkan Yang, Pushkar Tripathi. 69-73 [doi]
- Joint Multimedia Event Extraction from Video and ArticleBrian Chen, Xudong Lin 0003, Christopher Thomas, Manling Li, Shoya Yoshida, Lovish Chum, Heng Ji, Shih-Fu Chang. 74-88 [doi]
- Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language GroundingYuechen Wang, Wengang Zhou, Houqiang Li. 89-99 [doi]
- Factual Consistency Evaluation for Text Summarization via Counterfactual EstimationYuexiang Xie, Fei Sun 0001, Yang Deng 0002, Yaliang Li, Bolin Ding. 100-110 [doi]
- Cross-Modal Retrieval Augmentation for Multi-Modal ClassificationShir Gur, Natalia Neverova, Chris Stauffer, Ser-Nam Lim, Douwe Kiela, Austin Reiter. 111-123 [doi]
- HiTRANS: A Hierarchical Transformer Network for Nested Named Entity RecognitionZhiwei Yang 0005, Jing Ma 0004, Hechang Chen, Yunke Zhang, Yi Chang 0001. 124-132 [doi]
- Improving Embedding-based Large-scale Retrieval via Label EnhancementPeiyang Liu, Xi Wang, Sen Wang, Wei Ye, Xiangyu Xi, Shikun Zhang. 133-142 [doi]
- Improving Privacy Guarantee and Efficiency of Latent Dirichlet Allocation Model Training Under Differential PrivacyTao Huang, Hong Chen 0001. 143-152 [doi]
- Generating Mammography Reports from Multi-view Mammograms with BERTAlexander Yalunin, Elena Sokolova, Ilya Burenko, Alexander Ponomarchuk, Olga Puchkova, Dmitriy Umerenkov. 153-162 [doi]
- Euphemistic Phrase Detection by Masked Language ModelWanzheng Zhu, Suma Bhat. 163-168 [doi]
- Decomposing Complex Questions Makes Multi-Hop QA Easier and More InterpretableRuiliu Fu, Han Wang, Xuejun Zhang, Jun Zhou, Yonghong Yan 0002. 169-180 [doi]
- Segmenting Natural Language Sentences via Lexical Unit AnalysisYangming Li, Lemao Liu, Shuming Shi 0001. 181-187 [doi]
- Dense Hierarchical Retrieval for Open-domain Question AnsweringYe Liu, Kazuma Hashimoto, Yingbo Zhou, Semih Yavuz, Caiming Xiong, Philip S. Yu. 188-200 [doi]
- Visually Grounded Concept CompositionBowen Zhang 0002, Hexiang Hu, Linlu Qiu, Peter Shaw, Fei Sha. 201-215 [doi]
- Compositional Networks Enable Systematic Generalization for Grounded Language UnderstandingYen-ling Kuo, Boris Katz, Andrei Barbu. 216-226 [doi]
- An Unsupervised Method for Building Sentence Simplification Corpora in Multiple LanguagesXinyu Lu, Jipeng Qiang, Yun Li, Yunhao Yuan, Yi Zhu. 227-237 [doi]
- WhiteningBERT: An Easy Unsupervised Sentence Embedding ApproachJunjie Huang, Duyu Tang, Wanjun Zhong, Shuai Lu, Linjun Shou, Ming Gong, Daxin Jiang, Nan Duan. 238-244 [doi]
- TWEETSUMM - A Dialog Summarization Dataset for Customer ServiceGuy Feigenblat, R. Chulaka Gunasekara, Benjamin Sznajder, Sachindra Joshi, David Konopnicki, Ranit Aharonov. 245-260 [doi]
- Discourse-Based Sentence SplittingLiam Cripwell, Joël Legrand, Claire Gardent. 261-273 [doi]
- Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question AnsweringMinghan Li, Ming Li, Kun Xiong, Jimmy Lin. 274-287 [doi]
- Mining the Cause of Political Decision-Making from Social Media: A Case Study of COVID-19 Policies across the US StatesZhijing Jin, Zeyu Peng, Tejas Vaidhya, Bernhard Schölkopf, Rada Mihalcea. 288-301 [doi]
- Self-Attention Graph Residual Convolutional Networks for Event Detection with dependency relationsAnan Liu, Ning Xu 0003, Haozhe Liu. 302-311 [doi]
- Mixup Decoding for Diverse Machine TranslationJicheng Li, Pengzhi Gao, Xuanfu Wu, Yang Feng, Zhongjun He, Hua Wu 0003, Haifeng Wang 0001. 312-320 [doi]
- An Alignment-Agnostic Model for Chinese Text Error CorrectionLiying Zheng, Yue Deng, Weishun Song, Liang Xu, Jing Xiao. 321-326 [doi]
- Reasoning Visual Dialog with Sparse Graph Learning and Knowledge TransferGi-Cheon Kang, Junseok Park, Hwaran Lee, Byoung-Tak Zhang, Jin-Hwa Kim. 327-339 [doi]
- Exploring Sentence Community for Document-Level Event ExtractionYusheng Huang, Weijia Jia 0001. 340-351 [doi]
- A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue SystemsSan Kim, Jin Yea Jang, Minyoung Jung, Saim Shin. 352-365 [doi]
- WHOSe Heritage: Classification of UNESCO World Heritage Statements of "Outstanding Universal Value" with Soft LabelsNan Bai, Renqian Luo, Pirouz Nourian, Ana Pereira Roders. 366-384 [doi]
- P-INT: A Path-based Interaction Model for Few-shot Knowledge Graph CompletionJingwen Xu, Jing Zhang, Xirui Ke, Yuxiao Dong, Hong Chen, Cuiping Li, Yongbin Liu. 385-394 [doi]
- Cartography Active LearningMike Zhang, Barbara Plank. 395-406 [doi]
- Beyond Reptile: Meta-Learned Dot-Product Maximization between Gradients for Improved Single-Task RegularizationAkhil Kedia, Sai Chetan Chinthakindi, Wonho Ryu. 407-420 [doi]
- GooAQ: Open Question Answering with Diverse Answer TypesDaniel Khashabi, Amos Ng, Tushar Khot, Ashish Sabharwal, Hannaneh Hajishirzi, Chris Callison-Burch. 421-433 [doi]
- Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model PredictionsJavier Ferrando, Marta R. Costa-Jussà. 434-443 [doi]
- BFClass: A Backdoor-free Text Classification FrameworkZichao Li, Dheeraj Mekala, Chengyu Dong, Jingbo Shang. 444-453 [doi]
- Multilingual Chart-based Constituency Parse Extraction from Pre-trained Language ModelsTaeuk Kim, Bowen Li, Sang-goo Lee. 454-463 [doi]
- Hyperbolic Geometry is Not Necessary: Lightweight Euclidean-Based Models for Low-Dimensional Knowledge Graph EmbeddingsKai Wang, Yu Liu, Dan Lin, Michael Sheng. 464-474 [doi]
- CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models CascadeLei Li, Yankai Lin, Deli Chen, Shuhuai Ren, Peng Li, Jie Zhou, Xu Sun. 475-486 [doi]
- Semi-supervised Relation Extraction via Incremental Meta Self-TrainingXuming Hu, Chenwei Zhang, Fukun Ma, Chenyao Liu, Lijie Wen, Philip S. Yu. 487-496 [doi]
- Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement LearningYichao Luo, Yige Xu 0001, Jiacheng Ye, Xipeng Qiu, Qi Zhang. 497-507 [doi]
- Improving Knowledge Graph Embedding Using Affine Transformations of Entities Corresponding to Each RelationJinfa Yang, Yongjie Shi, Xin Tong, Robin Wang, Taiyan Chen, Xianghua Ying. 508-517 [doi]
- Using Question Answering Rewards to Improve Abstractive SummarizationChulaka Gunasekara, Guy Feigenblat, Benjamin Sznajder, Ranit Aharonov, Sachindra Joshi. 518-526 [doi]
- Effect Generation Based on Causal ReasoningFeiteng Mu, Wenjie Li 0002, Zhipeng Xie. 527-533 [doi]
- Distilling Word Meaning in Context from Pre-trained Language ModelsYuki Arase, Tomoyuki Kajiwara. 534-546 [doi]
- Unseen Entity Handling in Complex Question Answering over Knowledge Base via Language GenerationXin Huang, Jung-jae Kim, Bowei Zou. 547-557 [doi]
- Bidirectional Hierarchical Attention Networks based on Document-level Context for Emotion Cause ExtractionGuimin Hu, Guangming Lu, Yi Zhao. 558-568 [doi]
- Distantly Supervised Relation Extraction in Federated SettingsDianbo Sui, Yubo Chen 0001, Kang Liu 0001, Jun Zhao 0001. 569-583 [doi]
- Casting the Same Sentiment Classification ProblemErik Körner, Ahmad Dawar Hakimi, Gerhard Heyer, Martin Potthast. 584-590 [doi]
- Detecting Compositionally Out-of-Distribution Examples in Semantic ParsingDenis Lukovnikov, Sina Däubener, Asja Fischer. 591-598 [doi]
- Saliency-based Multi-View Mixed Language Training for Zero-shot Cross-lingual ClassificationSiyu Lai, Hui Huang, Dong Jing, Yufeng Chen, Jinan Xu, Jian Liu. 599-610 [doi]
- Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the SocietyFiroj Alam, Shaden Shaar, Fahim Dalvi, Hassan Sajjad, Alex Nikolov, Hamdy Mubarak, Giovanni Da San Martino, Ahmed Abdelali, Nadir Durrani, Kareem Darwish, Abdulaziz Al-Homaid, Wajdi Zaghouani, Tommaso Caselli, Gijs Danoe, Friso Stolk, Britt Bruntink, Preslav Nakov. 611-649 [doi]
- FANATIC: FAst Noise-Aware TopIc ClusteringAri Silburt, Anja Subasic, Evan Thompson, Carmeline Dsilva, Tarec Fares. 650-663 [doi]
- Stream-level Latency Evaluation for Simultaneous Machine TranslationJavier Iranzo-Sánchez, Jorge Civera Saiz, Alfons Juan. 664-670 [doi]
- TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding LearningKexin Wang, Nils Reimers 0001, Iryna Gurevych. 671-688 [doi]
- How Suitable Are Subword Segmentation Strategies for Translating Non-Concatenative Morphology?Chantal Amrhein, Rico Sennrich. 689-705 [doi]
- Rethinking Why Intermediate-Task Fine-Tuning WorksTing-Yun Chang, Chi-Jen Lu. 706-713 [doi]
- Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot LearningXisen Jin, Bill Yuchen Lin, Mohammad Rostami, Xiang Ren 0001. 714-729 [doi]
- Efficient Test Time Adapter Ensembling for Low-resource Language VarietiesXinyi Wang, Yulia Tsvetkov, Sebastian Ruder, Graham Neubig. 730-737 [doi]
- An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding SpacesKelly Marchisio, Youngser Park, Ali Saad-Eldin, Anton Alyakin, Kevin Duh, Carey E. Priebe, Philipp Koehn. 738-749 [doi]
- How to Select One Among All ? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language UnderstandingTianda Li, Ahmad Rashid, Aref Jafari, Pranav Sharma, Ali Ghodsi 0001, Mehdi Rezagholizadeh. 750-762 [doi]
- Recommend for a Reason: Unlocking the Power of Unsupervised Aspect-Sentiment Co-ExtractionZeYu Li, Wei Cheng, Reema Kshetramade, John Houser, Haifeng Chen, Wei Wang. 763-778 [doi]
- Learning Hard Retrieval Decoder Attention for TransformersHongfei Xu, Qiuhui Liu, Josef van Genabith, Deyi Xiong. 779-785 [doi]
- Recall and Learn: A Memory-augmented Solver for Math Word ProblemsShifeng Huang, Jiawei Wang, Jiao Xu, Da Cao, Ming Yang. 786-796 [doi]
- An Uncertainty-Aware Encoder for Aspect DetectionThi-Nhung Nguyen, Kiem-Hieu Nguyen, Young-In Song, Tuan-Dung Cao. 797-806 [doi]
- Improving Empathetic Response Generation by Recognizing Emotion Cause in ConversationsJun Gao, Yuhan Liu, Haolin Deng, Wei Wang, Yu Cao, Jiachen Du, Ruifeng Xu. 807-819 [doi]
- Probing Across Time: What Does RoBERTa Know and When?Zeyu Liu, Yizhong Wang, Jungo Kasai, Hannaneh Hajishirzi, Noah A. Smith. 820-842 [doi]
- Knowledge-Guided Paraphrase IdentificationHaoyu Wang, Fenglong Ma, Yaqing Wang, Jing Gao. 843-853 [doi]
- R2-D2: A Modular Baseline for Open-Domain Question AnsweringMartin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz. 854-870 [doi]
- What Does Your Smile Mean? Jointly Detecting Multi-Modal Sarcasm and Sentiment Using Quantum ProbabilityYaochen Liu, Yazhou Zhang, Qiuchi Li, Benyou Wang, Dawei Song 0001. 871-880 [doi]
- Discovering Representation Sprachbund For Multilingual Pre-TrainingYimin Fan, Yaobo Liang, Alexandre Muzio, Hany Hassan, Houqiang Li, Ming Zhou 0001, Nan Duan. 881-894 [doi]
- Plan-then-Generate: Controlled Data-to-Text Generation via PlanningYixuan Su, David Vandyke, Sihui Wang, Yimai Fang, Nigel Collier. 895-909 [doi]
- Few-Shot Table-to-Text Generation with Prototype MemoryYixuan Su, Zaiqiao Meng, Simon Baker, Nigel Collier. 910-917 [doi]
- Leveraging Word-Formation Knowledge for Chinese Word Sense DisambiguationHua Zheng, Lei Li, Damai Dai, Deli Chen, Tianyu Liu, Xu Sun, Yang Liu. 918-923 [doi]
- Exploiting Curriculum Learning in Unsupervised Neural Machine TranslationJinliang Lu, Jiajun Zhang. 924-934 [doi]
- Robust Fragment-Based Framework for Cross-lingual Sentence RetrievalNattapol Trijakwanich, Peerat Limkonchotiwat, Raheem Sarwar, Wannaphong Phatthiyaphaibun, Ekapol Chuangsuwanich, Sarana Nutanong. 935-944 [doi]
- Towards Improving Adversarial Training of NLP ModelsJin-Yong Yoo, Yanjun Qi. 945-956 [doi]
- To Protect and To Serve? Analyzing Entity-Centric Framing of Police ViolenceCaleb Ziems, Diyi Yang. 957-976 [doi]
- Calibrate your listeners! Robust communication-based training for pragmatic speakersRose E. Wang, Julia White, Jesse Mu, Noah Goodman. 977-984 [doi]
- When Retriever-Reader Meets Scenario-Based Multiple-Choice QuestionsZixian Huang, Ao Wu, Yulin Shen, Gong Cheng 0001, Yuzhong Qu. 985-994 [doi]
- Structured abbreviation expansion in contextKyle Gorman, Christo Kirov, Brian Roark, Richard Sproat. 995-1005 [doi]
- Task-adaptive Pre-training and Self-training are Complementary for Natural Language UnderstandingShiyang Li, Semih Yavuz, Wenhu Chen, Xifeng Yan. 1006-1015 [doi]
- CNNBiF: CNN-based Bigram Features for Named Entity RecognitionChul Sung, Vaibhava Goel, Etienne Marcheret, Steven J. Rennie, David Nahamoo. 1016-1021 [doi]
- Compositional Generalization via Semantic TaggingHao Zheng, Mirella Lapata. 1022-1032 [doi]
- Towards Document-Level Paraphrase Generation with Sentence Rewriting and ReorderingZhe Lin, Yitao Cai, Xiaojun Wan 0001. 1033-1044 [doi]
- Exploring Decomposition for Table-based Fact VerificationXiaoyu Yang, Xiaodan Zhu. 1045-1052 [doi]
- Diversity and Consistency: Exploring Visual Question-Answer Pair GenerationSen Yang, Qingyu Zhou, Dawei Feng, Yang Liu, Chao Li, Yunbo Cao, Dongsheng Li. 1053-1066 [doi]
- Entity-level Cross-modal Learning Improves Multi-modal Machine TranslationXin Huang, Jiajun Zhang, Chengqing Zong. 1067-1080 [doi]
- Learning to Ground Visual Objects for Visual DialogFeilong Chen, Xiuyi Chen, Can Xu, Daxin Jiang. 1081-1091 [doi]
- KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple SubgoalsJun Zhang, Yan Yang, Chencai Chen, Liang He 0001, Zhou Yu. 1092-1101 [doi]
- Less Is More: Domain Adaptation with Lottery Ticket for Reading ComprehensionHaichao Zhu, Zekun Wang, Heng Zhang, Ming Liu, Sendong Zhao, Bing Qin 0001. 1102-1113 [doi]
- Effectiveness of Pre-training for Few-shot Intent ClassificationHaode Zhang, Yuwei Zhang, Li-Ming Zhan, Jiaxin Chen, Guangyuan Shi, Xiao-Ming Wu, Albert Y. S. Lam. 1114-1120 [doi]
- Improving Abstractive Dialogue Summarization with Hierarchical Pretraining and Topic SegmentMengNan Qi, Hao Liu, Yuzhuo Fu, Ting Liu. 1121-1130 [doi]
- Learning to Answer Psychological Questionnaire for Personality DetectionFeifan Yang, Tao Yang, Xiaojun Quan, Qinliang Su. 1131-1142 [doi]
- Exploiting Reasoning Chains for Multi-hop Science Question AnsweringWeiwen Xu, Yang Deng, HuiHui Zhang, Deng Cai 0002, Wai Lam. 1143-1156 [doi]
- Winnowing Knowledge for Multi-choice Question AnsweringYeqiu Li, Bowei Zou, Zhifeng Li, Ai Ti Aw, Yu Hong, Qiaoming Zhu. 1157-1165 [doi]
- Neural Media Bias Detection Using Distant Supervision With BABE - Bias Annotations By ExpertsTimo Spinde, Manuel Plank, Jan-David Krieger, Terry Ruas, Bela Gipp, Akiko Aizawa. 1166-1177 [doi]
- Learning and Evaluating a Differentially Private Pre-trained Language ModelShlomo Hoory, Amir Feder, Avichai Tendler, Sofia Erell, Alon Peled-Cohen, Itay Laish, Hootan Nakhost, Uri Stemmer, Ayelet Benjamini, Avinatan Hassidim, Yossi Matias. 1178-1189 [doi]
- Simulated Chats for Building Dialog Systems: Learning to Generate Conversations from InstructionsBiswesh Mohapatra, Gaurav Pandey 0001, Danish Contractor, Sachindra Joshi. 1190-1203 [doi]
- Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological KnowledgeJiangnan Li, Zheng Lin 0001, Peng Fu 0008, Weiping Wang. 1204-1214 [doi]
- An unsupervised framework for tracing textual sources of moral changeAida Ramezani, Zining Zhu, Frank Rudzicz, Yang Xu 0023. 1215-1228 [doi]
- Topic-Aware Contrastive Learning for Abstractive Dialogue SummarizationJunpeng Liu, Yanyan Zou, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Caixia Yuan, Xiaojie Wang. 1229-1243 [doi]
- TWT: Table with Written Text for Controlled Data-to-Text GenerationTongliang Li, Lei Fang, Jian-Guang Lou, Zhoujun Li. 1244-1254 [doi]
- ArabicTransformer: Efficient Large Arabic Language Model with Funnel Transformer and ELECTRA ObjectiveSultan Alrowili, Vijay-Shanker. 1255-1261 [doi]
- Which is Making the Contribution: Modulating Unimodal and Cross-modal Dynamics for Multimodal Sentiment AnalysisYing Zeng, Sijie Mai, Haifeng Hu 0001. 1262-1274 [doi]
- CVAE-based Re-anchoring for Implicit Discourse Relation ClassificationZujun Dou, Yu Hong, Yu Sun, Guodong Zhou. 1275-1283 [doi]
- Combining Curriculum Learning and Knowledge Distillation for Dialogue GenerationQingqing Zhu, Xiuying Chen, Pengfei Wu, Junfei Liu, Dongyan Zhao 0001. 1284-1295 [doi]
- Improving End-to-End Task-Oriented Dialog System with A Simple Auxiliary TaskYohan Lee. 1296-1303 [doi]
- EDTC: A Corpus for Discourse-Level Topic Chain ParsingLongyin Zhang, Xin Tan, Fang Kong 0001, Guodong Zhou. 1304-1312 [doi]
- Multilingual Neural Machine Translation: Can Linguistic Hierarchies Help?Fahimeh Saleh, Wray L. Buntine, Gholamreza Haffari, Lan Du 0002. 1313-1330 [doi]
- Self Question-answering: Aspect-based Sentiment Analysis by Role Flipped Machine Reading ComprehensionGuoxin Yu, Jiwei Li, Ling Luo, Yuxian Meng, Xiang Ao 0001, Qing He 0003. 1331-1342 [doi]
- Generalization in Text-based Games via Hierarchical Reinforcement LearningYunqiu Xu, Meng Fang, Ling Chen, Yali Du, Chengqi Zhang. 1343-1353 [doi]
- A Finer-grain Universal Dialogue Semantic Structures based Model For Abstractive Dialogue SummarizationYuejie Lei, Fujia Zheng, Yuanmeng Yan, Keqing He, Weiran Xu. 1354-1364 [doi]
- Constructing contrastive samples via summarization for text classification with limited annotationsYangkai Du, Tengfei Ma 0001, Lingfei Wu, Fangli Xu, Xuhong Zhang 0005, Bo Long, Shouling Ji. 1365-1376 [doi]
- End-to-end Neural Information Status ClassificationYufang Hou. 1377-1388 [doi]
- EventKE: Event-Enhanced Knowledge Graph EmbeddingZixuan Zhang, Hongwei Wang, Han Zhao, Hanghang Tong, Heng Ji. 1389-1400 [doi]
- Modeling Concentrated Cross-Attention for Neural Machine Translation with Gaussian Mixture ModelShaolei Zhang, Yang Feng. 1401-1411 [doi]
- Inconsistency Matters: A Knowledge-guided Dual-inconsistency Network for Multi-modal Rumor DetectionMengzhu Sun, Xi Zhang, Jianqiang Ma, Yazheng Liu. 1412-1423 [doi]
- EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge DistillationChenhe Dong, Guangrun Wang, Hang Xu, Jiefeng Peng, Xiaozhe Ren, Xiaodan Liang. 1424-1437 [doi]
- Uni-FedRec: A Unified Privacy-Preserving News Recommendation Framework for Model Training and Online ServingTao Qi, Fangzhao Wu, Chuhan Wu, Yongfeng Huang, Xing Xie 0001. 1438-1448 [doi]
- Mapping Language to Programs using Multiple Reward Components with Inverse Reinforcement LearningSayan Ghosh, Shashank Srivastava. 1449-1462 [doi]
- Topic-Guided Abstractive Multi-Document SummarizationPeng Cui 0006, Le Hu. 1463-1472 [doi]
- An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem SolvingQinzhuo Wu, Qi Zhang, Zhongyu Wei. 1473-1482 [doi]
- SciXGen: A Scientific Paper Dataset for Context-Aware Text GenerationHong Chen, Hiroya Takamura, Hideki Nakayama. 1483-1492 [doi]
- Don't Miss the Potential Customers! Retrieving Similar Ads to Improve User TargetingYi Feng, Ting Wang, Chuanyi Li, Vincent Ng, JiDong Ge, Bin Luo, Yucheng Hu, Xiaopeng Zhang. 1493-1503 [doi]
- Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous GraphNuttapong Chairatanakul, Noppayut Sriwatanasakdi, Nontawat Charoenphakdee, Xin Liu 0020, Tsuyoshi Murata. 1504-1517 [doi]
- Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising LearningXinghua Zhang 0001, Bowen Yu 0002, Tingwen Liu, Zhenyu Zhang 0006, Jiawei Sheng, Mengge Xue, Hongbo Xu. 1518-1529 [doi]
- Entity-Based Semantic Adequacy for Data-to-Text GenerationJuliette Faille, Albert Gatt, Claire Gardent. 1530-1540 [doi]
- MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News SummarizationXinnuo Xu, Ondrej Dusek, Shashi Narayan, Verena Rieser, Ioannis Konstas. 1541-1552 [doi]
- A Conditional Generative Matching Model for Multi-lingual Reply SuggestionBudhaditya Deb, Guoqing Zheng, Milad Shokouhi, Ahmed Hassan Awadallah. 1553-1568 [doi]
- Rethinking Sentiment Style TransferPing Yu, Yang Zhao, Chunyuan Li, Changyou Chen. 1569-1582 [doi]
- HypoGen: Hyperbole Generation with Commonsense and Counterfactual KnowledgeYufei Tian, Arvind Krishna Sridhar, Nanyun Peng. 1583-1593 [doi]
- Profiling News Discourse Structure Using Explicit Subtopic Structures Guided CriticsPrafulla Kumar Choubey, Ruihong Huang. 1594-1605 [doi]
- ProtoInfoMax: Prototypical Networks with Mutual Information Maximization for Out-of-Domain DetectionIftitahu Ni'mah, Meng Fang, Vlado Menkovski, Mykola Pechenizkiy. 1606-1617 [doi]
- Learning from Language Description: Low-shot Named Entity Recognition via Decomposed FrameworkYaqing Wang, Haoda Chu, Chao Zhang, Jing Gao. 1618-1630 [doi]
- BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural NetworksTuan Lai, Heng Ji, ChengXiang Zhai. 1631-1639 [doi]
- Char2Subword: Extending the Subword Embedding Space Using Robust Character CompositionalityGustavo Aguilar, Bryan McCann, Tong Niu, Nazneen Rajani, Nitish Shirish Keskar, Thamar Solorio. 1640-1651 [doi]
- Exploring Multitask Learning for Low-Resource Abstractive SummarizationAhmed Magooda, Diane J. Litman, Mohamed Elaraby. 1652-1661 [doi]
- Conical Classification For Efficient One-Class Topic DeterminationSameer Khanna. 1662-1673 [doi]
- Improving Dialogue State Tracking with Turn-based Loss Function and Sequential Data AugmentationJarana Manotumruksa, Jeff Dalton 0001, Edgar Meij, Emine Yilmaz. 1674-1683 [doi]
- TIAGE: A Benchmark for Topic-Shift Aware Dialog ModelingHuiyuan Xie, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Ann A. Copestake. 1684-1690 [doi]
- Optimal Neural Program Synthesis from Multimodal SpecificationsXi Ye, Qiaochu Chen, Isil Dillig, Greg Durrett. 1691-1704 [doi]
- Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span AnnotationsShifeng Liu 0002, Yifang Sun, Bing Li, Wei Wang 0011, Florence T. Bourgeois, Adam G. Dunn. 1705-1715 [doi]
- When in Doubt: Improving Classification Performance with Alternating NormalizationMenglin Jia, Austin Reiter, Ser-Nam Lim, Yoav Artzi, Claire Cardie. 1716-1723 [doi]
- APGN: Adversarial and Parameter Generation Networks for Multi-Source Cross-Domain Dependency ParsingYing Li, Meishan Zhang, Zhenghua Li, Min Zhang, Zhefeng Wang, Baoxing Huai, Nicholas Jing Yuan. 1724-1733 [doi]
- "Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative UnderstandingFaeze Brahman, Meng Huang, Oyvind Tafjord, Chao Zhao, Mrinmaya Sachan, Snigdha Chaturvedi. 1734-1752 [doi]
- Towards Developing a Multilingual and Code-Mixed Visual Question Answering System by Knowledge DistillationHumair Raj Khan, Deepak Gupta, Asif Ekbal. 1753-1767 [doi]
- An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment AnalysisYunlong Liang, Fandong Meng, Jinchao Zhang, Yufeng Chen 0005, Jinan Xu, Jie Zhou. 1768-1780 [doi]
- Semantic Alignment with Calibrated Similarity for Multilingual Sentence EmbeddingJiyeon Ham, Eun-Sol Kim. 1781-1791 [doi]
- fBERT: A Neural Transformer for Identifying Offensive ContentDiptanu Sarkar, Marcos Zampieri, Tharindu Ranasinghe, Alexander G. Ororbia. 1792-1798 [doi]
- WIKIBIAS: Detecting Multi-Span Subjective Biases in LanguageYang Zhong, Jingfeng Yang, Wei Xu, Diyi Yang. 1799-1814 [doi]
- UnClE: Explicitly Leveraging Semantic Similarity to Reduce the Parameters of Word EmbeddingsZhi Li, Yuchen Zhai, Chengyu Wang 0001, Minghui Qiu, Kailiang Li, Yin Zhang. 1815-1828 [doi]
- Grounded Graph Decoding improves Compositional Generalization in Question AnsweringYu Gai, Paras Jain 0001, Wendi Zhang, Joseph Gonzalez 0001, Dawn Song, Ion Stoica. 1829-1838 [doi]
- Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented GuesserDuo Zheng, Zipeng Xu, Fandong Meng, Xiaojie Wang, Jiaan Wang, Jie Zhou. 1839-1851 [doi]
- A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge BaseYu Feng, Jing Zhang, Gaole He, Wayne Xin Zhao, Lemao Liu, Quan Liu, Cuiping Li, Hong Chen. 1852-1861 [doi]
- RoR: Read-over-Read for Long Document Machine Reading ComprehensionJing Zhao, Junwei Bao 0001, Yifan Wang, Yongwei Zhou, Youzheng Wu, Xiaodong He 0002, Bowen Zhou. 1862-1872 [doi]
- Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic ParsingAkshat Shrivastava, Pierce Chuang, Arun Babu, Shrey Desai, Abhinav Arora, Alexander Zotov, Ahmed Aly. 1873-1886 [doi]
- Language Resource Efficient Learning for CaptioningJia Chen 0001, Yike Wu, Shiwan Zhao, Qin Jin. 1887-1895 [doi]
- Translation as Cross-Domain Knowledge: Attention Augmentation for Unsupervised Cross-Domain Segmenting and Labeling TasksRuixuan Luo, Yi Zhang, Sishuo Chen, Xu Sun. 1896-1906 [doi]
- ContractNLI: A Dataset for Document-level Natural Language Inference for ContractsYuta Koreeda, Christopher D. Manning. 1907-1919 [doi]
- Japanese Zero Anaphora Resolution Can Benefit from Parallel Texts Through Neural Transfer LearningMasato Umakoshi, Yugo Murawaki, Sadao Kurohashi. 1920-1934 [doi]
- Grouped-Attention for Content-Selection and Content-Plan GenerationBayu Distiawan Trisedya, Xiaojie Wang, Jianzhong Qi 0001, Rui Zhang 0003, Qingjun Cui. 1935-1944 [doi]
- An Explicit-Joint and Supervised-Contrastive Learning Framework for Few-Shot Intent Classification and Slot FillingHan Liu, Feng Zhang, Xiaotong Zhang 0003, Siyang Zhao, Xianchao Zhang. 1945-1955 [doi]
- Retrieve, Discriminate and Rewrite: A Simple and Effective Framework for Obtaining Affective Response in Retrieval-Based ChatbotsXin Lu, Yijian Tian, Yanyan Zhao, Bing Qin 0001. 1956-1969 [doi]
- Span Fine-tuning for Pre-trained Language ModelsRongzhou Bao, Zhuosheng Zhang 0001, Hai Zhao. 1970-1979 [doi]
- DIRECT: Direct and Indirect Responses in Conversational Text CorpusJunya Takayama, Tomoyuki Kajiwara, Yuki Arase. 1980-1989 [doi]
- Retrieval, Analogy, and Composition: A framework for Compositional Generalization in Image CaptioningZhan Shi, Hui Liu, Martin Renqiang Min, Christopher Malon, Li Erran Li, Xiaodan Zhu. 1990-2000 [doi]
- TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text GenerationAdaku Uchendu, Zeyu Ma, Thai Le, Rui Zhang, Dongwon Lee 0001. 2001-2016 [doi]
- Say 'YES' to Positivity: Detecting Toxic Language in Workplace CommunicationsMeghana Moorthy Bhat, Saghar Hosseini, Ahmed Hassan Awadallah, Paul N. Bennett, Weisheng Li. 2017-2029 [doi]
- Natural SQL: Making SQL Easier to Infer from Natural Language SpecificationsYujian Gan, Xinyun Chen, Jinxia Xie, Matthew Purver, John R. Woodward, John H. Drake, Qiaofu Zhang. 2030-2042 [doi]
- Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive SummarizationAhmed Magooda, Diane J. Litman. 2043-2052 [doi]
- Self- and Pseudo-self-supervised Prediction of Speaker and Key-utterance for Multi-party Dialogue Reading ComprehensionYiyang Li 0002, Hai Zhao. 2053-2063 [doi]
- Few-Shot Novel Concept Learning for Semantic ParsingSoham Dan, Osbert Bastani, Dan Roth. 2064-2075 [doi]
- Compositional Data and Task Augmentation for Instruction FollowingSoham Dan, Xinran Han, Dan Roth. 2076-2081 [doi]
- Are Factuality Checkers Reliable? Adversarial Meta-evaluation of Factuality in SummarizationYiran Chen, Pengfei Liu 0003, Xipeng Qiu. 2082-2095 [doi]
- On the Effects of Transformer Size on In- and Out-of-Domain CalibrationSoham Dan, Dan Roth. 2096-2101 [doi]
- Detecting Polarized Topics Using Partisanship-aware Contextualized Topic EmbeddingsZihao He, Negar Mokhberian, António Câmara, Andrés Abeliuk, Kristina Lerman. 2102-2118 [doi]
- GenerativeRE: Incorporating a Novel Copy Mechanism and Pretrained Model for Joint Entity and Relation ExtractionJiarun Cao, Sophia Ananiadou. 2119-2126 [doi]
- Re-entry Prediction for Online Conversations via Self-Supervised LearningLingzhi Wang, Xingshan Zeng, Huang Hu, Kam-Fai Wong, Daxin Jiang. 2127-2137 [doi]
- proScript: Partially Ordered Scripts GenerationKeisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark, Yejin Choi. 2138-2149 [doi]
- Speaker Turn Modeling for Dialogue Act ClassificationZihao He, Leili Tavabi, Kristina Lerman, Mohammad Soleymani 0001. 2150-2157 [doi]
- Unsupervised Domain Adaptation Method with Semantic-Structural Alignment for Dependency ParsingBoda Lin, Mingzheng Li, Si Li, Yong Luo. 2158-2167 [doi]
- Devil's Advocate: Novel Boosting Ensemble Method from Psychological Findings for Text ClassificationHwiyeol Jo, Jaeseo Lim, Byoung-Tak Zhang. 2168-2174 [doi]
- SideControl: Controlled Open-domain Dialogue Generation via Additive Side NetworksWanyu Du, Yangfeng Ji. 2175-2194 [doi]
- Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models' TransferabilityWei-Tsung Kao, Hung-yi Lee. 2195-2208 [doi]
- Geo-BERT Pre-training Model for Query Rewriting in POI SearchXiao Liu, Juan Hu, Qi Shen, Huan Chen. 2209-2214 [doi]
- Leveraging Bidding Graphs for Advertiser-Aware Relevance Modeling in Sponsored SearchShuxian Bi, Chaozhuo Li, Xiao Han, Zheng Liu, Xing Xie, Haizhen Huang, Zengxuan Wen. 2215-2224 [doi]
- GPT3Mix: Leveraging Large-scale Language Models for Text AugmentationKang Min Yoo, Dongju Park, Jaewook Kang, Sang-Woo Lee, Woo-Myoung Park. 2225-2239 [doi]
- Context-aware Entity Typing in Knowledge GraphsWeiran Pan, Wei Wei 0002, Xian-Ling Mao. 2240-2250 [doi]
- Attribute Alignment: Controlling Text Generation from Pre-trained Language ModelsDian Yu, Zhou Yu, Kenji Sagae. 2251-2268 [doi]
- Generate & Rank: A Multi-task Framework for Math Word ProblemsJianhao Shen, Yichun Yin, Lin Li, Lifeng Shang, Xin Jiang, Ming Zhang 0004, Qun Liu 0001. 2269-2279 [doi]
- MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question AnsweringJunjie Wang, Yatai Ji, Jiaqi Sun, Yujiu Yang, Tetsuya Sakai. 2280-2292 [doi]
- UniteD-SRL: A Unified Dataset for Span- and Dependency-Based Multilingual and Cross-Lingual Semantic Role LabelingRocco Tripodi, Simone Conia, Roberto Navigli. 2293-2305 [doi]
- Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer RetrievalYanmeng Wang, Jun Bai, Ye Wang, Jianfei Zhang, Wenge Rong, Zongcheng Ji, Shaojun Wang, Jing Xiao. 2306-2315 [doi]
- A Neural Graph-based Local Coherence ModelMohsen Mesgar, Leonardo F. R. Ribeiro, Iryna Gurevych. 2316-2321 [doi]
- GiBERT: Enhancing BERT with Linguistic Information using a Lightweight Gated Injection MethodNicole Peinelt, Marek Rei, Maria Liakata. 2322-2336 [doi]
- RollingLDA: An Update Algorithm of Latent Dirichlet Allocation to Construct Consistent Time Series from Textual DataJonas Rieger, Carsten Jentsch, Jörg Rahnenführer. 2337-2347 [doi]
- What If Sentence-hood is Hard to Define: A Case Study in Chinese Reading ComprehensionJiawei Wang, Hai Zhao, Yinggong Zhao, Libin Shen. 2348-2359 [doi]
- Refining BERT Embeddings for Document Hashing via Mutual Information MaximizationZijing Ou, Qinliang Su, Jianxing Yu, Ruihui Zhao, Yefeng Zheng, Bang Liu. 2360-2369 [doi]
- REBEL: Relation Extraction By End-to-end Language generationPere-Lluís Huguet Cabot, Roberto Navigli. 2370-2381 [doi]
- Wine is not v i n. On the Compatibility of Tokenizations across LanguagesAntonis Maronikolakis, Philipp Dufter, Hinrich Schütze. 2382-2399 [doi]
- Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social MediaPaul Röttger, Janet B. Pierrehumbert. 2400-2412 [doi]
- Skim-Attention: Learning to Focus via Document LayoutLaura Nguyen, Thomas Scialom, Jacopo Staiano, Benjamin Piwowarski. 2413-2427 [doi]
- Attention-based Contrastive Learning for Winograd SchemasTassilo Klein, Moin Nabi. 2428-2434 [doi]
- Give the Truth: Incorporate Semantic Slot into Abstractive Dialogue SummarizationLulu Zhao, Weihao Zeng, Weiran Xu, Jun Guo 0002. 2435-2446 [doi]
- Challenges in Detoxifying Language ModelsJohannes Welbl, Amelia Glaese, Jonathan Uesato, Sumanth Dathathri, John Mellor, Lisa Anne Hendricks, Kirsty Anderson, Pushmeet Kohli, Ben Coppin, Po-Sen Huang. 2447-2469 [doi]
- Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine TranslationShahar Levy, Koren Lazar, Gabriel Stanovsky. 2470-2480 [doi]
- Competence-based Curriculum Learning for Multilingual Machine TranslationMingliang Zhang, Fandong Meng, Yunhai Tong, Jie Zhou. 2481-2493 [doi]
- Informed Sampling for Diversity in Concept-to-Text NLGGiulio Zhou, Gerasimos Lampouras. 2494-2509 [doi]
- Novel Natural Language Summarization of Program Code via Leveraging Multiple Input RepresentationsFuxiang Chen, Mijung Kim, Jaegul Choo. 2510-2520 [doi]
- WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NERSimone Tedeschi, Valentino Maiorca, Niccolò Campolungo, Francesco Cecconi, Roberto Navigli. 2521-2533 [doi]
- Beyond Grammatical Error Correction: Improving L1-influenced research writing in English using pre-trained encoder-decoder modelsGustavo Zomer, Ana Frankenberg-Garcia. 2534-2540 [doi]
- Classification and Geotemporal Analysis of Quality-of-Life Issues in Tenant ReviewsAdam Haber, Zeev Waks. 2541-2553 [doi]
- Probing Pre-trained Language Models for Semantic Attributes and their ValuesMeriem Beloucif, Chris Biemann. 2554-2559 [doi]
- Uncovering the Limits of Text-based Emotion DetectionNurudin Alvarez-Gonzalez, Andreas Kaltenbrunner, Vicenç Gómez. 2560-2583 [doi]
- Named Entity Recognition for Entity Linking: What Works and What's NextSimone Tedeschi, Simone Conia, Francesco Cecconi, Roberto Navigli. 2584-2596 [doi]
- Learning Numeracy: A Simple Yet Effective Number Embedding Approach Using Knowledge GraphHanyu Duan, Yi Yang, Kar Yan Tam. 2597-2602 [doi]
- Weakly Supervised Semantic Parsing by Learning from MistakesJiaqi Guo, Jian-Guang Lou, Ting Liu, Dongmei Zhang. 2603-2617 [doi]
- CodeQA: A Question Answering Dataset for Source Code ComprehensionChenxiao Liu, Xiaojun Wan 0001. 2618-2632 [doi]
- Subword Mapping and Anchoring across LanguagesGiorgos Vernikos, Andrei Popescu-Belis. 2633-2647 [doi]
- CDLM: Cross-Document Language ModelingAvi Caciularu, Arman Cohan, Iz Beltagy, Matthew E. Peters, Arie Cattan, Ido Dagan. 2648-2662 [doi]
- Patterns of Polysemy and Homonymy in Contextualised Language ModelsJanosch Haber, Massimo Poesio. 2663-2676 [doi]
- Cross-Lingual Leveled Reading Based on Language-Invariant FeaturesSimin Rao, Hua Zheng, Sujian Li. 2677-2682 [doi]
- Controlled Neural Sentence-Level Reframing of News ArticlesWei-Fan Chen, Khalid Al Khatib, Benno Stein 0001, Henning Wachsmuth. 2683-2693 [doi]
- DialogueTRM: Exploring Multi-Modal Emotional Dynamics in a ConversationYuzhao Mao, Guang Liu 0007, Xiaojie Wang, Weiguo Gao, Xuan Li. 2694-2704 [doi]
- Adversarial Examples for Evaluating Math Word Problem SolversVivek Kumar, Rishabh Maheshwary, Vikram Pudi. 2705-2712 [doi]
- Improving Numerical Reasoning Skills in the Modular Approach for Complex Question Answering on TextXiaoyu Guo, Yuan-Fang Li, Gholamreza Haffari. 2713-2718 [doi]
- Retrieval Augmented Code Generation and SummarizationMd. Rizwan Parvez, Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang. 2719-2734 [doi]
- Multilingual Translation via Grafting Pre-trained Language ModelsZewei Sun, Mingxuan Wang, Lei Li 0005. 2735-2747 [doi]
- AEDA: An Easier Data Augmentation Technique for Text ClassificationAkbar Karimi, Leonardo Rossi, Andrea Prati 0001. 2748-2754 [doi]
- A Comprehensive Comparison of Word Embeddings in Event & Entity Coreference ResolutionJudicael Poumay, Ashwin Ittoo. 2755-2764 [doi]
- Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech RecognitionGuolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang, Liang Lin. 2765-2777 [doi]
- Multilingual AMR Parsing with Noisy Knowledge DistillationDeng Cai 0002, Xin Li 0056, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam. 2778-2789 [doi]
- Open-Domain Contextual Link Prediction and its Complementarity with Entailment GraphsMohammad Javad Hosseini, Shay B. Cohen, Mark Johnson, Mark Steedman. 2790-2802 [doi]
- Analysis of Language Change in Collaborative Instruction FollowingAnna Effenberger, Rhia Singh, Eva Yan, Alane Suhr, Yoav Artzi. 2803-2811 [doi]
- Counter-Interference Adapter for Multilingual Machine TranslationYaoming Zhu, JiangTao Feng, Chengqi Zhao, Mingxuan Wang, Lei Li. 2812-2823 [doi]
- Progressive Transformer-Based Generation of Radiology ReportsFarhad Nooralahzadeh, Nicolas Perez Gonzalez, Thomas Frauenfelder, Koji Fujimoto, Michael Krauthammer. 2824-2832 [doi]
- "Be nice to your wife! The restaurants are closed": Can Gender Stereotype Detection Improve Sexism Classification?Patricia Chiril, Farah Benamara, Véronique Moriceau. 2833-2844 [doi]
- Automatic Discrimination between Inherited and Borrowed Latin Words in Romance LanguagesAlina Maria Cristea, Liviu P. Dinu, Simona Georgescu, Mihnea-Lucian Mihai, Ana Sabina Uban. 2845-2855 [doi]
- Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt CollectionsRuiqi Zhong, Kristy Lee, Zheng Zhang, Dan Klein. 2856-2878 [doi]
- Knowledge-Interactive Network with Sentiment Polarity Intensity-Aware Multi-Task Learning for Emotion Recognition in ConversationsYunhe Xie, Kailai Yang, Chengjie Sun, Bingquan Liu, Zhenzhou Ji. 2879-2889 [doi]
- Minimizing Annotation Effort via Max-Volume Spectral SamplingAriadna Quattoni, Xavier Carreras. 2890-2899 [doi]
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine TranslationXuebo Liu 0002, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Shuming Shi 0001, Zhaopeng Tu. 2900-2907 [doi]
- Lexicon-Based Graph Convolutional Network for Chinese Word SegmentationKaiyu Huang, Hao Yu, Junpeng Liu, Wei Liu, Jingxiang Cao, Degen Huang. 2908-2917 [doi]
- KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense ReasoningHaonan Li 0003, Yeyun Gong, Jian Jiao 0007, Ruofei Zhang, Timothy Baldwin, Nan Duan. 2918-2928 [doi]
- Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpusDaniela Trotta, Raffaele Guarasci, Elisa Leonardelli, Sara Tonelli. 2929-2940 [doi]
- Hyperbolic Hierarchy-Aware Knowledge Graph Embedding for Link PredictionZhe Pan, Peng Wang. 2941-2948 [doi]
- A Discourse-Aware Graph Neural Network for Emotion Recognition in Multi-Party ConversationYang Sun, Nan Yu, Guohong Fu. 2949-2958 [doi]
- MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance DetectionMatthew Matero, Nikita Soni, Niranjan Balasubramanian, H. Andrew Schwartz. 2959-2966 [doi]
- LMSOC: An Approach for Socially Sensitive PretrainingVivek Kulkarni, Shubhanshu Mishra, Aria Haghighi. 2967-2975 [doi]
- Extract, Integrate, Compete: Towards Verification Style Reading ComprehensionChen Zhang, Yuxuan Lai, Yansong Feng, Dongyan Zhao 0001. 2976-2986 [doi]
- Comparing learnability of two dependency schemes: 'semantic' (UD) and 'syntactic' (SUD)Ryszard Tuora, Adam Przepiórkowski, Aleksander Leczkowski. 2987-2996 [doi]
- Argumentation-Driven Evidence Association in Criminal CasesYefei Teng, WenHan Chao. 2997-3001 [doi]
- Eliminating Sentiment Bias for Aspect-Level Sentiment Classification with Unsupervised Opinion ExtractionBo Wang, Tao Shen, Guodong Long, Tianyi Zhou, Yi Chang. 3002-3012 [doi]
- Data Efficient Masked Language Modeling for Vision and LanguageYonatan Bitton, Michael Elhadad, Gabriel Stanovsky, Roy Schwartz 0001. 3013-3028 [doi]
- Improving Multilingual Neural Machine Translation with Auxiliary Source LanguagesWeijia Xu, Yuwei Yin, Shuming Ma, Dongdong Zhang 0001, Haoyang Huang. 3029-3041 [doi]
- How Does Fine-tuning Affect the Geometry of Embedding Space: A Case Study on IsotropySara Rajaee, Mohammad Taher Pilehvar. 3042-3049 [doi]
- Locality Preserving Sentence EncodingChangrong Min, Yonghe Chu, Liang Yang, Bo Xu 0009, Hongfei Lin. 3050-3060 [doi]
- Knowledge Representation Learning with Contrastive Completion CodingBo Ouyang, Wenbing Huang 0001, Runfa Chen, Zhixing Tan, Yang Liu, Maosong Sun, Jihong Zhu. 3061-3073 [doi]
- Knowledge-Enhanced Evidence Retrieval for Counterargument GenerationYohan Jo, Haneul Yoo, JinYeong Bak, Alice Oh, Chris Reed, Eduard H. Hovy. 3074-3094 [doi]
- Investigating Numeracy Learning Ability of a Text-to-Text Transfer ModelKuntal Kumar Pal, Chitta Baral. 3095-3101 [doi]
- Modeling Mathematical Notation Semantics in Academic PapersHwiyeol Jo, Dongyeop Kang, Andrew Head, Marti A. Hearst. 3102-3115 [doi]
- Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional LensSaad Hassan, Matt Huenerfauth, Cecilia Ovesdotter Alm. 3116-3123 [doi]
- Constructing Emotional Consensus and Utilizing Unpaired Data for Empathetic Dialogue GenerationLei Shen, Jinchao Zhang, Jiao Ou, Xiaofang Zhao, Jie Zhou. 3124-3134 [doi]
- Automatic rule generation for time expression normalizationWentao Ding, Jianhao Chen, Jinmao Li, Yuzhong Qu. 3135-3144 [doi]
- RW-KD: Sample-wise Loss Terms Re-Weighting for Knowledge DistillationPeng Lu, Abbas Ghaddar, Ahmad Rashid, Mehdi Rezagholizadeh, Ali Ghodsi 0001, Philippe Langlais. 3145-3152 [doi]
- Visual Cues and Error Correction for Translation RobustnessZhenhao Li, Marek Rei, Lucia Specia. 3153-3168 [doi]
- Beyond the Tip of the Iceberg: Assessing Coherence of Text ClassifiersShane Storks, Joyce Chai. 3169-3177 [doi]
- Does Pretraining for Summarization Require Knowledge Transfer?Kundan Krishna, Jeffrey P. Bigham, Zachary C. Lipton. 3178-3189 [doi]
- Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed BanditsJulia Kreutzer, David Vilar, Artem Sokolov. 3190-3204 [doi]
- Sometimes We Want Ungrammatical TranslationsPrasanna Parthasarathi, Koustuv Sinha, Joelle Pineau, Adina Williams. 3205-3227 [doi]
- An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal DialogXingyao Wang, David Jurgens. 3228-3257 [doi]
- SciCap: Generating Captions for Scientific FiguresTing-Yao Hsu, C. Lee Giles, Ting-Hao Huang. 3258-3264 [doi]
- SentNoB: A Dataset for Analysing Sentiment on Noisy Bangla TextsKhondoker Ittehadul Islam, Sudipta Kar, Md Saiful Islam, Mohammad Ruhul Amin. 3265-3271 [doi]
- Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic DataMassimo Nicosia, Zhongdi Qu, Yasemin Altun. 3272-3284 [doi]
- NewsBERT: Distilling Pre-trained Language Model for Intelligent News ApplicationChuhan Wu, Fangzhao Wu, Yang Yu, Tao Qi, Yongfeng Huang, Qi Liu. 3285-3295 [doi]
- SD-QA: Spoken Dialectal Question Answering for the Real WorldFahim Faisal, Sharlina Keshava, Md Mahfuz Ibn Alam, Antonios Anastasopoulos. 3296-3315 [doi]
- The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine TranslationOrevaoghene Ahia, Julia Kreutzer, Sara Hooker. 3316-3333 [doi]
- Transformer over Pre-trained Transformer for Neural Text Segmentation with Enhanced Topic CoherenceKelvin Lo, Yuan Jin, Weicong Tan, Ming Liu, Lan Du, Wray L. Buntine. 3334-3340 [doi]
- Self-Supervised Neural Topic ModelingSeyed Ali Bahrainian, Martin Jaggi, Carsten Eickhoff. 3341-3350 [doi]
- Coreference-aware Surprisal Predicts Brain ResponseEvan Jaffe, Byung-Doh Oh, William Schuler. 3351-3356 [doi]
- Distilling the Knowledge of Large-scale Generative Models into Retrieval Models for Efficient Open-domain ConversationBeomsu Kim, Seokjun Seo, Seungju Han, Enkhbayar Erdenee, Buru Chang. 3357-3373 [doi]
- Modeling Users and Online Communities for Abuse Detection: A Position on Ethics and ExplainabilityPushkar Mishra, Helen Yannakoudakis, Ekaterina Shutova. 3374-3385 [doi]
- Detecting Community Sensitive Norm Violations in Online ConversationsChan Young Park, Julia Mendelsohn, Karthik Radhakrishnan, Kinjal Jain, Tushar Kanakagiri, David Jurgens, Yulia Tsvetkov. 3386-3397 [doi]
- SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence RepresentationsHooman Sedghamiz, Shivam Raval, Enrico Santus, Tuka Alhanai, Mohammad M. Ghassemi. 3398-3403 [doi]
- mDAPT: Multilingual Domain Adaptive Pretraining in a Single ModelRasmus Kær Jørgensen, Mareike Hartmann, Xiang Dai, Desmond Elliott. 3404-3418 [doi]
- COSMic: A Coherence-Aware Generation Metric for Image DescriptionsMert Inan, Piyush Sharma, Baber Khalid, Radu Soricut, Matthew Stone, Malihe Alikhani. 3419-3430 [doi]
- Relation-Guided Pre-Training for Open-Domain Question AnsweringZiniu Hu, Yizhou Sun, Kai-Wei Chang. 3431-3448 [doi]
- MURAL: Multimodal, Multitask Representations Across LanguagesAashi Jain, Mandy Guo, Krishna Srinivasan, Ting Chen, Sneha Kudugunta 0001, Chao Jia, Yinfei Yang, Jason Baldridge. 3449-3463 [doi]
- AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language ModelsHarish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton, Aline Villavicencio. 3464-3477 [doi]
- Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human DemonstrationWeiyan Shi, Yu Li, Saurav Sahay, Zhou Yu. 3478-3492 [doi]
- A Computational Exploration of Pejorative Language in Social MediaLiviu P. Dinu, Ioan-Bogdan Iordache, Ana Sabina Uban, Marcos Zampieri. 3493-3498 [doi]
- Evidence-based Fact-Checking of Health-related ClaimsMourad Sarrouti, Asma Ben Abacha, Yassine Mrabet, Dina Demner-Fushman. 3499-3512 [doi]
- Learning and Analyzing Generation Order for Undirected Sequence ModelsYichen Jiang, Mohit Bansal. 3513-3523 [doi]
- Automatic Bilingual Markup TransferThomas Zenkel, Joern Wuebker, John DeNero. 3524-3533 [doi]
- Exploring a Unified Sequence-To-Sequence Transformer for Medical Product Safety Monitoring in Social MediaShivam Raval, Hooman Sedghamiz, Enrico Santus, Tuka Alhanai, Mohammad M. Ghassemi, Emmanuele Chersoni. 3534-3546 [doi]
- Disentangling Generative Factors in Natural Language with Discrete Variational AutoencodersGiangiacomo Mercatali, André Freitas. 3547-3556 [doi]
- MSD: Saliency-aware Knowledge Distillation for Multimodal UnderstandingWoojeong Jin, Maziar Sanjabi, Shaoliang Nie, Liang Tan, Xiang Ren 0001, Hamed Firooz. 3557-3569 [doi]
- Do UD Trees Match Mention Spans in Coreference Annotations?Martin Popel, Zdenek Zabokrtský, Anna Nedoluzhko, Michal Novák, Daniel Zeman. 3570-3576 [doi]
- Beyond Distillation: Task-level Mixture-of-Experts for Efficient InferenceSneha Kudugunta, Yanping Huang, Ankur Bapna, Maxim Krikun, Dmitry Lepikhin, Minh-Thang Luong, Orhan Firat. 3577-3599 [doi]
- TAG: Gradient Attack on Transformer-based Language ModelsJieren Deng, Yijue Wang, Ji Li, Chenghong Wang, Chao Shang, Hang Liu, Sanguthevar Rajasekaran, Caiwen Ding. 3600-3610 [doi]
- Generating Realistic Natural Language CounterfactualsMarcel Robeer, Floris Bex, Ad Feelders. 3611-3625 [doi]
- Unsupervised Chunking as Syntactic Structure Induction with a Knowledge-Transfer ApproachAnup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin, Lili Mou. 3626-3634 [doi]
- Model-based analysis of brain activity reveals the hierarchy of language in 305 subjectsCharlotte Caucheteux, Alexandre Gramfort, Jean-Remi King. 3635-3644 [doi]
- Gated Transformer for Robust De-noised Sequence-to-Sequence ModellingAyan Sengupta, Amit Kumar, Sourabh Kumar Bhattacharjee, Suman Roy 0001. 3645-3657 [doi]
- Token-wise Curriculum Learning for Neural Machine TranslationChen Liang, Haoming Jiang, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Tuo Zhao. 3658-3670 [doi]
- RelDiff: Enriching Knowledge Graph Relation Representations for Sensitivity ClassificationHitarth Narvala, Graham McDonald, Iadh Ounis. 3671-3681 [doi]
- Post-Editing Extractive Summaries by Definiteness PredictionJad Kabbara, Jackie Chi Kit Cheung. 3682-3692 [doi]
- Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient ConversationsLongxiang Zhang, Renato Negrinho, Arindam Ghosh, Vasudevan Jagannathan, Hamid Reza Hassanzadeh, Thomas Schaaf, Matthew R. Gormley. 3693-3712 [doi]
- Distilling Knowledge for Empathy DetectionMahshid Hosseini, Cornelia Caragea. 3713-3724 [doi]
- Adapting Entities across Languages and CulturesDenis Peskov, Viktor Hangya, Jordan L. Boyd-Graber, Alexander Fraser 0001. 3725-3750 [doi]
- ODIST: Open World Classification via Distributionally Shifted InstancesLei Shu, Yassine Benajiba, Saab Mansour, Yi Zhang. 3751-3756 [doi]
- LAMAD: A Linguistic Attentional Model for Arabic Text DiacritizationRaeed Al-Sabri, Jianliang Gao. 3757-3764 [doi]
- Sequence-to-Lattice Models for Fast TranslationYuntian Deng, Alexander M. Rush. 3765-3772 [doi]
- Towards Realistic Single-Task Continuous Learning Research for NERJustin Payan, Yuval Merhav, He Xie, Satyapriya Krishna, Anil Ramakrishna, Mukund Sridhar, Rahul Gupta. 3773-3783 [doi]
- Retrieval Augmentation Reduces Hallucination in ConversationKurt Shuster 0001, Spencer Poff, Moya Chen, Douwe Kiela, Jason Weston. 3784-3803 [doi]
- Towards Automatic Bias Detection in Knowledge GraphsDaphna Keidar, Mian Zhong, Ce Zhang, Yash Raj Shrestha, Bibek Paudel. 3804-3811 [doi]
- Searching for More Efficient Dynamic ProgramsTim Vieira, Ryan Cotterell, Jason Eisner. 3812-3830 [doi]
- Revisiting Robust Neural Machine Translation: A Transformer Case StudyPeyman Passban, Puneeth S. M. Saladi, Qun Liu 0001. 3831-3840 [doi]
- Can NLI Models Verify QA Systems' Predictions?Jifan Chen, Eunsol Choi, Greg Durrett. 3841-3854 [doi]
- Parameter-Efficient Domain Knowledge Integration from Multiple Sources for Biomedical Pre-trained Language ModelsQiuhao Lu, Dejing Dou, Thien Huu Nguyen. 3855-3865 [doi]
- Uncovering Implicit Gender Bias in Narratives through Commonsense InferenceTenghao Huang, Faeze Brahman, Vered Shwartz, Snigdha Chaturvedi. 3866-3873 [doi]
- Contrastive Document Representation Learning with Graph Attention NetworksPeng Xu, Xinchi Chen, Xiaofei Ma, Zhiheng Huang, Bing Xiang. 3874-3884 [doi]
- Convex Aggregation for Opinion SummarizationHayate Iso, Xiaolan Wang 0001, Yoshihiko Suhara, Stefanos Angelidis, Wang Chiew Tan. 3885-3903 [doi]
- Using Optimal Transport as Alignment Objective for fine-tuning Multilingual Contextualized EmbeddingsSawsan Alqahtani, Garima Lalwani, Yi Zhang, Salvatore Romeo, Saab Mansour. 3904-3919 [doi]
- Uncertainty-Aware Machine Translation EvaluationTaisiya Glushkova, Chrysoula Zerva, Ricardo Rei, André F. T. Martins. 3920-3938 [doi]
- Neural Unification for Logic Reasoning over Natural LanguageGabriele Picco, Thanh Lam Hoang, Marco Luca Sbodio, Vanessa López. 3939-3950 [doi]
- From None to Severe: Predicting Severity in Movie ScriptsYigeng Zhang, Mahsa Shafaei, Fabio A. González, Thamar Solorio. 3951-3956 [doi]
- Benchmarking Meta-embeddings: What Works and What Does NotIker García-Ferrero, Rodrigo Agerri, German Rigau. 3957-3972 [doi]
- A Plug-and-Play Method for Controlled Text GenerationDamian Pascual, Beni Egressy, Clara Meister, Ryan Cotterell, Roger Wattenhofer. 3973-3997 [doi]
- A Corpus-based Syntactic Analysis of Two-termed Unlike CoordinationJulie Kallini, Christiane Fellbaum. 3998-4008 [doi]
- Weakly Supervised Contrastive Learning for Chest X-Ray Report GenerationAn Yan, Zexue He, Xing Lu, Jiang Du, Eric Y. Chang, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu. 4009-4015 [doi]
- NUANCED: Natural Utterance Annotation for Nuanced Conversation with Estimated DistributionsZhiYu Chen, Honglei Liu, Hu Xu, Seungwhan Moon, Hao Zhou, Bing Liu. 4016-4024 [doi]
- Table-based Fact Verification With Salience-aware LearningFei Wang 0060, Kexuan Sun 0002, Jay Pujara, Pedro A. Szekely, Muhao Chen. 4025-4036 [doi]
- Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence CoverageIsidora Chara Tourni, Lei Guo, Taufiq Husada Daryanto, Fabian Zhafransyah, Edward Edberg Halim, Mona Jalal, Boqi Chen, Sha Lai, Hengchang Hu, Margrit Betke, Prakash Ishwar, Derry Tanti Wijaya. 4037-4050 [doi]
- Multi-task Learning to Enable Location Mention Identification in the Early Hours of a Crisis EventSarthak Khanal, Doina Caragea. 4051-4056 [doi]
- Graph-Based Decoding for Task Oriented Semantic ParsingJeremy R. Cole, Nanjiang Jiang, Panupong Pasupat, Luheng He, Peter Shaw. 4057-4065 [doi]
- Expected Validation Performance and Estimation of a Random Variable's MaximumJesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz 0001, Noah A. Smith. 4066-4073 [doi]
- How May I Help You? Using Neural Text Simplification to Improve Downstream NLP TasksHoang Van, Zheng Tang, Mihai Surdeanu. 4074-4080 [doi]
- Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative TransformersMachel Reid, Edison Marrese-Taylor, Yutaka Matsuo. 4081-4090 [doi]
- Leveraging Information Bottleneck for Scientific Document SummarizationJiaxin Ju, Ming Liu, Huan Yee Koh, Yuan Jin, Lan Du, Shirui Pan. 4091-4098 [doi]
- Reconsidering the Past: Optimizing Hidden States in Language ModelsDavis Yoshida, Kevin Gimpel. 4099-4105 [doi]
- Attend, Memorize and Generate: Towards Faithful Table-to-Text Generation in Few ShotsWenting Zhao 0006, Ye Liu, Yao Wan, Philip S. Yu. 4106-4117 [doi]
- ARCH: Efficient Adversarial Regularized Training with CachingSimiao Zuo, Chen Liang, Haoming Jiang, Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen, Tuo Zhao. 4118-4131 [doi]
- Probing Commonsense Explanation in Dialogue Response GenerationPei Zhou, Pegah Jandaghi, Hyundong Cho, Bill Yuchen Lin, Jay Pujara, Xiang Ren. 4132-4146 [doi]
- NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering DatasetQiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim. 4147-4161 [doi]
- Textual Time Travel: A Temporally Informed Approach to Theory of MindAkshatha Arodi, Jackie Chi Kit Cheung. 4162-4172 [doi]
- Detect and Perturb: Neutral Rewriting of Biased and Sensitive Text via Gradient-based DecodingZexue He, Bodhisattwa Prasad Majumder, Julian McAuley. 4173-4181 [doi]
- HyperExpan: Taxonomy Expansion with Hyperbolic Representation LearningMingyu Derek Ma, Muhao Chen, Te-Lin Wu, Nanyun Peng. 4182-4194 [doi]
- Want To Reduce Labeling Cost? GPT-3 Can HelpShuohang Wang, Yang Liu, Yichong Xu, Chenguang Zhu, Michael Zeng. 4195-4205 [doi]
- Written Justifications are Key to Aggregate Crowdsourced ForecastsSaketh Kotamraju, Eduardo Blanco 0002. 4206-4216 [doi]
- Cleaning Dirty Books: Post-OCR Processing for Previously Scanned TextsAllen Kim, Charuta Pethe, Naoya Inoue, Steven Skiena. 4217-4226 [doi]
- Bag of Tricks for Optimizing Transformer EfficiencyYe Lin, Yanyang Li, Tong Xiao, Jingbo Zhu. 4227-4233 [doi]
- Non-Parametric Unsupervised Domain Adaptation for Neural Machine TranslationXin Zheng, Zhirui Zhang, Shujian Huang, Boxing Chen, Jun Xie, Weihua Luo, Jiajun Chen. 4234-4241 [doi]
- The Topic Confusion Task: A Novel Evaluation Scenario for Authorship AttributionMalik H. Altakrori, Jackie Chi Kit Cheung, Benjamin C. M. Fung. 4242-4256 [doi]
- Micromodels for Efficient, Explainable, and Reusable Systems: A Case Study on Mental HealthAndrew Lee, Jonathan K. Kummerfeld, Larry An, Rada Mihalcea. 4257-4272 [doi]
- Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language ModelsJaromír Savelka, Kevin D. Ashley. 4273-4283 [doi]
- FCM: A Fine-grained Comparison Model for Multi-turn Dialogue ReasoningXu Wang, Hainan Zhang, Shuai Zhao 0001, Yanyan Zou, Hongshen Chen, Zhuoye Ding, Bo Cheng 0001, Yanyan Lan. 4284-4293 [doi]
- Reference-based Weak Supervision for Answer Sentence Selection using Web DataVivek Krishnamurthy, Thuy Vu, Alessandro Moschitti. 4294-4299 [doi]
- A Deep Decomposable Model for Disentangling Syntax and Semantics in Sentence RepresentationDingcheng Li, Hongliang Fei, Shaogang Ren, Ping Li. 4300-4310 [doi]
- Improved Word Sense Disambiguation with Enhanced Sense RepresentationsYang Song, Xin Cai Ong, Hwee Tou Ng, Qian Lin. 4311-4320 [doi]
- Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent VariablesWeizhi Wang, Zhirui Zhang, Yichao Du, Boxing Chen, Jun Xie, Weihua Luo. 4321-4327 [doi]
- FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech RecognitionYichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Wenjie Liu, Linquan Liu, Xiang-Yang Li, Tao Qin, Edward Lin, Tie-Yan Liu. 4328-4337 [doi]
- Task-Oriented Clustering for DialoguesChenxu Lv, Hengtong Lu, Shuyu Lei, Huixing Jiang, Wei Wu, Caixia Yuan, Xiaojie Wang. 4338-4347 [doi]
- Mitigating Data Poisoning in Text Classification with Differential PrivacyChang Xu, Jun Wang, Francisco Guzmán, Benjamin I. P. Rubinstein, Trevor Cohn. 4348-4356 [doi]
- Does Vision-and-Language Pretraining Improve Lexical Grounding?Tian Yun, Chen Sun, Ellie Pavlick. 4357-4366 [doi]
- Character-based PCFG Induction for Modeling the Syntactic Acquisition of Morphologically Rich LanguagesLifeng Jin, Byung-Doh Oh, William Schuler. 4367-4378 [doi]
- Block-wise Word Embedding Compression Revisited: Better Weighting and StructuringJong-Ryul Lee, Yong-Ju Lee, Yong-Hyuk Moon. 4379-4388 [doi]
- Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-SwitchingParul Chopra, Sai Krishna Rallabandi, Alan W. Black, Khyathi Raghavi Chandu. 4389-4397 [doi]
- Influence Tuning: Demoting Spurious Correlations via Instance Attribution and Instance-Driven UpdatesXiaochuang Han, Yulia Tsvetkov. 4398-4409 [doi]
- Learning Task Sampling Policy for Multitask LearningDhanasekar Sundararaman, Henry Tsai, Kuang-Huei Lee, Iulia Turc, Lawrence Carin. 4410-4415 [doi]
- Competing Independent Modules for Knowledge Integration and OptimizationParsa Bagherzadeh, Sabine Bergler. 4416-4425 [doi]
- An Exploratory Study on Long Dialogue Summarization: What Works and What's NextYusen Zhang 0001, Ansong Ni, Tao Yu 0009, Rui Zhang, Chenguang Zhu, Budhaditya Deb, Asli Celikyilmaz, Ahmed Hassan Awadallah, Dragomir R. Radev. 4426-4433 [doi]
- Improving Text Auto-Completion with Next Phrase PredictionDong-Ho Lee, Zhiqiang Hu, Roy Ka-Wei Lee. 4434-4438 [doi]
- MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their TargetsShraman Pramanick, Shivam Sharma, Dimitar Dimitrov, Md. Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty 0002. 4439-4455 [doi]
- NICE: Neural Image Commenting with EmpathyKezhen Chen, Qiuyuan Huang, Daniel McDuff, Xiang Gao, Hamid Palangi, Jianfeng Wang, Kenneth D. Forbus, Jianfeng Gao. 4456-4472 [doi]
- HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter NotebooksXuye Liu, Dakuo Wang, April Yi Wang, Yufang Hou, Lingfei Wu. 4473-4485 [doi]
- A multilabel approach to morphosyntactic probingNaomi Tachikawa Shapiro, Amandalynne Paullada, Shane Steinert-Threlkeld. 4486-4524 [doi]
- Co-Teaching Student-Model through Submission Results of Shared TaskKouta Nakayama, Shuhei Kurita, Akio Kobayashi, Yukino Baba, Satoshi Sekine. 4525-4535 [doi]
- KLMo: Knowledge Graph Enhanced Pretrained Language Model with Fine-Grained RelationshipsLei He, Suncong Zheng, Tao Yang, Feng Zhang. 4536-4542 [doi]
- Do We Know What We Don't Know? Studying Unanswerable Questions beyond SQuAD 2.0Elior Sulem, Jamaal Hay, Dan Roth. 4543-4548 [doi]
- Glyph Enhanced Chinese Character Pre-Training for Lexical Sememe PredictionBoer Lyu, Lu Chen, Kai Yu 0004. 4549-4555 [doi]
- Active Learning for Rumor Identification on Social MediaParsa Farinneya, Mohammad Mahdi Abdollah Pour, Sardar Hamidian, Mona T. Diab. 4556-4565 [doi]
- Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical TextMaya Varma, Laurel J. Orr, Sen Wu 0002, Megan Leszczynski, Xiao Ling, Christopher Ré. 4566-4575 [doi]
- Self-Training using Rules of Grammar for Few-Shot NLUJoonghyuk Hahn, Hyunjoon Cheon, Kyuyeol Han, Cheongjae Lee, Junseok Kim, Yo-Sub Han. 4576-4581 [doi]
- Aspect-based Sentiment Analysis in Question Answering ForumsWenxuan Zhang, Yang Deng, Xin Li, Lidong Bing, Wai Lam. 4582-4591 [doi]
- ForumSum: A Multi-Speaker Conversation Summarization DatasetMisha Khalman, Yao Zhao, Mohammad Saleh. 4592-4599 [doi]
- Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA FrameworkAbhilash Nandy, Soumya Sharma, Shubham Maddhashiya, Kapil Sachdeva, Pawan Goyal 0002, Niloy Ganguly. 4600-4609 [doi]
- Comprehensive Punctuation Restoration for English and PolishMichal Pogoda, Tomasz Walkowiak. 4610-4619 [doi]
- Syntactically Diverse Adversarial Network for Knowledge-Grounded Conversation GenerationFuwei Cui, Hui Di, Hongjie Ren, Kazushige Ouchi, Ze Liu, Jinan Xu. 4620-4630 [doi]
- QACE: Asking Questions to Evaluate an Image CaptionHwanhee Lee, Thomas Scialom, Seunghyun Yoon 0002, Franck Dernoncourt, Kyomin Jung. 4631-4638 [doi]
- Secoco: Self-Correcting Encoding for Neural Machine TranslationTao Wang, Chengqi Zhao, Mingxuan Wang, Lei Li, Hang Li, Deyi Xiong. 4639-4644 [doi]
- Simple or Complex? Complexity-controllable Question Generation with Soft Templates and Deep Mixture of Experts ModelSheng Bi, Xiya Cheng, Yuan-Fang Li, Lizhen Qu, Shirong Shen, Guilin Qi, Lu Pan, Yinlin Jiang. 4645-4654 [doi]
- Predicting Anti-Asian Hateful Users on Twitter during COVID-19Jisun An, Haewoon Kwak, Claire Seungeun Lee, Bogang Jun, Yong-Yeol Ahn. 4655-4666 [doi]
- Fine-grained Typing of Emerging Entities in MicroblogsSatoshi Akasaki, Naoki Yoshinaga 0001, Masashi Toyoda. 4667-4679 [doi]
- Data-Efficient Language Shaped Few-shot Image ClassificationZhenwen Liang, Xiangliang Zhang 0001. 4680-4686 [doi]
- Beyond Glass-Box Features: Uncertainty Quantification Enhanced Quality Estimation for Neural Machine TranslationKe Wang, Yangbin Shi, Jiayi Wang, Yuqi Zhang, Yu Zhao, Xiaolin Zheng. 4687-4698 [doi]
- Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate SpeechTomer Wullach, Amir Adler, Einat Minkov. 4699-4705 [doi]
- AutoEQA: Auto-Encoding Questions for Extractive Question AnsweringStalin Varanasi, Saadullah Amin, Guenter Neumann. 4706-4712 [doi]
- A Multi-label Multi-hop Relation Detection Model based on Relation-aware Sequence GenerationLinhai Zhang, Deyu Zhou, Chao Lin, Yulan He. 4713-4719 [doi]
- Don't Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation TechniquesHossein Amirkhani, Mohammad Taher Pilehvar. 4720-4728 [doi]
- Stacked AMR Parsing with Silver DataQingrong Xia, Zhenghua Li, Rui Wang, Min Zhang. 4729-4738 [doi]
- Speculative Sampling in Variational Autoencoders for Dialogue Response GenerationShoetsu Sato, Naoki Yoshinaga, Masashi Toyoda, Masaru Kitsuregawa. 4739-4745 [doi]
- Perceived and Intended Sarcasm Detection with Graph Attention NetworksJoan Plepi, Lucie Flek. 4746-4753 [doi]
- Contrastive Representation Learning for Exemplar-Guided Paraphrase GenerationHaoran Yang, Wai Lam, Piji Li. 4754-4761 [doi]
- MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual TransferAlan Ansell, Edoardo Maria Ponti, Jonas Pfeiffer, Sebastian Ruder, Goran Glavas, Ivan Vulic, Anna Korhonen. 4762-4781 [doi]
- Sustainable Modular Debiasing of Language ModelsAnne Lauscher, Tobias Lüken, Goran Glavas. 4782-4797 [doi]
- A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base Question AnsweringDeyu Zhou, Yanzheng Xiang, Linhai Zhang, Chenchen Ye, Qian-Wen Zhang, Yunbo Cao. 4798-4808 [doi]
- Counterfactual Adversarial Learning with Representation InterpolationWei Wang, Boxin Wang, Ning Shi, Jinfeng Li, Bingyu Zhu, Xiangyu Liu, Rong Zhang. 4809-4820 [doi]
- 'Just What do You Think You're Doing, Dave?' A Checklist for Responsible Data Use in NLPAnna Rogers, Timothy Baldwin, Kobi Leins. 4821-4833 [doi]
- Counter-Contrastive Learning for Language GANsYekun Chai, Haidong Zhang, Qiyue Yin, Junge Zhang. 4834-4839 [doi]
- Incorporating Circumstances into Narrative Event PredictionShichao Wang, Xiangrui Cai, Hongbin Wang, Xiaojie Yuan. 4840-4849 [doi]
- MultiFix: Learning to Repair Multiple Errors by Optimal Alignment LearningHyeontae Seo, Yo-Sub Han, Sang-Ki Ko. 4850-4855 [doi]
- HOTTER: Hierarchical Optimal Topic Transport with Explanatory Context RepresentationsSabine Wehnert, Christian Scheel, Simona Szakács-Behling, Maret Nieländer, Patrick Mielke, Ernesto William De Luca. 4856-4866 [doi]
- Grammatical Error Correction with Contrastive Learning in Low Error Density DomainsHannan Cao, Wenmian Yang, Hwee Tou Ng. 4867-4874 [doi]
- Improving Unsupervised Commonsense Reasoning Using Knowledge-Enabled Natural Language InferenceCanming Huang, Weinan He, Yongmei Liu. 4875-4885 [doi]
- Does Putting a Linguist in the Loop Improve NLU Data Collection?Alicia Parrish, William Huang, Omar Agha, Soo-Hwan Lee, Nikita Nangia, Alex Warstadt, Karmanya Aggarwal, Emily Allaway, Tal Linzen, Samuel R. Bowman. 4886-4901 [doi]
- Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language UnderstandingShane Storks, Qiaozi Gao, Yichi Zhang, Joyce Chai. 4902-4918 [doi]
- Making Heads and Tails of Models with Marginal Calibration for Sparse TagsetsMichael Kranzlein, Nelson F. Liu, Nathan Schneider. 4919-4928 [doi]
- GeDi: Generative Discriminator Guided Sequence GenerationBen Krause, Akhilesh Deepak Gotmare, Bryan McCann, Nitish Shirish Keskar, Shafiq R. Joty, Richard Socher, Nazneen Fatema Rajani. 4929-4952 [doi]