- Detecting Attackable Sentences in Arguments. Yohan Jo, Seojin Bang, Emaad Manzoor, Eduard H. Hovy, Chris Reed. 1-23
- Extracting Implicitly Asserted Propositions in Argumentation. Yohan Jo, Jacky Visser, Chris Reed, Eduard H. Hovy. 24-38
- Quantitative argument summarization and beyond: Cross-domain key point analysis. Roy Bar-Haim, Yoav Kantor, Lilach Eden, Roni Friedman, Dan Lahav, Noam Slonim. 39-49
- Unsupervised stance detection for arguments from consequences. Jonathan Kobbe, Ioana Hulpus, Heiner Stuckenschmidt. 50-60
- BLEU might be Guilty but References are not Innocent. Markus Freitag, David Grangier, Isaac Caswell. 61-71
- Statistical Power and Translationese in Machine Translation Evaluation. Yvette Graham, Barry Haddow, Philipp Koehn. 72-81
- Simulated multiple reference training improves low-resource machine translation. Huda Khayrallah, Brian Thompson, Matt Post, Philipp Koehn. 82-89
- Automatic Machine Translation Evaluation in Many Languages via Zero-Shot Paraphrasing. Brian Thompson, Matt Post. 90-121
- PRover: Proof Generation for Interpretable Reasoning over Rules. Swarnadeep Saha, Sayan Ghosh, Shashank Srivastava, Mohit Bansal. 122-136
- Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multihop Question-Answering. Harsh Jhamtani, Peter Clark. 137-150
- Self-Supervised Knowledge Triplet Learning for Zero-Shot Question Answering. Pratyay Banerjee, Chitta Baral. 151-162
- More Bang for Your Buck: Natural Perturbation for Robust Question Answering. Daniel Khashabi, Tushar Khot, Ashish Sabharwal. 163-170
- A matter of framing: The impact of linguistic formalism on probing results. Ilia Kuznetsov, Iryna Gurevych. 171-182
- Information-Theoretic Probing with Minimum Description Length. Elena Voita, Ivan Titov. 183-196
- Intrinsic Probing through Dimension Selection. Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell. 197-216
- Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually). Alex Warstadt, Yian Zhang, Xiaocheng Li, Haokun Liu, Samuel R. Bowman. 217-235
- Repulsive Attention: Rethinking Multi-head Attention as Bayesian Inference. Bang An, Jie Lyu 0004, Zhenyi Wang, Chunyuan Li, Changwei Hu, Fei Tan, Ruiyi Zhang, Yifan Hu, Changyou Chen. 236-255
- KERMIT: Complementing Transformer Architectures with Encoders of Explicit Syntactic Interpretations. Fabio Massimo Zanzotto, Andrea Santilli, Leonardo Ranaldi, Dario Onorati, Pierfrancesco Tommasino, Francesca Fallucchi. 256-267
- ETC: Encoding Long and Structured Inputs in Transformers. Joshua Ainslie, Santiago Ontañón, Chris Alberti, Vaclav Cvicek, Zachary Fisher, Philip Pham, Anirudh Ravula, Sumit Sanghai, Qifan Wang, Li Yang. 268-284
- Pre-Training Transformers as Energy-Based Cloze Models. Kevin Clark, Minh-Thang Luong, Quoc V. Le, Christopher D. Manning. 285-294
- Calibration of Pre-trained Transformers. Shrey Desai, Greg Durrett. 295-302
- Near-imperceptible Neural Linguistic Steganography via Self-Adjusting Arithmetic Coding. Jiaming Shen, Heng Ji, Jiawei Han 0001. 303-313
- Multi-Dimensional Gender Bias Classification. Emily Dinan, Angela Fan, Ledell Wu, Jason Weston, Douwe Kiela, Adina Williams. 314-331
- FIND: Human-in-the-Loop Debugging Deep Text Classifiers. Piyawat Lertvittayakumjorn, Lucia Specia, Francesca Toni. 332-348
- Conversational Document Prediction to Assist Customer Care Agents. Jatin Ganhotra, Haggai Roitman, Doron Cohen, Nathaniel Mills, R. Chulaka Gunasekara, Yosi Mass, Sachindra Joshi, Luis A. Lastras, David Konopnicki. 349-356
- Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU. Brielen Madureira, David Schlangen. 357-374
- Augmented Natural Language for Generative Sequence Labeling. Ben Athiwaratkun, Cícero Nogueira dos Santos, Jason Krone, Bing Xiang. 375-385
- Dialogue Response Ranking Training with Large-Scale Human Feedback Data. Xiang Gao, Yizhe Zhang, Michel Galley, Chris Brockett, Bill Dolan. 386-395
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites. Ruiqi Zhong, Tao Yu, Dan Klein. 396-411
- Cross-Thought for Sentence Encoder Pre-training. Shuohang Wang, Yuwei Fang, Siqi Sun, Zhe Gan, Yu Cheng 0001, Jingjing Liu 0001, Jing Jiang 0001. 412-421
- AutoQA: From Databases To QA Semantic Parsers With Only Synthetic Training Data. Silei Xu, Sina J. Semnani, Giovanni Campagna, Monica S. Lam. 422-434
- A Spectral Method for Unsupervised Multi-Document Summarization. Kexiang Wang, Baobao Chang, Zhifang Sui. 435-445
- What Have We Achieved on Text Summarization? Dandan Huang, Leyang Cui, Sen Yang, Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang 0004. 446-469
- Q-learning with Language Model for Edit-based Unsupervised Summarization. Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana. 470-484
- Friendly Topic Assistant for Transformer Based Abstractive Summarization. Zhengjue Wang, Zhibin Duan, Hao Zhang 0050, Chaojie Wang, Long Tian, Bo Chen 0001, Mingyuan Zhou. 485-497
- Contrastive Distillation on Intermediate Representations for Language Model Compression. Siqi Sun, Zhe Gan, Yuwei Fang, Yu Cheng 0001, Shuohang Wang, Jingjing Liu 0001. 498-508
- TernaryBERT: Distillation-aware Ultra-low Bit BERT. Wei Zhang, Lu Hou, Yichun Yin, Lifeng Shang, Xiao Chen, Xin Jiang, Qun Liu. 509-521
- Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks. Trapit Bansal, Rishikesh Jha, Tsendsuren Munkhdalai, Andrew McCallum. 522-534
- Efficient Meta Lifelong-Learning with Limited Memory. Zirui Wang, Sanket Vaibhav Mehta, Barnabás Póczos, Jaime G. Carbonell. 535-548
- Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings. Phillip Keung, Yichao Lu, Julian Salazar, Vikas Bhardwaj. 549-554
- A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT. Masaaki Nagata, Katsuki Chousa, Masaaki Nishino. 555-565
- Accurate Word Alignment Induction from Neural Machine Translation. Yun Chen, Yang Liu, Guanhua Chen, Xin Jiang, Qun Liu. 566-576
- ChrEn: Cherokee-English Machine Translation for Endangered Language Revitalization. Shiyue Zhang, Benjamin Frey, Mohit Bansal. 577-595
- Unsupervised Discovery of Implicit Gender Bias. Anjalie Field, Yulia Tsvetkov. 596-608
- Condolence and Empathy in Online Communities. Naitian Zhou, David Jurgens. 609-626
- An Embedding Model for Estimating Legislative Preferences from the Frequency and Sentiment of Tweets. Gregory Spell, Brian Guay, Sunshine Hillygus, Lawrence Carin. 627-641
- Measuring Information Propagation in Literary Social Networks. Matthew Sims, David Bamman. 642-652
- Social Chemistry 101: Learning to Reason about Social and Moral Norms. Maxwell Forbes, Jena D. Hwang, Vered Shwartz, Maarten Sap, Yejin Choi. 653-670
- Event Extraction by Answering (Almost) Natural Questions. Xinya Du, Claire Cardie. 671-683
- Connecting the Dots: Event Graph Schema Induction with Path Language Modeling. Manling Li, Qi Zeng, Ying Lin, KyungHyun Cho, Heng Ji, Jonathan May, Nathanael Chambers, Clare R. Voss. 684-695
- Joint Constrained Learning for Event-Event Relation Extraction. Haoyu Wang, Muhao Chen, Hongming Zhang, Dan Roth. 696-706
- Incremental Event Detection via Knowledge Consolidation Networks. Pengfei Cao, Yubo Chen 0001, Jun Zhao 0001, Taifeng Wang. 707-717
- Semi-supervised New Event Type Induction and Event Detection. Lifu Huang, Heng Ji. 718-724
- Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph. Haozhe Ji, Pei Ke, Shaohan Huang, Furu Wei, Xiaoyan Zhu 0001, Minlie Huang. 725-736
- Reformulating Unsupervised Style Transfer as Paraphrase Generation. Kalpesh Krishna, John Wieting, Mohit Iyyer. 737-762
- De-Biased Court's View Generation with Causality. Yiquan Wu, Kun Kuang, Yating Zhang, Xiaozhong Liu, Changlong Sun, Jun Xiao, Yueting Zhuang, Luo Si, Fei Wu 0001. 763-780
- PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation. Xinyu Hua, Lu Wang. 781-793
- Back to the Future: Unsupervised Backprop-based Decoding for Counterfactual and Abductive Commonsense Reasoning. Lianhui Qin, Vered Shwartz, Peter West, Chandra Bhagavatula, Jena D. Hwang, Ronan Le Bras, Antoine Bosselut, Yejin Choi. 794-805
- Where Are You? Localization from Embodied Dialog. Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson. 806-822
- Learning to Represent Image and Text with Denotation Graph. Bowen Zhang 0002, Hexiang Hu, Vihan Jain, Eugene Ie, Fei Sha. 823-839
- Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning. Zhiyuan Fang, Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang. 840-860
- Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think! Jack Hessel, Lillian Lee. 861-877
- MUTANT: A Training Paradigm for Out-of-Distribution Generalization in Visual Question Answering. Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang. 878-892
- Mitigating Gender Bias for Neural Dialogue Generation with Adversarial Learning. Haochen Liu, Wentao Wang, Yiqi Wang, Hui Liu, Zitao Liu, Jiliang Tang. 893-903
- Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness. Hyunwoo Kim, Byeongchang Kim 0002, Gunhee Kim. 904-916
- TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue. Chien-Sheng Wu, Steven C. H. Hoi, Richard Socher, Caiming Xiong. 917-929
- RiSAWOZ: A Large-Scale Multi-Domain Wizard-of-Oz Dataset with Rich Semantic Annotations for Task-Oriented Dialogue Modeling. Jun Quan, Shian Zhang, Qian Cao, Zizhong Li, Deyi Xiong. 930-940
- Filtering Noisy Dialogue Corpora by Connectivity and Content Relatedness. Reina Akama, Sho Yokoi, Jun Suzuki, Kentaro Inui. 941-958
- Latent Geographical Factors for Analyzing the Evolution of Dialects in Contact. Yugo Murawaki. 959-976
- Predicting Reference: What do Language Models Learn about Discourse Models? Shiva Upadhye, Leon Bergen, Andrew Kehler. 977-982
- Word class flexibility: A deep contextualized approach. Bai Li, Guillaume Thomas, Yang Xu, Frank Rudzicz. 983-994
- Shallow-to-Deep Training for Neural Machine Translation. Bei Li, Ziyang Wang, Hui Liu, Yufan Jiang, Quan Du, Tong Xiao, Huizhen Wang, Jingbo Zhu. 995-1005
- Iterative Refinement in the Continuous Space for Non-Autoregressive Neural Machine Translation. Jason Lee, Raphael Shu, KyungHyun Cho. 1006-1015
- Why Skip If You Can Combine: A Simple Knowledge Distillation Technique for Intermediate Layers. Yimeng Wu, Peyman Passban, Mehdi Rezagholizadeh, Qun Liu. 1016-1021
- Multi-task Learning for Multilingual Neural Machine Translation. Yiren Wang, ChengXiang Zhai, Hany Hassan. 1022-1034
- Token-level Adaptive Training for Neural Machine Translation. Shuhao Gu, Jinchao Zhang, Fandong Meng, Yang Feng 0004, Wanying Xie, Jie Zhou 0016, Dong Yu 0003. 1035-1046
- Multi-Unit Transformers for Neural Machine Translation. Jianhao Yan, Fandong Meng, Jie Zhou 0016. 1047-1059
- On the Sparsity of Neural Machine Translation Models. Yong Wang, Longyue Wang, Victor O. K. Li, Zhaopeng Tu. 1060-1066
- Incorporating a Local Translation Mechanism into Non-autoregressive Translation. Xiang Kong, Zhisong Zhang, Eduard H. Hovy. 1067-1073
- Self-Paced Learning for Neural Machine Translation. Yu Wan 0004, Baosong Yang, Derek F. Wong, Yikai Zhou, Lidia S. Chao, Haibo Zhang, Boxing Chen. 1074-1080
- Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation. Pei Zhang, Boxing Chen, Niyu Ge, Kai Fan. 1081-1087
- Generating Diverse Translation from Model Distribution with Dropout. Xuanfu Wu, Yang Feng 0004, Chenze Shao. 1088-1097
- Non-Autoregressive Machine Translation with Latent Alignments. Chitwan Saharia, William Chan, Saurabh Saxena, Mohammad Norouzi 0002. 1098-1108
- Look at the First Sentence: Position Bias in Question Answering. Miyoung Ko, Jinhyuk Lee, Hyunjae Kim, Gangwoo Kim, Jaewoo Kang. 1109-1121
- ProtoQA: A Question Answering Dataset for Prototypical Common-Sense Reasoning. Michael Boratko, Xiang Li 0069, Tim O'Gorman, Rajarshi Das, Dan Le, Andrew McCallum. 1122-1136
- IIRC: A Dataset of Incomplete Information Reading Comprehension Questions. James Ferguson, Matt Gardner 0001, Hannaneh Hajishirzi, Tushar Khot, Pradeep Dasigi. 1137-1147
- Unsupervised Adaptation of Question Answering Systems via Generative Self-training. Steven J. Rennie, Etienne Marcheret, Neil Mallinar, David Nahamoo, Vaibhava Goel. 1148-1157
- TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions. Qiang Ning, Hao Wu 0034, Rujun Han, Nanyun Peng, Matt Gardner 0001, Dan Roth. 1158-1172
- ToTTo: A Controlled Table-To-Text Generation Dataset. Ankur P. Parikh, Xuezhi Wang 0002, Sebastian Gehrmann, Manaal Faruqui, Bhuwan Dhingra, Diyi Yang, Dipanjan Das 0001. 1173-1186
- ENT-DESC: Entity Description Generation by Exploring Knowledge Graph. LiYing Cheng, Dekun Wu, Lidong Bing, Yan Zhang 0004, Zhanming Jie, Wei Lu 0011, Luo Si. 1187-1197
- Small but Mighty: New Benchmarks for Split and Rephrase. Li Zhang, Huaiyu Zhu 0001, Siddhartha Brahma, Yunyao Li 0001. 1198-1205
- Online Back-Parsing for AMR-to-Text Generation. Xuefeng Bai, Linfeng Song, Yue Zhang 0004. 1206-1219
- Reading Between the Lines: Exploring Infilling in Visual Narratives. Khyathi Raghavi Chandu, Ruo-Ping Dong, Alan W. Black. 1220-1229
- Acrostic Poem Generation. Rajat Agarwal, Katharina Kann. 1230-1240
- Local Additivity Based Data Augmentation for Semi-supervised NER. Jiaao Chen, Zhenghui Wang, Ran Tian, Zichao Yang, Diyi Yang. 1241-1251
- Grounded Compositional Outputs for Adaptive Language Modeling. Nikolaos Pappas 0002, Phoebe Mulcaire, Noah A. Smith. 1252-1267
- SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness. Nathan Ng, KyungHyun Cho, Marzyeh Ghassemi. 1268-1283
- SetConv: A New Approach for Learning from Imbalanced Data. Yang Gao 0027, Yi-Fan Li, Yu Lin, Charu C. Aggarwal, Latifur Khan. 1284-1294
- Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering. Yanlin Feng, Xinyue Chen, Bill Yuchen Lin, PeiFeng Wang, Jun Yan, Xiang Ren. 1295-1309
- Improving Bilingual Lexicon Induction for Low Frequency Words. Jiaji Huang, Xingyu Cai, Kenneth Church 0001. 1310-1314
- Learning VAE-LDA Models with Rounded Reparameterization Trick. Runzhi Tian, Yongyi Mao, Richong Zhang. 1315-1325
- Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data. Lingkai Kong, Haoming Jiang, Yuchen Zhuang, Jie Lyu 0005, Tuo Zhao, Chao Zhang. 1326-1340
- Scaling Hidden Markov Language Models. Justin T. Chiu, Alexander M. Rush. 1341-1349
- Coding Textual Inputs Boosts the Accuracy of Neural Networks. Abdul Rafae Khan, Jia Xu 0004, Weiwei Sun 0007. 1350-1360
- Learning from Task Descriptions. Orion Weller, Nicholas Lourie, Matt Gardner 0001, Matthew E. Peters. 1361-1375
- Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics. Keyang Ding, Jing Li, Yuji Zhang. 1376-1382
- Named Entity Recognition for Social Media Texts with Semantic Augmentation. Yuyang Nie, Yuanhe Tian, Xiang Wan, Yan Song, Bo Dai. 1383-1391
- Coupled Hierarchical Transformer for Stance-Aware Rumor Verification in Social Media Conversations. Jianfei Yu, Jing Jiang 0001, Ling Min Serena Khoo, Hai Leong Chieu, Rui Xia. 1392-1401
- Social Media Attributions in the Context of Water Crisis. Rupak Sarkar, Sayantan Mahinder, Hirak Sarkar, Ashiqur R. KhudaBukhsh. 1402-1412
- On the Reliability and Validity of Detecting Approval of Political Actors in Tweets. Indira Sen, Fabian Flöck, Claudia Wagner. 1413-1426
- Towards Medical Machine Reading Comprehension with Structural Knowledge and Plain Text. Dongfang Li, Baotian Hu, Qingcai Chen, Weihua Peng, Anqi Wang. 1427-1438
- Generating Radiology Reports via Memory-driven Transformer. Zhihong Chen, Yan Song, Tsung-Hui Chang, Xiang Wan. 1439-1449
- Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection. Jingfeng Yang, Diyi Yang, Zhaoran Ma. 1450-1460
- Predicting Clinical Trial Results by Implicit Evidence Integration. Qiao Jin, Chuanqi Tan, Mosha Chen, Xiaozhong Liu, Songfang Huang. 1461-1477
- Explainable Clinical Decision Support from Text. Jinyue Feng, Chantal Shaib, Frank Rudzicz. 1478-1489
- A Knowledge-driven Generative Model for Multi-implication Chinese Medical Procedure Entity Normalization. Jinghui Yan, Yining Wang, Lu Xiang, Yu Zhou 0001, Chengqing Zong. 1490-1499
- Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT. Akshay Smit, Saahil Jain, Pranav Rajpurkar, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren. 1500-1519
- Benchmarking Meaning Representations in Neural Semantic Parsing. Jiaqi Guo, Qian Liu, Jian-Guang Lou, Zhenwen Li, Xueqing Liu, Tao Xie 0001, Ting Liu 0002. 1520-1540
- Analogous Process Structure Induction for Sub-event Sequence Prediction. Hongming Zhang, Muhao Chen, Haoyu Wang, Yangqiu Song, Dan Roth. 1541-1550
- SLM: Learning a Discourse Language Representation with Sentence Unshuffling. Haejun Lee, Drew A. Hudson, Kangwook Lee, Christopher D. Manning. 1551-1562
- Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank. Eleftheria Briakou, Marine Carpuat. 1563-1580
- A Bilingual Generative Transformer for Semantic Sentence Embedding. John Wieting, Graham Neubig, Taylor Berg-Kirkpatrick. 1581-1594
- Semantically Inspired AMR Alignment for the Portuguese Language. Rafael T. Anchiêta, Thiago A. S. Pardo. 1595-1600
- An Unsupervised Sentence Embedding Method by Mutual Information Maximization. Yan Zhang 0004, Ruidan He, Zuozhu Liu, Kwan Hui Lim 0001, Lidong Bing. 1601-1610
- Compositional Phrase Alignment and Beyond. Yuki Arase, Jun'ichi Tsujii. 1611-1623
- Table Fact Verification with Structure-Aware Transformer. Hongzhi Zhang, Yingyao Wang, Sirui Wang, Xuezhi Cao, Fuzheng Zhang, Zhongyuan Wang. 1624-1629
- Double Graph Based Reasoning for Document-level Relation Extraction. Shuang Zeng, Runxin Xu, Baobao Chang, Lei Li 0005. 1630-1640
- Event Extraction as Machine Reading Comprehension. Jian Liu, Yubo Chen 0001, Kang Liu 0001, Wei Bi, Xiaojiang Liu. 1641-1651
- MAVEN: A Massive General Domain Event Detection Dataset. Xiaozhi Wang, Ziqi Wang, Xu Han 0007, Wangyi Jiang, Rong Han, Zhiyuan Liu 0001, Juanzi Li, Peng Li 0030, Yankai Lin, Jie Zhou 0016. 1652-1671
- Knowledge Graph Alignment with Entity-Pair Embedding. Zhichun Wang, Jinjian Yang, Xiaoju Ye. 1672-1680
- Adaptive Attentional Network for Few-Shot Knowledge Graph Completion. Jiawei Sheng, Shu Guo, Zhenyu Chen, Juwei Yue, Lihong Wang, Tingwen Liu, Hongbo Xu. 1681-1691
- Pre-training Entity Relation Encoder with Intra-span and Inter-span Information. Yijun Wang, Changzhi Sun, Yuanbin Wu, Junchi Yan, Peng Gao, Guotong Xie. 1692-1705
- Two are Better than One: Joint Entity and Relation Extraction with Table-Sequence Encoders. Jue Wang, Wei Lu 0011. 1706-1721
- Beyond [CLS] through Ranking by Generation. Cícero Nogueira dos Santos, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang. 1722-1727
- Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too! Suzanna Sia, Ayush Dalmia, Sabrina J. Mielke. 1728-1736
- Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning. Yuning Mao, Yanru Qu, Yiqing Xie, Xiang Ren, Jiawei Han 0001. 1737-1751
- Improving Neural Topic Models using Knowledge Distillation. Alexander Hoyle, Pranav Goel, Philip Resnik. 1752-1771
- Short Text Topic Modeling with Topic Distribution Quantization and Negative Sampling Decoder. Xiaobao Wu, Chunping Li, Yan Zhu, Yishu Miao. 1772-1782
- Querying Across Genres for Medical Claims in News. Chaoyuan Zuo, Narayan Acharya, Ritwik Banerjee. 1783-1789
- Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction. Yansen Wang, Zhen Fan 0003, Carolyn Penstein Rosé. 1790-1800
- CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French. AmirAli Bagher Zadeh, Yansheng Cao, Simon Hessner, Paul Pu Liang, Soujanya Poria, Louis-Philippe Morency. 1801-1812
- Combining Self-Training and Self-Supervised Learning for Unsupervised Disfluency Detection. Shaolei Wang, Zhongyuan Wang, Wanxiang Che, Ting Liu 0001. 1813-1822
- Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis. Yao-Hung Hubert Tsai, Martin Ma, Muqiao Yang, Ruslan Salakhutdinov, Louis-Philippe Morency. 1823-1833
- Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos. Nayu Liu, Xian Sun, Hongfeng Yu, Wenkai Zhang, Guangluan Xu. 1834-1845
- BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues. Hung Le, Doyen Sahoo, Nancy F. Chen, Steven C. H. Hoi. 1846-1859
- UniConv: A Unified Conversational Neural Architecture for Multi-domain Task-oriented Dialogues. Hung Le, Doyen Sahoo, Chenghao Liu, Nancy F. Chen, Steven C. H. Hoi. 1860-1877
- GraphDialog: Integrating Graph Knowledge into End-to-End Task-Oriented Dialogue Systems. Shiquan Yang, Rui Zhang, Sarah M. Erfani. 1878-1888
- Structured Attention for Unsupervised Dialogue Structure Induction. Liang Qiu, Yizhou Zhao, Weiyan Shi, Yuan Liang, Feng Shi, Tao Yuan, Zhou Yu, Song Chun Zhu. 1889-1899
- Cross Copy Network for Dialogue Generation. Changzhen Ji, Xin Zhou, Yating Zhang, Xiaozhong Liu, Changlong Sun, Conghui Zhu, Tiejun Zhao. 1900-1910
- Multi-turn Response Selection using Dialogue Dependency Relations. Qi Jia, Yizhu Liu, Siyu Ren, Kenny Q. Zhu, Haifeng Tang. 1911-1920
- Parallel Interactive Networks for Multi-Domain Dialogue State Generation. Junfan Chen, Richong Zhang, Yongyi Mao, Jie Xu 0007. 1921-1931
- SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot Filling. Di Wu, Liang Ding, Fan Lu, Jian Xie. 1932-1937
- An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction. Bhargavi Paranjape, Mandar Joshi, John Thickstun, Hannaneh Hajishirzi, Luke Zettlemoyer. 1938-1952
- CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models. Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman. 1953-1967
- LOGAN: Local Group Bias Detection by Clustering. Jieyu Zhao, Kai-Wei Chang. 1968-1977
- RNNs can generate bounded hierarchical languages with optimal memory. John Hewitt, Michael Hahn 0001, Surya Ganguli, Percy Liang, Christopher D. Manning. 1978-2010
- Detecting Independent Pronoun Bias with Partially-Synthetic Data Generation. Robert Munro, Alex Morrison. 2011-2017
- Visually Grounded Continual Learning of Compositional Phrases. Xisen Jin, Junyi Du, Arka Sadhu, Ram Nevatia, Xiang Ren. 2018-2029
- MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding. Qinxin Wang, Hao Tan, Sheng Shen, Michael W. Mahoney, Zhewei Yao. 2030-2038
- Domain-Specific Lexical Grounding in Noisy Visual-Textual Documents. Gregory Yauney, Jack Hessel, David Mimno. 2039-2045
- HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training. Linjie Li, Yen-Chun Chen 0001, Yu Cheng 0001, Zhe Gan, Licheng Yu, Jingjing Liu 0001. 2046-2065
- Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision. Hao Tan, Mohit Bansal. 2066-2080
- Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News. Reuben Tan, Bryan A. Plummer, Kate Saenko. 2081-2106
- Enhancing Aspect Term Extraction with Soft Prototypes. Zhuang Chen 0002, Tieyun Qian. 2107-2117
- FedED: Federated Learning via Ensemble Distillation for Medical Relation Extraction. Dianbo Sui, Yubo Chen 0001, Jun Zhao 0001, Yantao Jia, Yuantao Xie, Weijian Sun. 2118-2128
- Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product. Tiangang Zhu, Yue Wang, Haoran Li 0001, Youzheng Wu, Xiaodong He 0002, Bowen Zhou. 2129-2139
- A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression. Mingming Sun, Wenyue Hua, Zoey Liu, Xin Wang, Kangjie Zheng, Ping Li 0001. 2140-2150
- Retrofitting Structure-aware Transformer Language Model for End Tasks. Hao Fei 0001, Yafeng Ren, Donghong Ji. 2151-2161
- Lightweight, Dynamic Graph Convolutional Networks for AMR-to-Text Generation. Yan Zhang 0004, Zhijiang Guo, Zhiyang Teng, Wei Lu 0011, Shay B. Cohen, Zuozhu Liu, Lidong Bing. 2162-2172
- If beam search is the answer, what was the question? Clara Meister, Ryan Cotterell, Tim Vieira. 2173-2185
- Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning. Tsvetomila Mihaylova, Vlad Niculae, André F. T. Martins. 2186-2202
- Is the Best Better? Bayesian Statistical Model Comparison for Natural Language Processing. Piotr Szymanski, Kyle Gorman. 2203-2212
- Exploring Logically Dependent Multi-task Learning with Causal Inference. Wenqing Chen, Jidong Tian, Liqiang Xiao, Hao He 0007, Yaohui Jin. 2213-2225
- Masking as an Efficient Alternative to Finetuning for Pretrained Language Models. Mengjie Zhao, Tao Lin, Fei Mi, Martin Jaggi, Hinrich Schütze. 2226-2241
- Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning. Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong. 2242-2254
- Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation. Wenxiang Jiao, Xing Wang 0007, Shilin He, Irwin King, Michael R. Lyu, Zhaopeng Tu. 2255-2266
- Pronoun-Targeted Fine-tuning for NMT with Hybrid Losses. Prathyusha Jwalapuram, Shafiq R. Joty, Youlin Shen. 2267-2279
- Learning Adaptive Segmentation Policy for Simultaneous Translation. Ruiqing Zhang, Chuanqiang Zhang, Zhongjun He, Hua Wu 0003, Haifeng Wang. 2280-2289
- Learn to Cross-lingual Transfer with Meta Graph Learning Across Heterogeneous Languages. Zheng Li 0018, Mukul Kumar, William Headden, Bing Yin, Ying Wei 0001, Yu Zhang, Qiang Yang 0001. 2290-2301
- UDapter: Language Adaptation for Truly Universal Dependency Parsing. Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord. 2302-2315
- Uncertainty-Aware Label Refinement for Sequence Labeling. Tao Gui, Jiacheng Ye, Qi Zhang 0001, Zhengyan Li, Zichu Fei, Yeyun Gong, Xuanjing Huang. 2316-2326
- Adversarial Attack and Defense of Structured Prediction Models. Wenjuan Han, Liwen Zhang, Yong Jiang, Kewei Tu. 2327-2338
- Position-Aware Tagging for Aspect Sentiment Triplet Extraction. Lu Xu, Hao Li, Wei Lu 0011, Lidong Bing. 2339-2349
- Simultaneous Machine Translation with Visual Context. Ozan Caglayan, Julia Ive, Veneta Haralampieva, Pranava Madhyastha, Loïc Barrault, Lucia Specia. 2350-2361
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning. Edoardo Maria Ponti, Goran Glavas, Olga Majewska, Qianchu Liu, Ivan Vulic, Anna Korhonen. 2362-2376
- The Secret is in the Spectra: Predicting Cross-lingual Task Performance with Spectral Similarity Measures. Haim Dubossarsky, Ivan Vulic, Roi Reichart, Anna Korhonen. 2377-2390
- Bridging Linguistic Typology and Multilingual Machine Translation with Multi-View Language Representations. Arturo Oncevay, Barry Haddow, Alexandra Birch. 2391-2406
- AnswerFact: Fact Checking in Product Question Answering. Wenxuan Zhang, Yang Deng, Jing Ma, Wai Lam. 2407-2417
- Context-Aware Answer Extraction in Question Answering. Yeon Seonwoo, Ji-Hoon Kim, Jung-Woo Ha, Alice Oh. 2418-2428
- What do Models Learn from Question Answering Datasets? Priyanka Sen, Amir Saffari. 2429-2438
- Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading. Yifan Gao 0001, Chien-Sheng Wu, Jingjing Li, Shafiq R. Joty, Steven C. H. Hoi, Caiming Xiong, Irwin King, Michael R. Lyu. 2439-2449
- A Method for Building a Commonsense Inference Dataset based on Basic Events. Kazumasa Omura, Daisuke Kawahara, Sadao Kurohashi. 2450-2460
- Neural Deepfake Detection with Factual Structure of Text. Wanjun Zhong, Duyu Tang, Zenan Xu, Ruize Wang, Nan Duan, Ming Zhou 0001, Jiahai Wang, Jian Yin 0001. 2461-2470
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale. Andreas Rücklé, Jonas Pfeiffer, Iryna Gurevych. 2471-2486
- XL-AMR: Enabling Cross-Lingual AMR Parsing with Transfer Learning Techniques. Rexhina Blloshmi, Rocco Tripodi, Roberto Navigli. 2487-2500
- Improving AMR Parsing with Sequence-to-Sequence Pre-training. Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang 0005, Guodong Zhou. 2501-2511
- Hate-Speech and Offensive Language Detection in Roman Urdu. Hammad Rizwan, Muhammad Haroon Shakeel, Asim Karim. 2512-2522
- Suicidal Risk Detection for Military Personnel. Sungjoon Park, Ki-Woong Park, Jaimeen Ahn, Alice Oh. 2523-2531
- Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets. Nedjma Ousidhoum, Yangqiu Song, Dit-Yan Yeung. 2532-2542
- HENIN: Learning Heterogeneous Neural Interaction Networks for Explainable Cyberbullying Detection on Social Media. Hsin-Yu Chen, Cheng-Te Li. 2543-2552
- Reactive Supervision: A New Method for Collecting Sarcasm Data. Boaz Shmueli, Lun-Wei Ku, Soumya Ray. 2553-2559
- Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation. Dana Ruiter, Josef van Genabith, Cristina España-Bonet. 2560-2571
- Towards Reasonably-Sized Character-Level Transformer NMT by Finetuning Subword Systems. Jindrich Libovický, Alexander Fraser. 2572-2579
- Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages. Michael A. Hedderich, David Ifeoluwa Adelani, Dawei Zhu, Jesujoba O. Alabi, Udia Markus, Dietrich Klakow. 2580-2591
- Translation Quality Estimation by Jointly Learning to Score and Rank. Jingyi Zhang, Josef van Genabith. 2592-2598
- Direct Segmentation Models for Streaming Speech Translation. Javier Iranzo-Sánchez, Adrià Giménez-Pastor, Joan Albert Silvestre-Cerdà, Pau Baquero-Arnal, Jorge Civera Saiz, Alfons Juan. 2599-2611
- Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation. Tahmid Hasan, Abhik Bhattacharjee, Kazi Samin, Masum Hasan, Madhusudan Basak, M. Sohel Rahman, Rifat Shahriyar. 2612-2623
- CSP: Code-Switching Pre-training for Neural Machine Translation. Zhen Yang, Bojie Hu, Ambyera Han, Shen Huang, Qi Ju. 2624-2636
- Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias. Ana Valeria González-Garduño, Maria Barrett, Rasmus Hvingelby, Kellie Webster, Anders Søgaard. 2637-2648
- Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information. Zehui Lin, Xiao Pan, Mingxuan Wang, Xipeng Qiu, JiangTao Feng, Hao Zhou 0012, Lei Li 0005. 2649-2663
- Losing Heads in the Lottery: Pruning Transformer Attention in Neural Machine Translation. Maximiliana Behnke, Kenneth Heafield. 2664-2674
- Towards Enhancing Faithfulness for Neural Machine Translation. Rongxiang Weng, Heng Yu, Xiangpeng Wei, Weihua Luo. 2675-2684
- COMET: A Neural Framework for MT Evaluation. Ricardo Rei, Craig Stewart, Ana C. Farinha, Alon Lavie. 2685-2702
- Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMTAlexandra Chronopoulou, Dario Stojanovski, Alexander M. Fraser. 2703-2711 [doi]
- LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent SpaceTasnim Mohiuddin, M. Saiful Bari, Shafiq Rayhan Joty. 2712-2723 [doi]
- Uncertainty-Aware Semantic Augmentation for Neural Machine TranslationXiangpeng Wei, Heng Yu, Yue Hu 0002, Rongxiang Weng, Luxi Xing, Weihua Luo. 2724-2735 [doi]
- Can Automatic Post-Editing Improve NMT?Shamil Chollampatt, Raymond Hendy Susanto, Liling Tan, Ewa Szymanska. 2736-2746 [doi]
- Parsing Gapping Constructions Based on Grammatical and Semantic RolesYoshihide Kato, Shigeki Matsubara. 2747-2752 [doi]
- Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n^6) down to O(n^3)Caio Corro. 2753-2764 [doi]
- Some Languages Seem Easier to Parse Because Their Treebanks LeakAnders Søgaard. 2765-2770 [doi]
- Discontinuous Constituent Parsing as Sequence LabelingDavid Vilares, Carlos Gómez-Rodríguez. 2771-2785 [doi]
- Modularized Syntactic Neural Networks for Sentence ClassificationHaiyan Wu, Ying Liu, Shaoyun Shi. 2786-2792 [doi]
- TED-CDB: A Large-Scale Chinese Discourse Relation Dataset on TED TalksWanqiu Long, Bonnie Webber, Deyi Xiong. 2793-2803 [doi]
- QADiscourse - Discourse Relations as QA Pairs: Representation, Crowdsourcing and BaselinesValentina Pyatkin, Ayal Klein, Reut Tsarfaty, Ido Dagan. 2804-2819 [doi]
- Discourse Self-Attention for Discourse Element Identification in Argumentative Student EssaysWei Song, Ziyao Song, Ruiji Fu, Lizhen Liu, MiaoMiao Cheng, Ting Liu 0001. 2820-2830 [doi]
- MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language ModelsPeng Xu, Mostofa Patwary, Mohammad Shoeybi, Raul Puri, Pascale Fung, Anima Anandkumar, Bryan Catanzaro. 2831-2845 [doi]
- Incomplete Utterance Rewriting as Semantic SegmentationQian Liu, Bei Chen, Jian-Guang Lou, Bin Zhou, Dongmei Zhang. 2846-2857 [doi]
- Improving Grammatical Error Correction Models with Purpose-Built Adversarial ExamplesLihao Wang, Xiaoqing Zheng. 2858-2869 [doi]
- Homophonic Pun Generation with Lexically Constrained RewritingZhiwei Yu, Hongyu Zang, Xiaojun Wan 0001. 2870-2876 [doi]
- How to Make Neural Natural Language Generation as Reliable as Templates in Task-Oriented DialogueHenry Elder, Alexander O'Connor, Jennifer Foster. 2877-2888 [doi]
- Multilingual AMR-to-Text GenerationAngela Fan, Claire Gardent. 2889-2901 [doi]
- Exploring the Linear Subspace Hypothesis in Gender Bias MitigationFrancisco Vargas, Ryan Cotterell. 2902-2913 [doi]
- Lifelong Language Knowledge DistillationYung-Sung Chuang, Shang-Yu Su, Yun-Nung Chen. 2914-2924 [doi]
- Sparse Parallel Training of Hierarchical Dirichlet Process Topic ModelsAlexander Terenin, Måns Magnusson, Leif Jonsson. 2925-2934 [doi]
- Multi-label Few/Zero-shot Learning with Knowledge Aggregated from Multiple Label GraphsJueqing Lu, Lan Du, Ming Liu, Joanna Dipnall. 2935-2943 [doi]
- Word Rotator's DistanceSho Yokoi, Ryo Takahashi, Reina Akama, Jun Suzuki, Kentaro Inui. 2944-2960 [doi]
- Disentangle-based Continual Graph Representation LearningXiaoyu Kou, Yankai Lin, Shaobo Liu, Peng Li 0030, Jie Zhou 0016, Yan Zhang. 2961-2972 [doi]
- Semi-Supervised Bilingual Lexicon Induction with Two-way InteractionXu Zhao, Zihao Wang, Hao Wu, Yong Zhang. 2973-2984 [doi]
- Wasserstein Distance Regularized Sequence Representation for Text Matching in Asymmetrical DomainsWeijie Yu, Chen Xu, Jun Xu 0001, Liang Pang, Xiaopeng Gao, Xiaozhao Wang, Ji-Rong Wen. 2985-2994 [doi]
- A Simple Approach to Learning Unsupervised Multilingual EmbeddingsPratik Jawanpuria, Mayank Meghwanshi, Bamdev Mishra. 2995-3001 [doi]
- Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based GamesSubhajit Chaudhury, Daiki Kimura, Kartik Talamadupula, Michiaki Tatsubori, Asim Munawar, Ryuki Tachibana. 3002-3008 [doi]
- BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's DistanceJianquan Li, Xiaokang Liu, Honghong Zhao, Ruifeng Xu, Min Yang, Yaohong Jin. 3009-3018 [doi]
- Slot Attention with Value Normalization for Multi-Domain Dialogue State TrackingYexiang Wang, Yi Guo, Siqi Zhu. 3019-3028 [doi]
- Don't Read Too Much Into It: Adaptive Computation for Open-Domain Question AnsweringYuxiang Wu, Sebastian Riedel 0001, Pasquale Minervini, Pontus Stenetorp. 3029-3039 [doi]
- Multi-Step Inference for Reasoning Over ParagraphsJiangming Liu, Matt Gardner 0001, Shay B. Cohen, Mirella Lapata. 3040-3050 [doi]
- Learning a Cost-Effective Annotation Policy for Question AnsweringBernhard Kratzwald, Stefan Feuerriegel, Huan Sun. 3051-3062 [doi]
- Scene Restoring for Narrative Machine Reading ComprehensionZhixing Tian, Yuanzhe Zhang, Kang Liu 0001, Jun Zhao 0001, Yantao Jia, Zhicheng Sheng. 3063-3073 [doi]
- A Simple and Effective Model for Answering Multi-span QuestionsElad Segal, Avia Efrat, Mor Shoham, Amir Globerson, Jonathan Berant. 3074-3080 [doi]
- Top-Rank-Focused Adaptive Vote Collection for the Evaluation of Domain-Specific Semantic ModelsPierangelo Lombardo, Alessio Boiardi, Luca Colombo, Angelo Schiavone, Nicolò Tamagnone. 3081-3093 [doi]
- Meta Fine-Tuning Neural Language Models for Multi-Domain Text MiningChengyu Wang 0001, Minghui Qiu, Jun Huang 0007, Xiaofeng He. 3094-3104 [doi]
- Incorporating Behavioral Hypotheses for Query GenerationRuey-Cheng Chen, Chia-Jung Lee. 3105-3110 [doi]
- Conditional Causal Relationships between Emotions and Causes in TextsXinhong Chen, Qing Li, Jianping Wang. 3111-3121 [doi]
- COMETA: A Corpus for Medical Entity Linking in the Social MediaMarco Basaldella, Fangyu Liu 0001, Ehsan Shareghi, Nigel Collier. 3122-3137 [doi]
- Pareto Probing: Trading Off Accuracy for ComplexityTiago Pimentel, Naomi Saphra, Adina Williams, Ryan Cotterell. 3138-3153 [doi]
- Interpretation of NLP models through input marginalizationSiwon Kim, Jihun Yi, Eunji Kim, Sungroh Yoon. 3154-3167 [doi]
- Generating Label Cohesive and Well-Formed Adversarial ClaimsPepa Atanasova, Dustin Wright, Isabelle Augenstein. 3168-3177 [doi]
- Are All Good Word Vector Spaces Isomorphic?Ivan Vulic, Sebastian Ruder, Anders Søgaard. 3178-3192 [doi]
- Cold-Start and Interpretability: Turning Regular Expressions into Trainable Recurrent Neural NetworksChengyue Jiang, Yinggong Zhao, Shanbo Chu, Libin Shen, Kewei Tu. 3193-3207 [doi]
- When BERT Plays the Lottery, All Tickets Are WinningSai Prasanna, Anna Rogers, Anna Rumshisky. 3208-3229 [doi]
- On the weak link between importance and prunability of attention headsAakriti Budhraja, Madhura Pande, Preksha Nema, Pratyush Kumar, Mitesh M. Khapra. 3230-3235 [doi]
- Towards Interpreting BERT for Reading Comprehension Based QASahana Ramnath, Preksha Nema, Deep Sahni, Mitesh M. Khapra. 3236-3242 [doi]
- How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable MaskingNicola De Cao, Michael Sejr Schlichtkrull, Wilker Aziz, Ivan Titov. 3243-3255 [doi]
- A Diagnostic Study of Explainability Techniques for Text ClassificationPepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein. 3256-3274 [doi]
- STL-CQA: Structure-based Transformers with Localization and Encoding for Chart Question AnsweringHrituraj Singh, Sumit Shekhar. 3275-3284 [doi]
- Learning to Contrast the Counterfactual Samples for Robust Visual Question AnsweringZujie Liang, Weitao Jiang, Haifeng Hu 0001, Jiaying Zhu. 3285-3292 [doi]
- Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker FactorizationZhenjie Zhao, Evangelos E. Papalexakis, Xiaojuan Ma. 3293-3298 [doi]
- A Visually-grounded First-person Dialogue Dataset with Verbal and Non-verbal ResponsesHisashi Kamezawa, Noriki Nishida, Nobuyuki Shimizu, Takashi Miyazaki, Hideki Nakayama. 3299-3310 [doi]
- Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attention and Image WordingsYue Wang 0034, Jing Li, Michael R. Lyu, Irwin King. 3311-3324 [doi]
- VD-BERT: A Unified Vision and Dialog Transformer with BERTYue Wang 0034, Shafiq R. Joty, Michael R. Lyu, Irwin King, Caiming Xiong, Steven C. H. Hoi. 3325-3338 [doi]
- The Grammar of Emergent LanguagesOskar van der Wal, Silvan de Boer, Elia Bruni, Dieuwke Hupkes. 3339-3359 [doi]
- Sub-Instruction Aware Vision-and-Language NavigationYicong Hong, Cristian Rodriguez Opazo, Qi Wu 0001, Stephen Gould. 3360-3376 [doi]
- Knowledge-Grounded Dialogue Generation with Pre-trained Language ModelsXueliang Zhao, Wei Wu 0014, Can Xu, Chongyang Tao, Dongyan Zhao 0001, Rui Yan 0001. 3377-3390 [doi]
- MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue SystemsZhaojiang Lin, Andrea Madotto, Genta Indra Winata, Pascale Fung. 3391-3405 [doi]
- Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data AugmentationKang Min Yoo, Hanbit Lee, Franck Dernoncourt, Trung Bui, Walter Chang, Sang-goo Lee. 3406-3425 [doi]
- Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue GenerationXiuyi Chen, Fandong Meng, Peng Li 0030, Feilong Chen, Shuang Xu, Bo Xu 0002, Jie Zhou 0016. 3426-3437 [doi]
- Counterfactual Off-Policy Training for Neural Dialogue GenerationQingfu Zhu, Wei-Nan Zhang 0003, Ting Liu, William Yang Wang. 3438-3448 [doi]
- Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired DataRongsheng Zhang, Yinhe Zheng, Jianzhi Shao, Xiaoxi Mao, Yadong Xi, Minlie Huang. 3449-3460 [doi]
- Task-Completion Dialogue Policy Learning via Monte Carlo Tree Search with Dueling NetworkSihan Wang, Kaijie Zhou, Kunfeng Lai, Jianping Shen. 3461-3471 [doi]
- Learning a Simple and Effective Model for Multi-turn Response Generation with Auxiliary TasksYufan Zhao, Can Xu, Wei Wu. 3472-3483 [doi]
- AttnIO: Knowledge Graph Exploration with In-and-Out Attention Flow for Knowledge-Grounded DialogueJaehun Jung, Bokyung Son, Sungwon Lyu. 3484-3497 [doi]
- Amalgamating Knowledge from Two Teachers for Task-oriented Dialogue System with Adversarial TrainingWanwei He, Min Yang 0007, Rui Yan, Chengming Li, Ying Shen, Ruifeng Xu. 3498-3507 [doi]
- Task-oriented Domain-specific Meta-Embedding for Text ClassificationXin Wu, Yi Cai, Kai Yang, Tao Wang 0041, Qing Li 0001. 3508-3513 [doi]
- Don't Neglect the Obvious: On the Role of Unambiguous Words in Word Sense DisambiguationDaniel Loureiro, José Camacho-Collados. 3514-3520 [doi]
- Within-Between Lexical Relation ClassificationOren Barkan, Avi Caciularu, Ido Dagan. 3521-3527 [doi]
- With More Contexts Comes Better Performance: Contextualized Sense Embeddings for All-Round Word Sense DisambiguationBianca Scarlini, Tommaso Pasini, Roberto Navigli. 3528-3539 [doi]
- Convolution over Hierarchical Syntactic and Lexical Graphs for Aspect Level Sentiment AnalysisMi Zhang, Tieyun Qian. 3540-3549 [doi]
- Multi-Instance Multi-Label Learning Networks for Aspect-Category Sentiment AnalysisYuncong Li, Cunxiang Yin, Sheng-hua Zhong, Xu Pan. 3550-3560 [doi]
- Aspect Sentiment Classification with Aspect-Specific Opinion SpansLu Xu, Lidong Bing, Wei Lu 0011, Fei Huang. 3561-3567 [doi]
- Emotion-Cause Pair Extraction as Sequence Labeling Based on A Novel Tagging SchemeChaofa Yuan, Chuang Fan, Jianzhu Bao, Ruifeng Xu. 3568-3573 [doi]
- End-to-End Emotion-Cause Pair Extraction based on Sliding Window Multi-Label LearningZixiang Ding, Rui Xia, Jianfei Yu. 3574-3583 [doi]
- Multi-modal Multi-label Emotion Detection with Modality and Label DependenceDong Zhang, Xincheng Ju, Junhui Li, Shoushan Li, Qiaoming Zhu, Guodong Zhou. 3584-3593 [doi]
- Tasty Burgers, Soggy Fries: Probing Aspect Robustness in Aspect-Based Sentiment AnalysisXiaoyu Xing, Zhijing Jin, Di Jin, Bingning Wang, Qi Zhang 0001, Xuanjing Huang. 3594-3605 [doi]
- Modeling Content Importance for Summarization with Pre-trained Language ModelsLiqiang Xiao, Lu Wang 0008, Hao He 0007, Yaohui Jin. 3606-3611 [doi]
- Unsupervised Reference-Free Summary Quality Evaluation via Contrastive LearningHanlu Wu, Tengfei Ma, Lingfei Wu, Tariro Manyumwa, Shouling Ji. 3612-3621 [doi]
- Neural Extractive Summarization with Hierarchical Attentive Heterogeneous Graph NetworkRuipeng Jia, Yanan Cao, Hengzhu Tang, Fang Fang, Cong Cao, Shi Wang. 3622-3631 [doi]
- Coarse-to-Fine Query Focused Multi-Document SummarizationYumo Xu, Mirella Lapata. 3632-3645 [doi]
- Pre-training for Abstractive Document Summarization by Reinstating Source TextYanyan Zou, Xingxing Zhang, Wei Lu 0011, Furu Wei, Ming Zhou 0001. 3646-3660 [doi]
- Learning from Context or Names? An Empirical Study on Neural Relation ExtractionHao Peng, Tianyu Gao, Xu Han 0007, Yankai Lin, Peng Li 0030, Zhiyuan Liu 0001, Maosong Sun, Jie Zhou 0016. 3661-3672 [doi]
- SelfORE: Self-supervised Relational Feature Learning for Open Relation ExtractionXuming Hu, Lijie Wen, Yusong Xu, Chenwei Zhang, Philip S. Yu. 3673-3682 [doi]
- Denoising Relation Extraction from Document-level Distant SupervisionChaojun Xiao, Yuan Yao, Ruobing Xie, Xu Han 0007, Zhiyuan Liu 0001, Maosong Sun, Fen Lin, Leyu Lin. 3683-3688 [doi]
- Let's Stop Incorrect Comparisons in End-to-end Relation Extraction!Bruno Taillé, Vincent Guigue, Geoffrey Scoutheeten, Patrick Gallinari. 3689-3701 [doi]
- Exposing Shallow Heuristics of Relation Extraction Models with Challenge DataShachar Rosenman, Alon Jacovi, Yoav Goldberg. 3702-3710 [doi]
- Global-to-Local Neural Networks for Document-Level Relation ExtractionDifeng Wang, Wei Hu, Ermei Cao, Weijian Sun. 3711-3721 [doi]
- Recurrent Interaction Network for Jointly Extracting Entities and Classifying RelationsKai Sun, Richong Zhang, Samuel Mensah, Yongyi Mao, Xudong Liu 0001. 3722-3732 [doi]
- Temporal Knowledge Base Completion: New Algorithms and Evaluation ProtocolsPrachi Jain 0001, Sushant Rathi, Mausam, Soumen Chakrabarti. 3733-3747 [doi]
- OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information ExtractionKeshav Kolluru, Vaibhav Adlakha, Samarth Aggarwal, Mausam, Soumen Chakrabarti. 3748-3761 [doi]
- Public Sentiment Drift Analysis Based on Hierarchical Variational Auto-encoderWenyue Zhang, Xiaoli Li 0001, Yang Li 0074, Suge Wang, Deyu Li, Jian Liao, Jianxing Zheng. 3762-3767 [doi]
- Point to the Expression: Solving Algebraic Word Problems using the Expression-Pointer Transformer ModelBugeun Kim, Kyung Seo Ki, Donggeon Lee, Gahgene Gweon. 3768-3779 [doi]
- Semantically-Aligned Universal Tree-Structured Solver for Math Word ProblemsJinghui Qin, Lihui Lin, Xiaodan Liang, Rumin Zhang, Liang Lin. 3780-3789 [doi]
- Neural Topic Modeling by Incorporating Document Relationship GraphDeyu Zhou, Xuemeng Hu, Rui Wang 0043. 3790-3796 [doi]
- Routing Enforced Generative Model for Recipe GenerationZhiwei Yu, Hongyu Zang, Xiaojun Wan 0001. 3797-3806 [doi]
- Assessing the Helpfulness of Learning Materials with Inference-Based Learner-Like AgentYun-Hsuan Jen, Chieh-Yang Huang, Mei-hua Chen, Ting-Hao K. Huang, Lun-Wei Ku. 3807-3817 [doi]
- Selection and Generation: Learning towards Multi-Product Advertisement Post GenerationZhangming Chan, Yuchi Zhang, Xiuying Chen, Shen Gao, Zhiqiang Zhang 0011, Dongyan Zhao 0001, Rui Yan 0001. 3818-3829 [doi]
- Form2Seq : A Framework for Higher-Order Form Structure ExtractionMilan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy. 3830-3840 [doi]
- Domain Adaptation of Thai Word Segmentation Models using Stacked EnsemblePeerat Limkonchotiwat, Wannaphong Phatthiyaphaibun, Raheem Sarwar, Ekapol Chuangsuwanich, Sarana Nutanong. 3841-3847 [doi]
- DagoBERT: Generating Derivational Morphology with a Pretrained Language ModelValentin Hofmann, Janet B. Pierrehumbert, Hinrich Schütze. 3848-3861 [doi]
- Attention Is All You Need for Chinese Word SegmentationSufeng Duan, Hai Zhao. 3862-3872 [doi]
- A Joint Multiple Criteria Model in Transfer Learning for Cross-domain Chinese Word SegmentationKaiyu Huang, Degen Huang, Zhuang Liu 0001, Fengran Mo. 3873-3882 [doi]
- Alignment-free Cross-lingual Semantic Role LabelingRui Cai, Mirella Lapata. 3883-3894 [doi]
- Leveraging Declarative Knowledge in Text and First-Order Logic for Fine-Grained Propaganda DetectionRuize Wang, Duyu Tang, Nan Duan, Wanjun Zhong, Zhongyu Wei, Xuanjing Huang, Daxin Jiang, Ming Zhou 0001. 3895-3903 [doi]
- X-SRL: A Parallel Cross-Lingual Semantic Role Labeling DatasetAngel Daza, Anette Frank. 3904-3914 [doi]
- Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role LabelingDiego Marcheggiani, Ivan Titov. 3915-3928 [doi]
- Fast semantic parsing with well-typedness guaranteesMatthias Lindemann, Jonas Groschwitz, Alexander Koller. 3929-3951 [doi]
- Improving Out-of-Scope Detection in Intent Classification by Using Embeddings of the Word Graph Space of the ClassesPaulo R. Cavalin, Victor Henrique Alves Ribeiro, Ana Paula Appel, Claudio S. Pinhanez. 3952-3961 [doi]
- Supervised Seeded Iterated Learning for Interactive Language LearningYuchen Lu, Soumye Singhal, Florian Strub, Olivier Pietquin, Aaron C. Courville. 3962-3970 [doi]
- Spot The Bot: A Robust and Efficient Framework for the Evaluation of Conversational Dialogue SystemsJan Deriu, Don Tuggener, Pius von Däniken, Jon Ander Campos, Álvaro Rodrigo, Thiziri Belkacem, Aitor Soroa, Eneko Agirre, Mark Cieliebak. 3971-3984 [doi]
- Human-centric dialog training via offline reinforcement learningNatasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Àgata Lapedriza, Noah Jones, Shixiang Gu, Rosalind W. Picard. 3985-4003 [doi]
- Speakers Fill Lexical Semantic Gaps with ContextTiago Pimentel, Rowan Hall Maudslay, Damián E. Blasi, Ryan Cotterell. 4004-4015 [doi]
- Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable ModelJun Yen Leung, Guy Emerson, Ryan Cotterell. 4016-4028 [doi]
- Surprisal Predicts Code-Switching in Chinese-English Bilingual TextJesús Calvillo, Le Fang, Jeremy R. Cole, David Reitter. 4029-4039 [doi]
- Word Frequency Does Not Predict Grammatical Knowledge in Language ModelsCharles Yu, Ryan Sie, Nico Tedeschi, Leon Bergen. 4040-4054 [doi]
- Improving Word Sense Disambiguation with TranslationsYixing Luan, Bradley Hauer, Lili Mou, Grzegorz Kondrak. 4055-4065 [doi]
- Towards Better Context-aware Lexical Semantics: Adjusting Contextualized Representations through Static AnchorsQianchu Liu, Diana McCarthy, Anna Korhonen. 4066-4075 [doi]
- Compositional Demographic Word EmbeddingsCharles Welch, Jonathan K. Kummerfeld, Verónica Pérez-Rosas, Rada Mihalcea. 4076-4089 [doi]
- Do "Undocumented Workers" == "Illegal Aliens"? Differentiating Denotation and Connotation in Vector SpacesAlbert Webson, Zhizhong Chen, Carsten Eickhoff, Ellie Pavlick. 4090-4105 [doi]
- Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue SummarizationJiaao Chen, Diyi Yang. 4106-4118 [doi]
- Few-Shot Learning for Opinion SummarizationArthur Brazinskas, Mirella Lapata, Ivan Titov. 4119-4135 [doi]
- Learning to Fuse Sentences with Transformers for SummarizationLogan Lebanoff, Franck Dernoncourt, Doo Soon Kim, Lidan Wang, Walter Chang, Fei Liu 0004. 4136-4142 [doi]
- Stepwise Extractive Summarization and Planning with Structured TransformersShashi Narayan, Joshua Maynez, Jakub Adámek, Daniele Pighin, Blaz Bratanic, Ryan T. McDonald. 4143-4159 [doi]
- CLIRMatrix: A massively large collection of bilingual and multilingual datasets for Cross-Lingual Information RetrievalShuo Sun, Kevin Duh. 4160-4170 [doi]
- SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature SearchSean MacAvaney, Arman Cohan, Nazli Goharian. 4171-4179 [doi]
- Modularized Transfomer-based Ranking FrameworkLuyu Gao, Zhuyun Dai, Jamie Callan. 4180-4190 [doi]
- Ad-hoc Document Retrieval using Weak-Supervision with BERT and GPT2Yosi Mass, Haggai Roitman. 4191-4197 [doi]
- Adversarial Semantic CollisionsCongzheng Song, Alexander M. Rush, Vitaly Shmatikov. 4198-4210 [doi]
- Learning Explainable Linguistic Expressions with Neural Inductive Logic Programming for Sentence ClassificationPrithviraj Sen, Marina Danilevsky, Yunyao Li 0001, Siddhartha Brahma, Matthias Boehm 0001, Laura Chiticariu, Rajasekar Krishnamurthy. 4211-4221 [doi]
- AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated PromptsTaylor Shin, Yasaman Razeghi, Robert L. Logan IV, Eric Wallace, Sameer Singh 0001. 4222-4235 [doi]
- Learning Variational Word Masks to Improve the Interpretability of Neural Text ClassifiersHanjie Chen, Yangfeng Ji. 4236-4251 [doi]
- Sparse Text GenerationPedro Henrique Martins, Zita Marinho, André F. T. Martins. 4252-4273 [doi]
- PlotMachines: Outline-Conditioned Generation with Dynamic Plot State TrackingHannah Rashkin, Asli Çelikyilmaz, Yejin Choi, Jianfeng Gao. 4274-4295 [doi]
- Do sequence-to-sequence VAEs learn global features of sentences?Tom Bosc, Pascal Vincent. 4296-4318 [doi]
- Content Planning for Neural Story Generation with Aristotelian RescoringSeraphina Goldfarb-Tarrant, Tuhin Chakrabarty, Ralph M. Weischedel, Nanyun Peng. 4319-4338 [doi]
- Generating Dialogue Responses from a Semantic Latent SpaceWei-Jen Ko, Avik Ray, Yilin Shen, Hongxia Jin. 4339-4349 [doi]
- Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational ContextsEce Takmaz, Mario Giulianelli, Sandro Pezzelle, Arabella Sinclair, Raquel Fernández. 4350-4368 [doi]
- Visually Grounded Compound PCFGsYanpeng Zhao, Ivan Titov. 4369-4379 [doi]
- ALICE: Active Learning with Contrastive Natural Language ExplanationsWeixin Liang, James Zou, Zhou Yu. 4380-4391 [doi]
- Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal GroundingAlexander Ku, Peter Anderson, Roma Patel, Eugene Ie, Jason Baldridge. 4392-4412 [doi]
- SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual ReasoningTsu-Jui Fu, Xin Wang 0061, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang. 4413-4422 [doi]
- Identifying Elements Essential for BERT's MultilingualityPhilipp Dufter, Hinrich Schütze. 4423-4437 [doi]
- On Negative Interference in Multilingual Models: Findings and A Meta-Learning TreatmentZirui Wang, Zachary C. Lipton, Yulia Tsvetkov. 4438-4450 [doi]
- Pre-tokenization of Multi-word Expressions in Cross-lingual Word EmbeddingsNaoki Otani, Satoru Ozaki, Xingyuan Zhao, Yucen Li, Micaelah St Johns, Lori Levin. 4451-4464 [doi]
- Monolingual Adapters for Zero-Shot Neural Machine TranslationJerin Philip, Alexandre Berard, Matthias Gallé, Laurent Besacier. 4465-4470 [doi]
- Do Explicit Alignments Robustly Improve Multilingual Encoders?Shijie Wu, Mark Dredze. 4471-4482 [doi]
- From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual TransformersAnne Lauscher, Vinit Ravishankar, Ivan Vulic, Goran Glavas. 4483-4499 [doi]
- Distilling Multiple Domains for Neural Machine TranslationAnna Currey, Prashant Mathur, Georgiana Dinu. 4500-4511 [doi]
- Making Monolingual Sentence Embeddings Multilingual using Knowledge DistillationNils Reimers, Iryna Gurevych. 4512-4525 [doi]
- A Streaming Approach For Efficient Batched Beam SearchKevin Yang, Violet Yao, John DeNero, Dan Klein. 4526-4535 [doi]
- Improving Multilingual Models with Language-Clustered VocabulariesHyung Won Chung, Dan Garrette, Kiat Chuan Tan, Jason Riesa. 4536-4546 [doi]
- Zero-Shot Cross-Lingual Transfer with Meta LearningFarhad Nooralahzadeh, Giannis Bekoulis, Johannes Bjerva, Isabelle Augenstein. 4547-4562 [doi]
- The Multilingual Amazon Reviews CorpusPhillip Keung, Yichao Lu, György Szarvas, Noah A. Smith. 4563-4568 [doi]
- GLUCOSE: GeneraLized and COntextualized Story ExplanationsNasrin Mostafazadeh, Aditya Kalyanpur, Lori Moon, David W. Buchanan, Lauren Berkowitz, Or Biran, Jennifer Chu-Carroll. 4569-4586 [doi]
- Character-level Representations Improve DRS-based Semantic Parsing Even in the Age of BERTRik van Noord, Antonio Toral, Johan Bos. 4587-4603 [doi]
- Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name RecognitionYun He, Ziwei Zhu, Yin Zhang, Qin Chen, James Caverlee. 4604-4614 [doi]
- Unsupervised Commonsense Question Answering with Self-TalkVered Shwartz, Peter West, Ronan Le Bras, Chandra Bhagavatula, Yejin Choi. 4615-4629 [doi]
- Reasoning about Goals, Steps, and Temporal Ordering with WikiHowLi Zhang, Qing Lyu, Chris Callison-Burch. 4630-4639 [doi]
- Structural Supervision Improves Few-Shot Learning and Syntactic Generalization in Neural Language ModelsEthan Wilcox, Peng Qian, Richard Futrell, Ryosuke Kohita, Roger Levy, Miguel Ballesteros. 4640-4652 [doi]
- Investigating representations of verb bias in neural language modelsRobert X. D. Hawkins, Takateru Yamakoshi, Thomas L. Griffiths, Adele E. Goldberg. 4653-4663 [doi]
- Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeEce Takmaz, Sandro Pezzelle, Lisa Beinborn, Raquel Fernández. 4664-4677 [doi]
- Optimus: Organizing Sentences via Pre-trained Modeling of a Latent SpaceChunyuan Li, Xiang Gao, Yuan Li, Baolin Peng, Xiujun Li, Yizhe Zhang, Jianfeng Gao. 4678-4699 [doi]
- BioMegatron: Larger Biomedical Domain Language ModelHoo-Chang Shin, Yang Zhang, Evelina Bakhturina, Raul Puri, Mostofa Patwary, Mohammad Shoeybi, Raghav Mani. 4700-4706 [doi]
- Text Segmentation by Cross Segment AttentionMichal Lukasik, Boris Dadachev, Kishore Papineni, Gonçalo Simões. 4707-4716 [doi]
- RussianSuperGLUE: A Russian Language Understanding Evaluation BenchmarkTatiana Shavrina, Alena Fenogenova, Anton A. Emelyanov, Denis Shevelev, Ekaterina Artemova, Valentin Malykh, Vladislav Mikhailov, Maria Tikhonova, Andrey Chertok, Andrey Evlampiev. 4717-4726 [doi]
- An Empirical Study of Pre-trained Transformers for Arabic Information ExtractionWuwei Lan, Yang Chen, Wei Xu 0004, Alan Ritter. 4727-4734 [doi]
- TNT: Text Normalization based Pre-training of Transformers for Content ModerationFei Tan, Yifan Hu, Changwei Hu, Keqian Li, Kevin Yen. 4735-4741 [doi]
- Methods for Numeracy-Preserving Word EmbeddingsDhanasekar Sundararaman, Shijing Si, Vivek Subramanian, Guoyin Wang 0002, Devamanyu Hazarika, Lawrence Carin. 4742-4753 [doi]
- An Empirical Investigation of Contextualized Number PredictionTaylor Berg-Kirkpatrick, Daniel Spokoyny. 4754-4764 [doi]
- Modeling the Music Genre Perception across Language-Bound CulturesElena V. Epure, Guillaume Salha, Manuel Moussallam, Romain Hennequin. 4765-4779 [doi]
- Joint Estimation and Analysis of Risk Behavior Ratings in Movie ScriptsVictor R. Martinez, Krishna Somandepalli, Yalda T. Uhls, Shrikanth Narayanan. 4780-4790 [doi]
- Keep it Surprisingly Simple: A Simple First Order Graph Based Parsing Model for Joint Morphosyntactic Parsing in SanskritAmrith Krishna, Ashim Gupta, Deepak Garasangi, Pavankumar Satuluri, Pawan Goyal. 4791-4797 [doi]
- Unsupervised Parsing via Constituency TestsSteven Cao, Nikita Kitaev, Dan Klein. 4798-4808 [doi]
- Please Mind the Root: Decoding Arborescences for Dependency ParsingRan Zmigrod, Tim Vieira, Ryan Cotterell. 4809-4819 [doi]
- Unsupervised Cross-Lingual Part-of-Speech Tagging for Truly Low-Resource ScenariosRamy Eskander, Smaranda Muresan, Michael Collins. 4820-4831 [doi]
- Unsupervised Parsing with S-DIORA: Single Tree Encoding for Deep Inside-Outside Recursive AutoencodersAndrew Drozdov, Subendhu Rongali, Yi Pei Chen, Tim O'Gorman, Mohit Iyyer, Andrew McCallum. 4832-4845 [doi]
- Utility is in the Eye of the User: A Critique of NLP LeaderboardsKawin Ethayarajh, Dan Jurafsky. 4846-4853 [doi]
- An Empirical Investigation Towards Efficient Multi-Domain Language Model Pre-trainingKristjan Arumae, Qing Sun, Parminder Bhatia. 4854-4864 [doi]
- Analyzing Individual Neurons in Pre-trained Language ModelsNadir Durrani, Hassan Sajjad, Fahim Dalvi, Yonatan Belinkov. 4865-4880 [doi]
- Dissecting Span Identification Tasks with Performance PredictionSean Papay, Roman Klinger, Sebastian Padó. 4881-4895 [doi]
- Assessing Phrasal Representation and Composition in TransformersLang Yu, Allyson Ettinger. 4896-4907 [doi]
- Analyzing Redundancy in Pretrained Transformer ModelsFahim Dalvi, Hassan Sajjad, Nadir Durrani, Yonatan Belinkov. 4908-4926 [doi]
- Be More with Less: Hypergraph Attention Networks for Inductive Text ClassificationKaize Ding, Jianling Wang, Jundong Li, Dingcheng Li, Huan Liu 0001. 4927-4936 [doi]
- Entities as Experts: Sparse Memory Access with Entity SupervisionThibault Févry, Livio Baldini Soares, Nicholas FitzGerald, Eunsol Choi, Tom Kwiatkowski. 4937-4951 [doi]
- H2KGAT: Hierarchical Hyperbolic Knowledge Graph Attention NetworkShen Wang, Xiaokai Wei, Cícero Nogueira dos Santos, Zhiguo Wang, Ramesh Nallapati, Andrew Arnold, Bing Xiang, Philip S. Yu. 4952-4962 [doi]
- Does the Objective Matter? Comparing Training Objectives for Pronoun ResolutionYordan Yordanov, Oana-Maria Camburu, Vid Kocijan, Thomas Lukasiewicz. 4963-4969 [doi]
- On Losses for Modern Language ModelsStephane Aroca-Ouellette, Frank Rudzicz. 4970-4981 [doi]
- We Can Detect Your Bias: Predicting the Political Ideology of News ArticlesRamy Baly, Giovanni Da San Martino, James R. Glass, Preslav Nakov. 4982-4991 [doi]
- Semantic Label Smoothing for Sequence to Sequence ProblemsMichal Lukasik, Himanshu Jain, Aditya Krishna Menon, Seungyeon Kim, Srinadh Bhojanapalli, Felix X. Yu, Sanjiv Kumar. 4992-4998 [doi]
- Training for Gibbs Sampling on Conditional Random Fields with Neural Scoring FactorsSida Gao, Matthew R. Gormley. 4999-5011 [doi]
- Multilevel Text Alignment with Cross-Document AttentionXuhui Zhou, Nikolaos Pappas 0002, Noah A. Smith. 5012-5025 [doi]
- Conversational Semantic ParsingArmen Aghajanyan, Jean Maillard, Akshat Shrivastava, Keith Diedrick, Michael Haeger, Haoran Li, Yashar Mehdad, Veselin Stoyanov, Anuj Kumar, Mike Lewis, Sonal Gupta. 5026-5035 [doi]
- Probing Task-Oriented Dialogue Representation from Language ModelsChien-Sheng Wu, Caiming Xiong. 5036-5051 [doi]
- End-to-End Slot Alignment and Recognition for Cross-Lingual NLUWeijia Xu, Batool Haider, Saab Mansour. 5052-5063 [doi]
- Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language InferenceJianguo Zhang 0005, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip S. Yu, Richard Socher, Caiming Xiong. 5064-5082 [doi]
- Simple Data Augmentation with the Mask Token Improves Domain Adaptation for Dialog Act TaggingSemih Yavuz, Kazuma Hashimoto, Wenhao Liu, Nitish Shirish Keskar, Richard Socher, Caiming Xiong. 5083-5089 [doi]
- Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic ParsingXilun Chen, Asish Ghoshal, Yashar Mehdad, Luke Zettlemoyer, Sonal Gupta. 5090-5100 [doi]
- Sound Natural: Content Rephrasing in Dialog SystemsArash Einolghozati, Anchit Gupta, Keith Diedrick, Sonal Gupta. 5101-5108 [doi]
- Zero-Shot Crosslingual Sentence SimplificationJonathan Mallinson, Rico Sennrich, Mirella Lapata. 5109-5126 [doi]
- Facilitating the Communication of Politeness through Fine-Grained ParaphrasingLiye Fu, Susan R. Fussell, Cristian Danescu-Niculescu-Mizil. 5127-5140 [doi]
- CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text GenerationTianlu Wang, Xuezhi Wang 0002, Yao Qin, Ben Packer, Kang Li, Jilin Chen, Alex Beutel, Ed Chi. 5141-5146 [doi]
- Seq2Edits: Sequence Transduction Using Span-level Edit OperationsFelix Stahlberg, Shankar Kumar. 5147-5159 [doi]
- Controllable Meaning Representation to Text Generation: Linearization and Data Augmentation StrategiesChris Kedzie, Kathleen R. McKeown. 5160-5185 [doi]
- Blank Language ModelsTianxiao Shen, Victor Quach, Regina Barzilay, Tommi S. Jaakkola. 5186-5198 [doi]
- COD3S: Diverse Generation with Discrete Semantic SignaturesNathaniel Weir, João Sedoc, Benjamin Van Durme. 5199-5211 [doi]
- Automatic Extraction of Rules Governing Morphological AgreementAditi Chaudhary, Antonios Anastasopoulos, Adithya Pratapa, David R. Mortensen, Zaid Sheikh, Yulia Tsvetkov, Graham Neubig. 5212-5236 [doi]
- Tackling the Low-resource Challenge for Canonical SegmentationManuel Mager, Özlem Çetinoglu, Katharina Kann. 5237-5250 [doi]
- IGT2P: From Interlinear Glossed Texts to ParadigmsSarah Moeller, Ling Liu, Changbing Yang, Katharina Kann, Mans Hulden. 5251-5262 [doi]
- A Computational Approach to Understanding Empathy Expressed in Text-Based Mental Health SupportAshish Sharma 0004, Adam S. Miner, David C. Atkins, Tim Althoff. 5263-5276 [doi]
- Modeling Protagonist Emotions for Emotion-Aware StorytellingFaeze Brahman, Snigdha Chaturvedi. 5277-5294 [doi]
- Help! Need Advice on Identifying AdviceVenkata Subrahmanyan Govindarajan, Benjamin T. Chen, Rebecca Warholic, Katrin Erk, Junyi Jessy Li. 5295-5306 [doi]
- Quantifying Intimacy in LanguageJiaxin Pei, David Jurgens. 5307-5326 [doi]
- Writing Strategies for Science Communication: Data and Computational AnalysisTal August, Lauren Kim, Katharina Reinecke, Noah A. Smith. 5327-5344 [doi]
- Weakly Supervised Subevent Knowledge AcquisitionWenlin Yao, Zeyu Dai, Maitreyi Ramaswamy, Bonan Min, Ruihong Huang. 5345-5356 [doi]
- Biomedical Event Extraction as Sequence LabelingAlan Ramponi, Rob van der Goot, Rosario Lombardo, Barbara Plank. 5357-5367 [doi]
- Annotating Temporal Dependency Graphs via CrowdsourcingJiarui Yao, Haoling Qiu, Bonan Min, Nianwen Xue. 5368-5380 [doi]
- Introducing a New Dataset for Event Detection in Cybersecurity TextsHieu Man Duc Trong, Duc-Trong Le, Amir Pouran Ben Veyseh, Thuat Nguyen, Thien Huu Nguyen. 5381-5390 [doi]
- CHARM: Inferring Personal Attributes from ConversationsAnna Tigunova, Andrew Yates, Paramita Mirza, Gerhard Weikum. 5391-5404 [doi]
- Event Detection: Gate Diversity and Syntactic Importance Scores for Graph Convolution Neural NetworksViet Dac Lai, Tuan Ngo Nguyen, Thien Huu Nguyen. 5405-5411 [doi]
- Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of EventsMiguel Ballesteros, Rishita Anubhai, Shuai Wang, Nima Pourdamghani, Yogarshi Vyas, Jie Ma, Parminder Bhatia, Kathleen R. McKeown, Yaser Al-Onaizan. 5412-5417 [doi]
- How Much Knowledge Can You Pack Into the Parameters of a Language Model?Adam Roberts, Colin Raffel, Noam Shazeer. 5418-5426 [doi]
- EXAMS: A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question AnsweringMomchil Hardalov, Todor Mihaylov, Dimitrina Zlatkova, Yoan Dinkov, Ivan Koychev, Preslav Nakov. 5427-5444 [doi]
- End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering SystemsSiamak Shakeri, Cícero Nogueira dos Santos, Henghui Zhu, Patrick Ng, Feng Nan, Zhiguo Wang, Ramesh Nallapati, Bing Xiang. 5445-5460 [doi]
- Multi-Stage Pre-training for Low-Resource Domain AdaptationRong Zhang, Revanth Gangi Reddy, Md. Arafat Sultan, Vittorio Castelli, Anthony Ferritto, Radu Florian, Efsun Sarioglu Kayi, Salim Roukos, Avirup Sil, Todd Ward. 5461-5468 [doi]
- ISAAQ - Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down AttentionJosé Manuél Gómez-Pérez, Raúl Ortega. 5469-5479 [doi]
- SubjQA: A Dataset for Subjectivity and Review ComprehensionJohannes Bjerva, Nikita Bhutani, Behzad Golshan, Wang Chiew Tan, Isabelle Augenstein. 5480-5494 [doi]
- Widget Captioning: Generating Natural Language Description for Mobile User Interface ElementsYang Li, Gang Li, Luheng He, Jingjie Zheng, Hong Li, Zhiwei Guan. 5495-5510 [doi]
- Unsupervised Natural Language Inference via Decoupled Multimodal Contrastive LearningWanyun Cui, Guangyu Zheng, Wei Wang 0009. 5511-5520 [doi]
- Digital Voicing of Silent SpeechDavid Gaddy, Dan Klein. 5521-5530 [doi]
- Imitation Attacks and Defenses for Black-box Machine Translation SystemsEric Wallace, Mitchell Stern, Dawn Song. 5531-5546 [doi]
- Sequence-Level Mixed Sample Data AugmentationDemi Guo, Yoon Kim, Alexander M. Rush. 5547-5552 [doi]
- Consistency of a Recurrent Language Model With Respect to Incomplete DecodingSean Welleck, Ilia Kulikov, Jaedeok Kim, Richard Yuanzhe Pang, KyungHyun Cho. 5553-5568 [doi]
- An Exploration of Arbitrary-Order Sequence Labeling via Energy-Based Inference NetworksLifu Tu, Tianyu Liu, Kevin Gimpel. 5569-5582 [doi]
- Ensemble Distillation for Structured Prediction: Calibrated, Accurate, Fast - Choose ThreeSteven Reich, David Mueller, Nicholas Andrews. 5583-5595 [doi]
- Inducing Target-Specific Latent Structures for Aspect Sentiment ClassificationChenhua Chen, Zhiyang Teng, Yue Zhang 0004. 5596-5607 [doi]
- Affective Event Classification with Discourse-enhanced Self-trainingYuan Zhuang, Tianyu Jiang, Ellen Riloff. 5608-5617 [doi]
- Deep Weighted MaxSAT for Aspect-based Opinion ExtractionMeixi Wu, Wenya Wang, Sinno Jialin Pan. 5618-5628 [doi]
- Multi-view Story Characterization from Movie Plot Synopses and ReviewsSudipta Kar, Gustavo Aguilar, Mirella Lapata, Thamar Solorio. 5629-5646 [doi]
- Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection EncodingSamson Tan, Shafiq R. Joty, Lav R. Varshney, Min-Yen Kan. 5647-5663 [doi]
- Measuring the Similarity of Grammatical Gender Systems by Comparing PartitionsArya D. McCarthy, Adina Williams, Shijia Liu, David Yarowsky, Ryan Cotterell. 5664-5675 [doi]
- RethinkCWS: Is Chinese Word Segmentation a Solved Task?JinLan Fu, Pengfei Liu 0003, Qi Zhang 0001, Xuanjing Huang. 5676-5686 [doi]
- Learning to Pronounce Chinese Without a Pronunciation DictionaryChristopher Chu, Scot Fang, Kevin Knight. 5687-5693 [doi]
- Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge GraphXin Lv, Xu Han 0007, Lei Hou 0001, Juanzi Li, Zhiyuan Liu 0001, Wei Zhang, Yichi Zhang, Hao Kong, Suhui Wu. 5694-5703 [doi]
- Knowledge Association with Hyperbolic Knowledge Graph EmbeddingsZequn Sun, Muhao Chen, Wei Hu 0007, Chengming Wang, Jian Dai, Wei Zhang. 5704-5716 [doi]
- Domain Knowledge Empowered Structured Neural Net for End-to-End Event Temporal Relation ExtractionRujun Han, Yichao Zhou, Nanyun Peng. 5717-5729 [doi]
- TeMP: Temporal Message Passing for Temporal Knowledge Graph CompletionJiapeng Wu, Meng Cao, Jackie Chi Kit Cheung, William L. Hamilton. 5730-5746 [doi]
- Understanding the Difficulty of Training TransformersLiyuan Liu, Xiaodong Liu, Jianfeng Gao, Weizhu Chen, Jiawei Han 0001. 5747-5763 [doi]
- An Empirical Study of Generation Order for Machine TranslationWilliam Chan, Mitchell Stern, Jamie Kiros, Jakob Uszkoreit. 5764-5773 [doi]
- Inference Strategies for Machine Translation with Conditional MaskingJulia Kreutzer, George F. Foster, Colin Cherry. 5774-5782 [doi]
- AmbigQA: Answering Ambiguous Open-domain QuestionsSewon Min, Julian Michael, Hannaneh Hajishirzi, Luke Zettlemoyer. 5783-5797 [doi]
- Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous SpaceDayiheng Liu, Yeyun Gong, Jie Fu, Yu Yan, Jiusheng Chen, Jiancheng Lv, Nan Duan, Ming Zhou 0001. 5798-5810 [doi]
- Training Question Answering Models From Synthetic DataRaul Puri, Ryan Spring, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro. 5811-5826 [doi]
- Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement LearningYuncheng Hua, Yuan-Fang Li, Gholamreza Haffari, Guilin Qi, Tongtong Wu. 5827-5837 [doi]
- Multilingual Offensive Language Identification with Cross-lingual EmbeddingsTharindu Ranasinghe, Marcos Zampieri. 5838-5844 [doi]
- Solving Historical Dictionary Codes with a Neural Language ModelChristopher Chu, Raphael Valenti, Kevin Knight. 5845-5854 [doi]
- Toward Micro-Dialect Identification in Diaglossic and Code-Switched EnvironmentsMuhammad Abdul-Mageed, Chiyu Zhang, AbdelRahim A. Elmadany, Lyle Ungar. 5855-5876 [doi]
- Investigating African-American Vernacular English in Transformer-Based Text GenerationSophie Groenwold, Lily Ou, Aesha Parekh, Samhita Honnavalli, Sharon Levy, Diba Mirza, William Yang Wang. 5877-5883 [doi]
- Iterative Domain-Repaired Back-TranslationHao-Ran Wei, Zhirui Zhang, Boxing Chen, Weihua Luo. 5884-5893 [doi]
- Dynamic Data Selection and Weighting for Iterative Back-TranslationZi-Yi Dou, Antonios Anastasopoulos, Graham Neubig. 5894-5904 [doi]
- Revisiting Modularized Multilingual NMT to Meet Industrial DemandsSungwon Lyu, Bokyung Son, Kichang Yang, Jaekyoung Bae. 5905-5918 [doi]
- LAReQA: Language-Agnostic Answer Retrieval from a Multilingual PoolUma Roy, Noah Constant, Rami Al-Rfou, Aditya Barua, Aaron Phillips, Yinfei Yang. 5919-5930 [doi]
- OCR Post Correction for Endangered Language TextsShruti Rijhwani, Antonios Anastasopoulos, Graham Neubig. 5931-5942 [doi]
- X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language ModelsZhengbao Jiang, Antonios Anastasopoulos, Jun Araki, Haibo Ding, Graham Neubig. 5943-5959 [doi]
- CCAligned: A Massive Collection of Cross-Lingual Web-Document PairsAhmed El-Kishky, Vishrav Chaudhary, Francisco Guzmán, Philipp Koehn. 5960-5969 [doi]
- Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine TranslationMehrad Moradshahi, Giovanni Campagna, Sina J. Semnani, Silei Xu, Monica S. Lam. 5970-5983 [doi]
- Interactive Refinement of Cross-Lingual Word EmbeddingsMichelle Yuan, Mozhi Zhang, Benjamin Van Durme, Leah Findlater, Jordan L. Boyd-Graber. 5984-5996 [doi]
- Exploiting Sentence Order in Document AlignmentBrian Thompson, Philipp Koehn. 5997-6007 [doi]
- XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and GenerationYaobo Liang, Nan Duan, Yeyun Gong, Ning Wu, Fenfei Guo, Weizhen Qi, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Xiaodong Fan, Ruofei Zhang, Rahul Agrawal, Edward Cui, Sining Wei, Taroon Bharti, Ying Qiao, Jiun-Hung Chen, Winnie Wu, Shuguang Liu, Fan Yang, Daniel Campos, Rangan Majumder, Ming Zhou 0001. 6008-6018 [doi]
- AIN: Fast and Accurate Sequence Labeling with Approximate Inference NetworkXinyu Wang 0013, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu. 6019-6026 [doi]
- HIT: Nested Named Entity Recognition via Head-Tail Pair and Token InteractionYu Wang, Yun Li, Hanghang Tong, Ziye Zhu. 6027-6036 [doi]
- Supertagging Combinatory Categorial Grammar with Attentive Graph Convolutional NetworksYuanhe Tian, Yan Song, Fei Xia. 6037-6044 [doi]
- DAGA: Data Augmentation with a Generation Approach for Low-resource Tagging TasksBosheng Ding, Linlin Liu, Lidong Bing, Canasai Kruengkrai, Thien Hai Nguyen, Shafiq R. Joty, Luo Si, Chunyan Miao. 6045-6057 [doi]
- Interpretable Multi-dataset Evaluation for Named Entity RecognitionJinLan Fu, Pengfei Liu, Graham Neubig. 6058-6069 [doi]
- Adversarial Semantic Decoupling for Recognizing Open-Vocabulary SlotsYuanmeng Yan, Keqing He, Hong Xu, Sihong Liu, Fanyu Meng, Min Hu, Weiran Xu. 6070-6075 [doi]
- Plug and Play Autoencoders for Conditional Text GenerationFlorian Mai, Nikolaos Pappas 0002, Ivan Montero, Noah A. Smith, James Henderson. 6076-6092 [doi]
- Structure Aware Negative Sampling in Knowledge GraphsKian Ahrabian, Aarash Feizi, Yasmin Salehi, William L. Hamilton, Avishek Joey Bose. 6093-6101 [doi]
- Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model AdaptationMinki Kang, Moonsu Han, Sung Ju Hwang. 6102-6120 [doi]
- Autoregressive Knowledge Distillation through Imitation LearningAlexander Lin, Jeremy Wohlwend, Howard Chen, Tao Lei 0001. 6121-6133 [doi]
- T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted AttackBoxin Wang, Hengzhi Pei, Boyuan Pan, Qian Chen, Shuohang Wang, Bo Li 0026. 6134-6150 [doi]
- Structured Pruning of Large Language ModelsZiheng Wang, Jeremy Wohlwend, Tao Lei 0001. 6151-6162 [doi]
- Effective Unsupervised Domain Adaptation with Adversarially Trained Language ModelsThuy-Trang Vu, Dinh Phung, Gholamreza Haffari. 6163-6173 [doi]
- BAE: BERT-based Adversarial Examples for Text ClassificationSiddhant Garg, Goutham Ramakrishnan. 6174-6181 [doi]
- Adversarial Self-Supervised Data-Free Distillation for Text ClassificationXinyin Ma, Yongliang Shen, Gongfan Fang, Chen Chen, Chenghao Jia, Weiming Lu 0001. 6182-6192 [doi]
- BERT-ATTACK: Adversarial Attack Against BERT Using BERTLinyang Li, Ruotian Ma, Qipeng Guo, Xiangyang Xue, Xipeng Qiu. 6193-6202 [doi]
- The Thieves on Sesame Street are Polyglots - Extracting Multilingual Models from Monolingual APIsNitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher. 6203-6207 [doi]
- When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional ModelsChanglong Yu, Jialong Han, PeiFeng Wang, Yangqiu Song, Hongming Zhang, Wilfred Ng, Shuming Shi 0001. 6208-6217 [doi]
- Interpreting Open-Domain Modifiers: Decomposition of Wikipedia Categories into Disambiguated Property-Value PairsMarius Pasca. 6218-6228 [doi]
- A Synset Relation-enhanced Framework with a Try-again Mechanism for Word Sense DisambiguationMing Wang, Yinglin Wang. 6229-6240 [doi]
- Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline GenerationDayiheng Liu, Yeyun Gong, Yu Yan, Jie Fu, Bo Shao, Daxin Jiang, Jiancheng Lv, Nan Duan. 6241-6250 [doi]
- Factual Error Correction for Abstractive Summarization ModelsMeng Cao, Yue Dong, Jiapeng Wu, Jackie Chi Kit Cheung. 6251-6258 [doi]
- Compressive Summarization with Plausibility and Salience ModelingShrey Desai, Jiacheng Xu, Greg Durrett. 6259-6274 [doi]
- Understanding Neural Abstractive Summarization Models via UncertaintyJiacheng Xu, Shrey Desai, Greg Durrett. 6275-6281 [doi]
- Better Highlighting: Creating Sub-Sentence Summary HighlightsSangwoo Cho, Kaiqiang Song, Chen Li, Dong Yu, Hassan Foroosh, Fei Liu 0004. 6282-6300 [doi]
- Summarizing Text on Any Aspects: A Knowledge-Informed Weakly-Supervised ApproachBowen Tan, Lianhui Qin, Eric P. Xing, Zhiting Hu. 6301-6309 [doi]
- BERT-enhanced Relational Sentence Ordering NetworkBaiyun Cui, Yingming Li, Zhongfei Zhang. 6310-6320 [doi]
- Online Conversation Disentanglement with Pointer NetworksTao Yu, Shafiq Joty. 6321-6330 [doi]
- VCDM: Leveraging Variational Bi-encoding and Deep Contextualized Word Representations for Improved Definition ModelingMachel Reid, Edison Marrese-Taylor, Yutaka Matsuo. 6331-6344 [doi]
- Coarse-to-Fine Pre-training for Named Entity RecognitionMengge Xue, Bowen Yu 0002, Zhenyu Zhang 0006, Tingwen Liu, Yue Zhang, Bin Wang 0004. 6345-6354 [doi]
- Exploring and Evaluating Attributes, Values, and Structures for Entity AlignmentZhiyuan Liu 0001, Yixin Cao 0002, Liangming Pan, Juanzi Li, Tat-Seng Chua. 6355-6364 [doi]
- Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor LearningYi Yang, Arzoo Katiyar. 6365-6375 [doi]
- Learning Structured Representations of Entity Names using Active Learning and Weak SupervisionKun Qian 0002, Poornima Chozhiyath Raman, Yunyao Li 0001, Lucian Popa 0001. 6376-6383 [doi]
- Entity Enhanced BERT Pre-training for Chinese NERChen Jia, Yuefeng Shi, Qinrong Yang, Yue Zhang. 6384-6396 [doi]
- Scalable Zero-shot Entity Linking with Dense Entity RetrievalLedell Wu, Fabio Petroni, Martin Josifoski, Sebastian Riedel 0001, Luke Zettlemoyer. 6397-6407 [doi]
- A Dataset for Tracking Entities in Open Domain Procedural TextNiket Tandon, Keisuke Sakaguchi, Bhavana Dalvi, Dheeraj Rajagopal, Peter Clark, Michal Guerquin, Kyle Richardson, Eduard H. Hovy. 6408-6417 [doi]
- Design Challenges in Low-resource Cross-lingual Entity LinkingXingyu Fu, Weijia Shi, Xiaodong Yu, Zian Zhao, Dan Roth. 6418-6432 [doi]
- Efficient One-Pass End-to-End Entity Linking for QuestionsBelinda Z. Li, Sewon Min, Srinivasan Iyer, Yashar Mehdad, Wen-tau Yih. 6433-6441 [doi]
- LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attentionIkuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda 0001, Yuji Matsumoto 0001. 6442-6454 [doi]
- Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile GenerationTuhin Chakrabarty, Smaranda Muresan, Nanyun Peng. 6455-6469 [doi]
- STORIUM: A Dataset and Evaluation Platform for Machine-in-the-Loop Story GenerationNader Akoury, Shufan Wang, Josh Whiting, Stephen Hood, Nanyun Peng, Mohit Iyyer. 6470-6484 [doi]
- Substance over Style: Document-Level Targeted Content TransferAllison Hegel, Sudha Rao, Asli Çelikyilmaz, Bill Dolan. 6485-6504 [doi]
- Template Guided Text Generation for Task-Oriented DialogueMihir Kale, Abhinav Rastogi. 6505-6520 [doi]
- MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension MetricsAnthony Chen, Gabriel Stanovsky, Sameer Singh 0001, Matt Gardner 0001. 6521-6532 [doi]
- Plan ahead: Self-Supervised Text Planning for Paragraph Completion TaskDongyeop Kang, Eduard H. Hovy. 6533-6543 [doi]
- Inquisitive Question Generation for High Level Text ComprehensionWei-Jen Ko, Te-Yuan Chen, Yiyan Huang, Greg Durrett, Junyi Jessy Li. 6544-6555 [doi]
- Towards Persona-Based Empathetic Conversational ModelsPeixiang Zhong, Chen Zhang, Hao Wang, Yong Liu 0020, Chunyan Miao. 6556-6566 [doi]
- Personal Information Leakage Detection in ConversationsQiongkai Xu, Lizhen Qu, Zeyu Gao, Gholamreza Haffari. 6567-6580 [doi]
- Response Selection for Multi-Party Conversations with Dynamic Topic TrackingWeishi Wang, Steven C. H. Hoi, Shafiq R. Joty. 6581-6591 [doi]
- Regularizing Dialogue Generation by Imitating Implicit ScenariosShaoxiong Feng, Xuancheng Ren, Hongshen Chen, Bin Sun, Kan Li 0001, Xu Sun 0001. 6592-6604 [doi]
- MovieChats: Chat like Humans in a Closed DomainHui Su, Xiaoyu Shen 0001, Xiao Zhou, Zheng Zhang, Ernie Chang, Cheng Zhang, Cheng Niu, Jie Zhou 0016. 6605-6619 [doi]
- Conundrums in Entity Coreference Resolution: Making Sense of the State of the ArtJing Lu, Vincent Ng. 6620-6631 [doi]
- Semantic Role Labeling Guided Multi-turn Dialogue ReWriterKun Xu, Haochen Tan, Linfeng Song, Han Wu, Haisong Zhang, Linqi Song, Dong Yu 0001. 6632-6639 [doi]
- Continuity of Topic, Interaction, and Query: Learning to Quote in Online ConversationsLingzhi Wang, Jing Li 0049, Xingshan Zeng, Haisong Zhang, Kam-Fai Wong. 6640-6650 [doi]
- Profile Consistency Identification for Open-domain Dialogue AgentsHaoyu Song 0002, Yan Wang 0060, Wei-Nan Zhang 0003, Zhengyu Zhao 0003, Ting Liu 0001, Xiaojiang Liu. 6651-6662 [doi]
- An Element-aware Multi-representation Model for Law Article PredictionHuilin Zhong, Junsheng Zhou, Weiguang Qu, Yunfei Long, Yanhui Gu. 6663-6668 [doi]
- Recurrent Event Network: Autoregressive Structure Inference over Temporal Knowledge GraphsWoojeong Jin, Meng Qu, Xisen Jin, Xiang Ren. 6669-6683 [doi]
- Multi-resolution Annotations for Emoji PredictionWeicheng Ma, Ruibo Liu, Lili Wang, Soroush Vosoughi. 6684-6694 [doi]
- Less is More: Attention Supervision with Counterfactuals for Text ClassificationSeungtaek Choi, Haeju Park, Jinyoung Yeo, Seung-won Hwang. 6695-6704 [doi]
- MODE-LSTM: A Parameter-efficient Recurrent Network with Multi-Scale for Sentence ClassificationQianli Ma, Zhenxi Lin, Jiangyue Yan, Zipeng Chen, Liuhong Yu. 6705-6715 [doi]
- HSCNN: A Hybrid-Siamese Convolutional Neural Network for Extremely Imbalanced Multi-label Text ClassificationWenshuo Yang, Jiyi Li, Fumiyo Fukumoto, Yanming Ye. 6716-6722 [doi]
- Multi-Stage Pre-training for Automated Chinese Essay ScoringWei Song, Kai Zhang, Ruiji Fu, Lizhen Liu, Ting Liu 0001, MiaoMiao Cheng. 6723-6733 [doi]
- Multi-hop Inference for Question-driven SummarizationYang Deng, Wenxuan Zhang, Wai Lam. 6734-6744 [doi]
- Towards Interpretable Reasoning over Paragraph Effects in SituationMucheng Ren, Xiubo Geng, Tao Qin, Heyan Huang, Daxin Jiang. 6745-6758 [doi]
- Question Directed Graph Attention Network for Numerical Reasoning over TextKunlong Chen, Weidi Xu, Xingyi Cheng, Zou Xiaochuan, Yuyu Zhang, Le Song, Taifeng Wang, Yuan Qi, Wei Chu. 6759-6768 [doi]
- Dense Passage Retrieval for Open-Domain Question AnsweringVladimir Karpukhin, Barlas Oguz, Sewon Min, Patrick S. H. Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih. 6769-6781 [doi]
- Distilling Structured Knowledge for Text-Based Relational ReasoningJin Dong, Marc-Antoine Rondeau, William L. Hamilton. 6782-6791 [doi]
- Asking without Telling: Exploring Latent Ontologies in Contextual RepresentationsJulian Michael, Jan A. Botha, Ian Tenney. 6792-6812 [doi]
- Pretrained Language Model Embryology: The Birth of ALBERTDavid Cheng-Han Chiang, Sung-Feng Huang, Hung-yi Lee. 6813-6828 [doi]
- Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language ModelsIsabel Papadimitriou, Dan