Abstract is missing.
- Structured Pruning for Large Language Models Using Coupled Components Elimination and Minor Fine-tuningHonghe Zhang, Xiaolong Shi, Jingwei Sun, Guangzhong Sun. 1-12 [doi]
- Weight-Inherited Distillation for Task-Agnostic BERT CompressionTaiqiang Wu, Cheng Hou, Shanshan Lao, Jiayi Li, Ngai Wong, Zhe Zhao 0006, Yujiu Yang. 13-28 [doi]
- Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity DomainEugene Jang, Jian Cui, Dayeon Yim, Youngjin Jin, Jin-Woo Chung, Seungwon Shin, Yongjae Lee. 29-42 [doi]
- Extremely efficient online query encoding for dense retrievalNachshon Cohen, Yaron Fairstein, Guy Kushilevitz. 43-50 [doi]
- DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and TextWenting Zhao 0006, Ye Liu 0006, Tong Niu, Yao Wan 0001, Philip S. Yu, Shafiq Joty, Yingbo Zhou, Semih Yavuz. 51-68 [doi]
- SpeedE: Euclidean Geometric Knowledge Graph Embedding Strikes BackAleksandar Pavlovic 0002, Emanuel Sallinger. 69-92 [doi]
- Language Guided Exploration for RL Agents in Text EnvironmentsHitesh Golchha, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan. 93-102 [doi]
- GPT-who: An Information Density-based Machine-Generated Text DetectorSaranya Venkatraman, Adaku Uchendu, Dongwon Lee 0001. 103-115 [doi]
- DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer ModelsPeng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha. 116-131 [doi]
- Attention Alignment and Flexible Positional Embeddings Improve Transformer Length ExtrapolationTa-Chung Chi, Ting-Han Fan, Alexander Rudnicky. 132-148 [doi]
- Automatic Pair Construction for Contrastive Post-trainingCanwen Xu, Corby Rosset, Ethan C. Chau, Luciano Del Corro, Shweti Mahajan, Julian J. McAuley, Jennifer Neville, Ahmed Awadallah 0001, Nikhil Rao 0001. 149-162 [doi]
- Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language ModelsMiaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao 0001, Zhu Zhang. 163-181 [doi]
- Low-resource neural machine translation with morphological modelingAntoine Nzeyimana. 182-195 [doi]
- Self-Cleaning: Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean InstancesZhendong Chu, Ruiyi Zhang, Tong Yu 0001, Rajiv Jain, Vlad I. Morariu, Jiuxiang Gu, Ani Nenkova. 196-210 [doi]
- VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language UnderstandingPhong Do, Son Tran, Phu Hoang, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen. 211-222 [doi]
- LETI: Learning to Generate from Textual InteractionsXingyao Wang 0002, Hao Peng 0009, Reyhaneh Jabbarvand, Heng Ji. 223-239 [doi]
- Bilateral Masking with prompt for Knowledge Graph CompletionYonghui Kong, Cunhang Fan, Yujie Chen, Shuai Zhang 0014, Zhao Lv, Jianhua Tao 0001. 240-249 [doi]
- MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language ModelsZhenpeng Su, Zijia Lin, Bai Xue, Hui Chen, Guiguang Ding, Wei Zhou, Songlin Hu. 250-262 [doi]
- GOLD: Geometry Problem Solver with Natural Language DescriptionJiaxin Zhang, Yashar Moshfeghi. 263-278 [doi]
- RoDia: A New Dataset for Romanian Dialect Identification from SpeechRotaru Codrut, Nicolae-Catalin Ristea, Radu-Tudor Ionescu. 279-286 [doi]
- Examining Modularity in Multilingual LMs via Language-Specialized SubnetworksRochelle Choenni, Ekaterina Shutova, Dan Garrette. 287-301 [doi]
- Reverse Chain: A Generic-Rule for LLMs to Master Multi-API PlanningYinger Zhang, Hui Cai, Xierui Song, Yicheng Chen, Rui Sun, Jing Zheng. 302-325 [doi]
- Incorporating Exponential Smoothing into MLP: a Simple but Effective Sequence ModelJiqun Chu, Zuoquan Lin. 326-337 [doi]
- OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation ModelsYuxuan Kuang, Hai Lin, Meng Jiang. 338-351 [doi]
- Comparing Two Model Designs for Clinical Note Generation; Is an LLM a Useful Evaluator of Consistency?Nathan Brake, Thomas Schaaf. 352-363 [doi]
- VOLTA: Improving Generative Diversity by Variational Mutual Information Maximizing AutoencoderYueen Ma, Dafeng Chi, Jingjing Li 0007, Kai Song, Yuzheng Zhuang, Irwin King. 364-378 [doi]
- EcoSpeak: Cost-Efficient Bias Mitigation for Partially Cross-Lingual Speaker VerificationDivya Sharma. 379-394 [doi]
- Leveraging Contextual Information for Effective Entity Salience DetectionRajarshi Bhowmik, Marco Ponza, Atharva Tendle, Anant Gupta, Rebecca Jiang, Xingyu Lu, Qian Zhao, Daniel Preotiuc-Pietro. 395-408 [doi]
- LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?Qihui Zhang, Chujie Gao, Dongping Chen, Yue Huang, Yixin Huang, Zhenyang Sun, Shilin Zhang, Weiye Li, Zhengyan Fu, Yao Wan 0001, Lichao Sun 0001. 409-436 [doi]
- A (More) Realistic Evaluation Setup for Generalisation of Community Models on Malicious Content DetectionIvo Verhoeven, Pushkar Mishra, Rahel Beloch, Helen Yannakoudakis, Ekaterina Shutova. 437-463 [doi]
- Citation: A Key to Building Responsible and Accountable Large Language ModelsJie Huang 0009, Kevin Chang 0001. 464-473 [doi]
- Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncodersYingji Zhang, Marco Valentino, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas. 474-489 [doi]
- Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching StylesWeiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen. 490-502 [doi]
- Which Modality should I use - Text, Motif, or Image? : Understanding Graphs with Large Language ModelsDebarati Das 0004, Ishaan Gupta, Jaideep Srivastava, Dongyeop Kang. 503-519 [doi]
- On-the-Fly Fusion of Large Language Models and Machine TranslationHieu Hoang, Huda Khayrallah, Marcin Junczys-Dowmunt. 520-532 [doi]
- READ: Improving Relation Extraction from an ADversarial PerspectiveDawei Li, William Hogan, Jingbo Shang. 533-548 [doi]
- REQUAL-LM: Reliability and Equity through Aggregation in Large Language ModelsSana Ebrahimi, Nima Shahbazi, Abolfazl Asudeh. 549-560 [doi]
- Addressing Both Statistical and Causal Gender Fairness in NLP ModelsHannah Chen, Yangfeng Ji, David Evans 0001. 561-582 [doi]
- LLM-Rec: Personalized Recommendation via Prompting Large Language ModelsHanjia Lyu, Song Jiang, Hanqing Zeng, Yinglong Xia, Qifan Wang, Si Zhang, Ren Chen, Christopher Leung, Jiajie Tang, Jiebo Luo. 583-612 [doi]
- A Robust Semantics-based Watermark for Large Language Model against ParaphrasingJie Ren 0019, Han Xu 0002, Yiding Liu, Yingqian Cui, Shuaiqiang Wang, Dawei Yin, Jiliang Tang. 613-625 [doi]
- Solving Data-centric Tasks using Large Language ModelsShraddha Barke, Christian Pölitz, Carina Negreanu, Benjamin Zorn 0001, José Cambronero, Andrew D. Gordon 0001, Vu Le 0002, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams 0001. 626-638 [doi]
- A Novel Paradigm Boosting Translation Capabilities of Large Language ModelsJiaxin Guo, Hao Yang 0006, Zongyao Li, Daimeng Wei, Hengchao Shang, Xiaoyu Chen 0004. 639-649 [doi]
- Measuring Social Norms of Large Language ModelsYe Yuan, Kexin Tang, Jianhao Shen, Ming Zhang 0004, Chenguang Wang. 650-699 [doi]
- Source-Free Unsupervised Domain Adaptation for Question Answering via Prompt-Assisted Self-learningMaxwell Yin, Boyu Wang 0004, Charles Ling 0001. 700-713 [doi]
- Hierarchical Attention Graph for Scientific Document Summarization in Global and Local LevelChenlong Zhao, Xiwen Zhou, Xiaopeng Xie, Yong Zhang. 714-726 [doi]
- LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systemsNalin Kumar, Ondrej Dusek. 727-735 [doi]
- Efficient Dependency Tree Sampling Without ReplacementBogdan Dobre. 736-741 [doi]
- Towards Better Generalization in Open-Domain Question Answering by Mitigating Context MemorizationZixuan Zhang, Revanth Gangi Reddy, Kevin Small, Tong Zhang 0001, Heng Ji. 742-753 [doi]
- GEE! Grammar Error Explanation with Large Language ModelsYixiao Song, Kalpesh Krishna, Rajesh Bhatt, Kevin Gimpel, Mohit Iyyer. 754-781 [doi]
- AdaRefiner: Refining Decisions of Language Models with Adaptive FeedbackWanpeng Zhang 0002, Zongqing Lu. 782-799 [doi]
- DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue RepresentationsWeihao Zeng, Dayuan Fu, Keqing He 0001, Yejie Wang, Yukai Xu, Weiran Xu. 800-813 [doi]
- Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional TrainingPavel Denisov, Thang Vu. 814-834 [doi]
- CLEAN-EVAL: Clean Evaluation on Contaminated Large Language ModelsWenhong Zhu, Hongkun Hao, Zhiwei He 0002, Yunze Song, Jiao Yueyang, Yumeng Zhang, Hanxu Hu, Yiran Wei, Rui Wang 0015, Hongyuan Lu. 835-847 [doi]
- R-BASS : Relevance-aided Block-wise Adaptation for Speech SummarizationRoshan Sharma 0001, Ruchira Sharma, Hira Dhamyal, Rita Singh, Bhiksha Raj. 848-857 [doi]
- OVM, Outcome-supervised Value Models for Planning in Mathematical ReasoningFei Yu, Anningzhe Gao, Benyou Wang. 858-875 [doi]
- The Whole is Better than the Sum: Using Aggregated Demonstrations in In-Context Learning for Sequential RecommendationLei Wang, Ee-Peng Lim. 876-895 [doi]
- Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQADhruv Agarwal 0003, Rajarshi Das, Sopan Khosla, Rashmi Gangadharaiah. 896-919 [doi]
- GraSAME: Injecting Token-Level Structural Information to Pretrained Language Models via Graph-guided Self-Attention MechanismShuzhou Yuan, Michael Färber 0001. 920-933 [doi]
- Can Public Large Language Models Help Private Cross-device Federated Learning?Boxin Wang, Yibo Zhang, Yuan Cao, Bo Li 0026, Hugh McMahan, Sewoong Oh, Zheng Xu 0002, Manzil Zaheer. 934-949 [doi]
- LangNav: Language as a Perceptual Representation for NavigationBowen Pan, Rameswar Panda, SouYoung Jin, Rogério Feris, Aude Oliva, Phillip Isola, Yoon Kim. 950-974 [doi]
- Planning and Editing What You Retrieve for Enhanced Tool LearningTenghao Huang, Dongwon Jung, Vaibhav Kumar, Mohammad Kachuee, Xiang Li, Puyang Xu, Muhao Chen. 975-988 [doi]
- Chart-based Reasoning: Transferring Capabilities from LLMs to VLMsVictor Carbune, Hassan Mansoor, Fangyu Liu, Rahul Aralikatte, Gilles Baechler, Jindong Chen, Abhanshu Sharma. 989-1004 [doi]
- SLiM: Speculative Decoding with Hypothesis ReductionChi-Heng Lin, Shikhar Tuli, James Seale Smith, Yen-Chang Hsu, Yilin Shen, Hongxia Jin. 1005-1017 [doi]
- REMATCH: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic SimilarityZoher Kachwala, Jisun An, Haewoon Kwak, Filippo Menczer. 1018-1028 [doi]
- Modeling the Sacred: Considerations when Using Religious Texts in Natural Language ProcessingBen Hutchinson. 1029-1043 [doi]
- Testing the Effect of Code Documentation on Large Language Model Code UnderstandingWilliam Macke, Michael Doyle. 1044-1050 [doi]
- Aligning Large Language Models with Recommendation KnowledgeYuwei Cao, Nikhil Mehta 0002, Xinyang Yi, Raghunandan Hulikal Keshavan, Lukasz Heldt, Lichan Hong, Ed H. Chi, Maheswaran Sathiamoorthy. 1051-1066 [doi]
- OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued PretrainingYihong Liu, Peiqin Lin, Mingyang Wang, Hinrich Schütze. 1067-1097 [doi]
- SELF-EXPERTISE: Knowledge-based Instruction Dataset Augmentation for a Legal Expert Language ModelMinju Kim, Haein Jung, Myoung-Wan Koo. 1098-1112 [doi]
- Re-evaluating the Need for Visual Signals in Unsupervised Grammar InductionBoyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge J. Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein. 1113-1123 [doi]
- EDEntail: An Entailment-based Few-shot Text Classification with Extensional DefinitionZixiao Zhu, Junlang Qian, Zijian Feng, Hanzhang Zhou, Kezhi Mao. 1124-1137 [doi]
- What Makes Math Word Problems Challenging for LLMs?Kv Aditya Srivatsa, Ekaterina Kochmar. 1138-1148 [doi]
- SMILE: Multimodal Dataset for Understanding Laughter in Video with Language ModelsLee Hyun, Kim Sung-Bin, Seungju Han, Youngjae Yu, Tae Hyun Oh. 1149-1167 [doi]
- T3M: Text Guided 3D Human Motion Synthesis from SpeechWenshuo Peng, Kaipeng Zhang, Sai Qian Zhang. 1168-1177 [doi]
- Deja vu: Contrastive Historical Modeling with Prefix-tuning for Temporal Knowledge Graph ReasoningMiao Peng, Ben Liu, Wenjie Xu, Zihao Jiang 0009, Jiahui Zhu, Min Peng. 1178-1191 [doi]
- Explanation Extraction from Hierarchical Classification Frameworks for Long Legal DocumentsNishchal Prasad, Taoufiq Dkaki, Mohand Boughanem. 1192-1201 [doi]
- Low-Rank Adaptation for Multilingual Summarization: An Empirical StudyChenxi Whitehouse, Fantine Huot, Jasmijn Bastings, Mostafa Dehghani 0001, Chu-Cheng Lin, Mirella Lapata. 1202-1228 [doi]
- A Tree-of-Thoughts to Broaden Multi-step Reasoning across LanguagesLeonardo Ranaldi, Giulia Pucci, Federico Ranaldi, Elena Sofia Ruzzetti, Fabio Massimo Zanzotto. 1229-1241 [doi]
- Emergent Abilities in Reduced-Scale Generative Language ModelsSherin Muckatira, Vijeta Deshpande, Vladislav Lialin, Anna Rumshisky. 1242-1257 [doi]
- Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue SystemsClemencia Siro, Mohammad Aliannejadi, Maarten de Rijke. 1258-1273 [doi]
- Matching Varying-Length Texts via Topic-Informed and Decoupled Sentence EmbeddingsXixi Zhou, Chunbin Gu, Xin Jie, Jiajun Bu, Haishuai Wang. 1274-1280 [doi]
- Instruction Tuning with Human CurriculumBruce W. Lee, Hyunsoo Cho, Kang Min Yoo. 1281-1309 [doi]
- Natural Language-based State Representation in Deep Reinforcement LearningMd Masudur Rahman, Yexiang Xue. 1310-1319 [doi]
- Learning Cross-Architecture Instruction Embeddings for Binary Code Analysis in Low-Resource ArchitecturesJunzhe Wang, Qiang Zeng, Lannan Luo. 1320-1332 [doi]
- ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial AttacksXiaodong Yu 0003, Hao Cheng 0002, Xiaodong Liu, Dan Roth, Jianfeng Gao 0001. 1333-1351 [doi]
- An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced DistributionTien-Hong Lo, Fu-An Chao, Tzu-I Wu, Yao-Ting Sung, Berlin Chen. 1352-1362 [doi]
- GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and BeyondShen Zheng 0001, Yuyu Zhang, Yijie Zhu, Chenguang Xi, Pengyang Gao, Zhou Xun, Kevin Chang 0001. 1363-1382 [doi]
- Subword Attention and Post-Processing for Rare and Unknown Contextualized EmbeddingsRaj Patel, Carlotta Domeniconi. 1383-1389 [doi]
- UGIF-DataSet: A New Dataset for Cross-lingual, Cross-modal Sequential actions on the UISagar Gubbi Venkatesh, Partha Talukdar, Srini Narayanan. 1390-1399 [doi]
- SimSCOOD: Systematic Analysis of Out-of-Distribution Generalization in Fine-tuned Source Code ModelsHossein Hajipour, Ning Yu, Cristian-Alexandru Staicu, Mario Fritz. 1400-1416 [doi]
- Pruning as a Domain-specific LLM ExtractorNan Zhang, Yanchi Liu, Xujiang Zhao, Wei Cheng 0002, Runxue Bao, Rui Zhang, Prasenjit Mitra, Haifeng Chen. 1417-1428 [doi]
- LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable FeedbackWenda Xu, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Biao Zhang, Zhongtao Liu, William Yang Wang, Lei Li 0005, Markus Freitag. 1429-1445 [doi]
- Noisy Multi-Label Text Classification via Instance-Label Pair CorrectionPengyu Xu, MingYang Song, Linkaida Liu, Bing Liu, Hongjian Sun, Liping Jing, Jian Yu. 1446-1458 [doi]
- Composite Backdoor Attacks Against Large Language ModelsHai Huang, Zhengyu Zhao 0001, Michael Backes 0001, Yun Shen, Yang Zhang 0016. 1459-1472 [doi]
- Adapting Fake News Detection to the Era of Large Language ModelsJinyan Su, Claire Cardie, Preslav Nakov. 1473-1490 [doi]
- MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrievalYoubo Lei, Feifei He, Chen Chen 0015, Yingbin Mo, Sijia Li, Defeng Xie, Haonan Lu. 1491-1503 [doi]
- Large Language Models are Effective Text Rankers with Pairwise Ranking PromptingZhen Qin 0001, Rolf Jagerman, Kai Hui 0001, Honglei Zhuang, Junru Wu, Le Yan, Jiaming Shen, Tianqi Liu 0002, Jialu Liu, Donald Metzler, Xuanhui Wang, Michael Bendersky. 1504-1518 [doi]
- FedLFC: Towards Efficient Federated Multilingual Modeling with LoRA-based Language Family ClusteringZhihan Guo, Yifei Zhang, Zhuo Zhang 0007, Zenglin Xu, Irwin King. 1519-1528 [doi]
- Gaussian Process Optimization for Adaptable Multi-Objective Text Generation using Linearly-Weighted Language ModelsMohammad Mahdi Abdollah Pour, Ali Pesaranghader, Eldan Cohen, Scott Sanner. 1529-1536 [doi]
- Groundedness in Retrieval-augmented Long-form Generation: An Empirical StudyAlessandro Stolfo. 1537-1552 [doi]
- TagDebias: Entity and Concept Tagging for Social Bias Mitigation in Pretrained Language ModelsMehrnaz Moslemi, Amal Zouaq. 1553-1567 [doi]
- Improving Absent Keyphrase Generation with Diversity HeadsEdwin Thomas, Sowmya Vajjala. 1568-1584 [doi]
- mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models?Tianze Hua, Tian Yun, Ellie Pavlick. 1585-1598 [doi]
- Discovering and Mitigating Indirect Bias in Attention-Based Model ExplanationsFarsheed Haque, Depeng Xu, Shuhan Yuan. 1599-1614 [doi]
- i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech DataZiyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu 0001, Dongdong Chen 0001, Yao Qian, Xuemei Gao, Yi-ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao 0004, Yu Shi 0001, Lu Yuan, Takuya Yoshioka, Michael Zeng 0001, Xuedong Huang 0001. 1615-1627 [doi]
- Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text GenerationYifu Qiu, Varun Embar, Shay B. Cohen, Benjamin Han. 1628-1644 [doi]
- It's All Relative! - A Synthetic Query Generation Approach for Improving Zero-Shot Relevance PredictionAditi Chaudhary, Karthik Raman 0001, Michael Bendersky. 1645-1664 [doi]
- RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language ModelsSaeed Khaki, Jinjin Li, Lan Ma, Liu Yang, Prathap Ramachandra. 1665-1680 [doi]
- Hypernetwork-Assisted Parameter-Efficient Fine-Tuning with Meta-Knowledge Distillation for Domain Knowledge DisentanglementChangqun Li, Linlin Wang, Xin Lin, Shizhou Huang, Liang He. 1681-1695 [doi]
- MICo: Preventative Detoxification of Large Language Models through Inhibition ControlRoy Siegelmann, Ninareh Mehrabi, Palash Goyal, Prasoon Goyal, Lisa Bauer, Jwala Dhamala, Aram Galstyan, Rahul Gupta 0001, Reza Ghanadan. 1696-1703 [doi]
- Reinforcement Learning with Token-level Feedback for Controllable Text GenerationWendi Li, Wei Wei 0002, Kaihe Xu, Wenfeng Xie, Dangyang Chen, Yu Cheng 0001. 1704-1719 [doi]
- CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem SolvingPei Chen, Shuai Zhang, Boran Han. 1720-1738 [doi]
- Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language TechnologiesAnaelia Ovalle, Ninareh Mehrabi, Palash Goyal, Jwala Dhamala, Kai-Wei Chang, Richard S. Zemel, Aram Galstyan, Yuval Pinter, Rahul Gupta 0001. 1739-1756 [doi]
- AdaPT: A Set of Guidelines for Hyperbolic Multimodal Multilingual NLPRamit Sawhney, Shrey Pandit, Vishwa Shah, Megh Thakkar, Shafiq Joty. 1757-1771 [doi]
- More Samples or More Prompts? Exploring Effective Few-Shot In-Context Learning for LLMs with In-Context SamplingBingsheng Yao, Guiming Chen, Ruishi Zou, Yuxuan Lu 0003, Jiachen Li, Shao Zhang, Yisi Sang, Sijia Liu 0001, James A. Hendler, Dakuo Wang. 1772-1790 [doi]
- ZSEE: A Dataset based on Zeolite Synthesis Event Extraction for Automated Synthesis PlatformSong He, Xin Peng, Yihan Cai, Xin Li, Zhiqing Yuan, Wenli Du, Weimin Yang. 1791-1808 [doi]
- Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual InformationKyubyung Chae, Jaepill Choi, Yohan Jo, Taesup Kim. 1809-1820 [doi]
- Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue AgentsSan Kim, Gary Lee. 1821-1835 [doi]
- Prompt Space Optimizing Few-shot Reasoning Success with Large Language ModelsFobo Shi, Peijun Qing, Dong Yang, Nan Wang, Youbo Lei, Haonan Lu, Xiaodong Lin 0004, Duantengchuan Li. 1836-1862 [doi]
- DAGCN: Distance-based and Aspect-oriented Graph Convolutional Network for Aspect-based Sentiment AnalysisZhiHao Wang, Bo Zhang 0004, Ru Yang, Chang Guo, Maozhen Li 0001. 1863-1876 [doi]
- Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase GraphsZhuoyi Peng, Yi Yang. 1877-1890 [doi]
- Self-Regulated Sample Diversity in Large Language ModelsMingyue Liu, Jonathan Frawley, Sarah Wyer, Hubert P. H. Shum, Sara L. Uckelman, Sue Black 0001, Chris G. Willcocks. 1891-1899 [doi]
- Methods, Applications, and Directions of Learning-to-Rank in NLP ResearchJustin Lee, Gabriel Bernier-Colborne, Tegan Maharaj, Sowmya Vajjala. 1900-1917 [doi]
- When Quantization Affects Confidence of Large Language Models?Irina Proskurina, Luc Brun, Guillaume Metzler, Julien Velcin. 1918-1928 [doi]
- MedCycle: Unpaired Medical Report Generation via Cycle-ConsistencyElad Hirsch, Gefen Dawidowicz, Ayellet Tal. 1929-1944 [doi]
- Beta-LR: Interpretable Logical Reasoning based on Beta DistributionYizhuo Ma, Ke Qin, Shuang Liang. 1945-1955 [doi]
- Applications of BERT Models Towards Automation of Clinical Coding in IcelandicHaraldur Orri Hauksson, Hafsteinn Einarsson. 1956-1967 [doi]
- "Tell me who you are and I tell you how you argue": Predicting Stances and Arguments for Stakeholder GroupsPhilipp Heinisch, Lorik Dumani, Philipp Cimiano, Ralf Schenkel. 1968-1982 [doi]
- Psychometric Predictive Power of Large Language ModelsTatsuki Kuribayashi, Yohei Oseki, Timothy Baldwin. 1983-2005 [doi]
- Large Language Models Sensitivity to The Order of Options in Multiple-Choice QuestionsPouya Pezeshkpour, Estevam Hruschka. 2006-2017 [doi]
- PEEB: Part-based Image Classifiers with an Explainable and Editable Language BottleneckThang Pham, Peijie Chen, Tin Nguyen 0006, Seunghyun Yoon, Trung Bui, Anh Nguyen 0002. 2018-2053 [doi]
- Ethos: Rectifying Language Models in Orthogonal Parameter SpaceLei Gao, Yue Niu, Tingting Tang, Salman Avestimehr, Murali Annavaram. 2054-2068 [doi]
- Crafting In-context Examples according to LMs' Parametric KnowledgeYoonsang Lee 0004, Pranav Atreya, Xi Ye, Eunsol Choi. 2069-2085 [doi]
- ICXML: An In-Context Learning Framework for Zero-Shot Extreme Multi-Label ClassificationYaxin Zhu, Hamed Zamani. 2086-2098 [doi]
- CLGSI: A Multimodal Sentiment Analysis Framework based on Contrastive Learning Guided by Sentiment IntensityYang Yang, Xunde Dong, Yupeng Qiang. 2099-2110 [doi]
- Interpreting Answers to Yes-No Questions in Dialogues from Multiple DomainsZijie Wang, Farzana Rashid, Eduardo Blanco 0002. 2111-2128 [doi]
- Enhancing Perception: Refining Explanations of News Claims with LLM ConversationsYi-Li Hsu, Jui-Ning Chen, Yang Fan Chiang, Shang-Chien Liu, Aiping Xiong, Lun-Wei Ku. 2129-2147 [doi]
- How Interpretable are Reasoning Explanations from Prompting Large Language Models?Wei Jie Yeo, Ranjan Satapathy, Rich Siow Mong Goh, Erik Cambria. 2148-2164 [doi]
- Plug-in Language Model: Controlling Text Generation with a Simple Regression ModelNai-Chi Yang, Wei-Yun Ma, Pu-Jen Cheng. 2165-2181 [doi]
- Signer Diversity-driven Data Augmentation for Signer-Independent Sign Language TranslationHonghaofu Honghaofu, Liang Zhang, Biao Fu, Rui Zhao, Jinsong Su, Xiaodong Shi, Yidong Chen 0001. 2182-2193 [doi]
- A Systematic Analysis of Subwords and Cross-Lingual Transfer in Multilingual TranslationFrancois Meyer, Jan Buys. 2194-2200 [doi]
- Multi-Granularity Guided Fusion-in-DecoderEunseong Choi, Hyeri Lee, Jongwuk Lee. 2201-2212 [doi]
- Group Fairness in Multilingual Speech Recognition ModelsAnna Zee, Marc Zee, Anders Søgaard. 2213-2226 [doi]
- Rethinking Machine Ethics - Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?Jingyan Zhou, Minda Hu, Junan Li, Xiaoying Zhang, Xixin Wu, Irwin King, Helen Meng. 2227-2242 [doi]
- Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language ModelsRui Wang 0092, Fei Mi, Yi Chen, Boyang Xue, Hongru Wang 0003, Qi Zhu, Kam-Fai Wong, Ruifeng Xu. 2243-2255 [doi]
- BERTweet's TACO Fiesta: Contrasting Flavors On The Path Of Inference And Information-Driven Argument Mining On TwitterMarc Feger, Stefan Dietze. 2256-2266 [doi]
- Testing the limits of logical reasoning in neural and hybrid modelsManuel Vargas Guzmán, Jakub Szymanik, Maciej Malicki. 2267-2279 [doi]
- METAL: Towards Multilingual Meta-EvaluationRishav Hada, Varun Gumma, Mohamed Ahmed, Kalika Bali, Sunayana Sitaram. 2280-2298 [doi]
- AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsWanjun Zhong, Ruixiang Cui, Yiduo Guo, Yaobo Liang, Shuai Lu, Yanlin Wang, Amin Saied, Weizhu Chen, Nan Duan. 2299-2314 [doi]
- Product Description and QA Assisted Self-Supervised Opinion SummarizationTejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya. 2315-2332 [doi]
- COMEM: In-Context Retrieval-Augmented Mass-Editing Memory in Large Language ModelsShanbao Qiao, Xuebing Liu, Seung-Hoon Na. 2333-2347 [doi]
- Content-Specific Humorous Image Captioning Using Incongruity Resolution Chain-of-ThoughtKohtaro Tanaka, Kohei Uehara, Lin Gu, Yusuke Mukuta, Tatsuya Harada. 2348-2367 [doi]
- Denoising Attention for Query-aware User ModelingElias Bassani, Pranav Kasela, Gabriella Pasi. 2368-2380 [doi]
- A Lightweight Mixture-of-Experts Neural Machine Translation Model with Stage-wise Training StrategyFan Zhang, Mei Tu, Song Liu, Jinyao Yan. 2381-2392 [doi]
- BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language ModelsJacek Wiland, Max Ploner, Alan Akbik. 2393-2411 [doi]
- Conformal Intent Classification and Clarification for Fast and Accurate Intent RecognitionFloris den Hengst, Ralf Wolter, Patrick Altmeyer, Arda Kaygan. 2412-2432 [doi]
- Anonymity at Risk? Assessing Re-Identification Capabilities of Large Language Models in Court DecisionsAlex Nyffenegger, Matthias Stürmer, Joel Niklaus. 2433-2462 [doi]
- X-LLaVA: Optimizing Bilingual Large Vision-Language AlignmentDongjae Shin, HyeonSeok Lim, Inho Won, ChangSu Choi, MinJun Kim, Seungwoo Song, Hangyeol Yoo, Sangmin Kim, Kyungtae Lim. 2463-2473 [doi]
- Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual NoiseGiwon Hong, Jeonghwan Kim, Junmo Kang, Sung-Hyon Myaeng, Joyce Jiyoung Whang. 2474-2495 [doi]
- Heterogeneity over Homogeneity: Investigating Multilingual Speech Pre-Trained Models for Detecting Audio DeepfakeOrchid Chetia Phukan, Gautam Siddharth Kashyap, Arun Balaji Buduru, Rajesh Sharma 0002. 2496-2506 [doi]
- Identifying Self-Disclosures of Use, Misuse and Addiction in Community-based Social Media PostsChenghao Yang, Tuhin Chakrabarty, Karli R. Hochstatter, Melissa N. Slavin, Nabila El-Bassel, Smaranda Muresan. 2507-2521 [doi]
- Self-Adaptive Sampling for Accurate Video Question Answering on Image Text ModelsWei Han, Hui Chen 0023, Min-Yen Kan, Soujanya Poria. 2522-2534 [doi]
- Towards an On-device Agent for Text RewritingYun Zhu, Yinxiao Liu, Felix Stahlberg, Shankar Kumar, Yu-Hui Chen, Liangchen Luo, Lei Shu 0004, Renjie Liu, Jindong Chen, Lei Meng. 2535-2552 [doi]
- Tailoring Vaccine Messaging with Common-Ground OpinionsRickard Stureborg, Sanxing Chen, Roy Xie, Aayushi Patel, Christopher Li, Chloe Qinyu Zhu, Tingnan Hu, Jun Yang, Bhuwan Dhingra. 2553-2575 [doi]
- Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation ClassificationRobert Vacareanu, Fahmida Alam, Md. Asiful Islam, Haris Riaz, Mihai Surdeanu. 2576-2594 [doi]
- Q-Tuning: Queue-based Prompt Tuning for Lifelong Few-shot Language LearningYanhui Guo, Shaoyuan Xu, Jinmiao Fu, Jia Liu, Chaosheng Dong, Bryan Wang. 2595-2622 [doi]
- In-Context Example Ordering Guided by Label DistributionsZhichao Xu, Daniel Cohen, Bei Wang 0001, Vivek Srikumar. 2623-2640 [doi]
- Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial NarrativesJiaxin Liu, Yi Yang, Kar Yan Tam. 2641-2652 [doi]
- Laying Anchors: Semantically Priming Numerals in Language ModelingMandar Sharma, Rutuja Murlidhar Taware, Pravesh Koirala, Nikhil Muralidhar, Naren Ramakrishnan. 2653-2660 [doi]
- UEGP: Unified Expert-Guided Pre-training for Knowledge RekindleYutao Mou, Kexiang Wang, Jianhe Lin, Dehong Ma, Jun Fan, Daiting Shi, Zhicong Cheng, Simiu Gu, Dawei Yin, Weiran Xu. 2661-2673 [doi]
- LatticeGen: Hiding Generated Text in a Lattice for Privacy-Aware Large Language Model Generation on CloudMengke Zhang, Tianxing He, Tianle Wang 0003, Lu Mi, Niloofar Mireshghallah, Binyi Chen, Hao Wang, Yulia Tsvetkov. 2674-2690 [doi]
- HateModerate: Testing Hate Speech Detectors against Content Moderation PoliciesJiangrui Zheng, Xueqing Liu, Mirazul Haque, Xing Qian, Guanqun Yang, Wei Yang. 2691-2710 [doi]
- Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each OtherYifei Gao, Jie Ou, Lei Wang, Yuting Xiao, Xiangzhiyuan Xiangzhiyuan, Ruiting Dai, Jun Cheng. 2711-2722 [doi]
- Contrastive Preference Learning for Neural Machine TranslationJianfei He, Shichao Sun, Sen Peng, Jie Xu 0031, Xiaohua Jia, Wenjie Li 0002. 2723-2735 [doi]
- SocREval: Large Language Models with the Socratic Method for Reference-free Reasoning EvaluationHangfeng He 0001, Hongming Zhang 0009, Dan Roth. 2736-2764 [doi]
- Multilingual Machine Translation with Large Language Models: Empirical Results and AnalysisWenhao Zhu, Hongyi Liu 0008, Qingxiu Dong, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen 0001, Lei Li. 2765-2781 [doi]
- Unleashing the Power of LLMs in Court View Generation by Stimulating Internal Knowledge and Incorporating External KnowledgeYiFei Liu, Yiquan Wu, Ang Li, Yating Zhang, Changlong Sun, Weiming Lu 0001, Fei Wu 0001, Kun Kuang. 2782-2792 [doi]
- Prompting Vision-Language Models For Aspect-Controlled Generation of Referring ExpressionsDanfeng Guo, Sanchit Agarwal, Arpit Gupta, Jiun-Yu Kao, Emre Barut, Tagyoung Chung, Jing Huang, Mohit Bansal. 2793-2807 [doi]
- Task-Agnostic Detector for Insertion-Based Backdoor AttacksWeimin Lyu, Xiao Lin, Songzhu Zheng, Lu Pang 0006, Haibin Ling, Susmit Jha, Chao Chen 0012. 2808-2822 [doi]
- Uncertainty Estimation on Sequential Labeling via Uncertainty TransmissionJianfeng He, Linlin Yu, Shuo Lei, Chang-Tien Lu, Feng Chen 0001. 2823-2835 [doi]
- Exploring Language Model's Code Generation Ability with Auxiliary FunctionsSeonghyeon Lee, Sanghwan Jang, Seongbo Jang, Dongha Lee, Hwanjo Yu. 2836-2848 [doi]
- Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language ModelsSang T. Truong, Duc Nguyen, Toan Nguyen, Dong D. Le, Nhi N. Truong, Tho Quan, Sanmi Koyejo. 2849-2900 [doi]
- GoT: Effective Graph-of-Thought Reasoning in Language ModelsYao Yao, Zuchao Li, Hai Zhao 0001. 2901-2921 [doi]
- Enhancing the General Agent Capabilities of Low-Paramter LLMs through Tuning and Multi-Branch ReasoningQinhao Zhou, Zihan Zhang, Xiang Xiang 0001, Ke Wang, Yuchuan Wu, Yongbin Li. 2922-2931 [doi]
- MuMath: Multi-perspective Data Augmentation for Mathematical Reasoning in Large Language ModelsWeihao You, Shuo Yin, Xudong Zhao, Zhilong Ji, Guoqiang Zhong 0001, Jinfeng Bai. 2932-2958 [doi]
- Tram: A Token-level Retrieval-augmented Mechanism for Source Code SummarizationTong Ye, Lingfei Wu 0001, Tengfei Ma 0001, Xuhong Zhang 0002, Yangkai Du, Peiyu Liu 0003, Shouling Ji, Wenhai Wang. 2959-2971 [doi]
- UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State TrackingChuang Li, Yan Zhang 0004, Min-Yen Kan, Haizhou Li 0001. 2972-2983 [doi]
- Evaluating Step-by-Step Reasoning through Symbolic VerificationYifan Zhang, Hanlin Zhang, Li Li, Eric P. Xing. 2984-3002 [doi]
- Multi-Review Fusion-in-ContextAviv Slobodkin, Ori Shapira, Ran Levy 0001, Ido Dagan. 3003-3021 [doi]
- Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic ComparisonMaxime Bouthors, Josep Maria Crego, François Yvon. 3022-3039 [doi]
- Extending Input Contexts of Language Models through Training on Segmented SequencesPetros Karypis, Julian J. McAuley, George Karypis. 3040-3052 [doi]
- Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy UnderstandingYanda Li, Dixuan Wang, Jiaqing Liang, Guochao Jiang, Qianyu He, Yanghua Xiao, Deqing Yang. 3053-3066 [doi]
- Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language ModelsWanyong Feng, Jaewook Lee 0006, Hunter McNichols, Alexander Scarlatos, Digory Smith, Simon Woodhead 0002, Nancy Otero Ornelas, Andrew S. Lan. 3067-3082 [doi]
- Aspect-based Sentiment Analysis with Context DenoisingYuanhe Tian, Chang Liu, Yan Song, Fei Xia, Yongdong Zhang 0001. 3083-3095 [doi]
- IruMozhi: Automatically classifying diglossia in TamilKabilan Prasanna, Aryaman Arora. 3096-3103 [doi]
- RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural ConversationsHaolan Zhan, Zhuang Li, Xiaoxi Kang, Tao Feng 0013, Yuncheng Hua, Lizhen Qu, Yi Ying, Mei Rianto Chandra, Kelly Rosalin, Jureynolds Jureynolds, Suraj Sharma, Shilin Qu, Linhao Luo, Ingrid Zukerman, Lay-Ki Soon, Zhaleh Semnani-Azad, Reza Haf. 3104-3117 [doi]
- Human-in-the-Loop Synthetic Text Data Inspection with Provenance TrackingHong Jin Kang, Fabrice Harel-Canada, Muhammad Ali Gulzar, Nanyun Peng, Miryung Kim. 3118-3129 [doi]
- COMMIT: Code-Mixing English-Centric Large Language Model for Multilingual Instruction TuningJaeseong Lee, YeonJoon Jung, Seung-won Hwang. 3130-3137 [doi]
- DiLM: Distilling Dataset into Language Model for Text-level Dataset DistillationAru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, Manabu Okumura. 3138-3153 [doi]
- MindAgent: Emergent Gaming InteractionRan Gong, Qiuyuan Huang, Xiaojian Ma, Yusuke Noda, Zane Durante, Zilong Zheng, Demetri Terzopoulos, Li Fei-Fei 0001, Jianfeng Gao 0001, Hoi Vo. 3154-3183 [doi]
- BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn DialoguesHaodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen 0026. 3184-3200 [doi]
- Learning Mutually Informed Representations for Characters and SubwordsYilin Wang, Xinyi Hu, Matthew R. Gormley. 3201-3213 [doi]
- A Novel Two-step Fine-tuning Framework for Transfer Learning in Low-Resource Neural Machine TranslationYuan Gao, Feng Hou, Ruili Wang. 3214-3224 [doi]
- Enhancing Cross-lingual Sentence Embedding for Low-resource Languages with Word AlignmentZhongtao Miao, Qiyu Wu 0001, Kaiyan Zhao, Zilong Wu, Yoshimasa Tsuruoka. 3225-3236 [doi]
- C³LPGCN:Integrating Contrastive Learning and Cooperative Learning with Prompt into Graph Convolutional Network for Aspect-based Sentiment AnalysisYe He, Shihao Zou, Yuzhe Chen, Xianying Huang. 3237-3247 [doi]
- Visual Enhanced Entity-Level Interaction Network for Multimodal SummarizationHaolong Yan, Binghao Tang, Boda Lin, Gang Zhao, Si Li 0001. 3248-3260 [doi]
- Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context LearningJianing Wang, Chengyu Wang 0001, Chuanqi Tan, Jun Huang 0007, Ming Gao 0001. 3261-3280 [doi]
- Time Machine GPTFelix Drinkall, Eghbal Rahimikia, Janet B. Pierrehumbert, Stefan Zohren. 3281-3292 [doi]
- An End-to-End Submodular Framework for Data-Efficient In-Context LearningLilly Kumari, Shengjie Wang, Arnav Das, Tianyi Zhou 0001, Jeff A. Bilmes. 3293-3308 [doi]
- Teaching Llama a New Language Through Cross-Lingual Knowledge TransferHele-Andra Kuulmets, Taido Purason, Agnes Luhtaru, Mark Fishel. 3309-3325 [doi]
- Simulating Opinion Dynamics with Networks of LLM-based AgentsYun-Shiuan Chuang, Agam Goyal, Nikunj Harlalka, Siddharth Suresh, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers. 3326-3346 [doi]
- Probing the Category of Verbal Aspect in Transformer Language ModelsAnisia Katinskaia, Roman Yangarber. 3347-3366 [doi]
- A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data SetsTanja Samardzic, Ximena Gutierrez, Christian Bentz, Steven Moran, Olga Pelloni. 3367-3382 [doi]
- Beyond Read-Only: Crafting a Comprehensive Chinese Text-to-SQL Dataset for Database Manipulation and QueryXi Chen, Jinguo You, Likun Likun, Xiang Li. 3383-3393 [doi]
- Normalizing without Modernizing: Keeping Historical Wordforms of Middle French while Reducing Spelling VariantsRaphael Rubino, Johanna Gerlach, Jonathan Mutal, Pierrette Bouillon. 3394-3402 [doi]
- Anti-LM Decoding for Zero-shot In-context Machine TranslationSuzanna Sia, Alexandra DeLucia, Kevin Duh. 3403-3420 [doi]
- Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-TuningShuai Zhao, Leilei Gan, Anh Tuan Luu, Jie Fu, Lingjuan Lyu, Meihuizi Jia, Jinming Wen. 3421-3438 [doi]
- Select and Summarize: Scene Saliency for Movie Script SummarizationRohit Saxena, Frank Keller. 3439-3455 [doi]
- Don't be a Fool: Pooling Strategies in Offensive Language Detection from User-Intended Adversarial AttacksSeunguk Yu, Juhwan Choi, Youngbin Kim. 3456-3467 [doi]
- Z-GMOT: Zero-shot Generic Multiple Object TrackingKim Tran, Anh-Duy Le-Dinh, Tien-Phat Nguyen, Thinh Phan, Pha A. Nguyen, Khoa Luu, Donald A. Adjeroh, Gianfranco Doretto, Ngan Le. 3468-3479 [doi]
- NLP for Counterspeech against Hate: A Survey and How-To GuideHelena Bonaldi, Yi-Ling Chung, Gavin Abercrombie, Marco Guerini. 3480-3499 [doi]
- PRODIGy: a PROfile-based DIalogue Generation datasetDaniela Occhipinti, Serra Sinem Tekiroglu, Marco Guerini. 3500-3514 [doi]
- WaterJudge: Quality-Detection Trade-off when Watermarking Large Language ModelsPiotr Molenda, Adian Liusie, Mark J. F. Gales. 3515-3525 [doi]
- Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical ThinkingNan Xu, Fei Wang 0060, Ben Zhou, Bangzheng Li, Chaowei Xiao, Muhao Chen. 3526-3548 [doi]
- PAELLA: Parameter-Efficient Lightweight Language-Agnostic Captioning ModelRita Ramos, Emanuele Bugliarello, Bruno Martins 0001, Desmond Elliott. 3549-3564 [doi]
- OSCaR: Object State Captioning and State Change RepresentationNguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu. 3565-3576 [doi]
- SumCSE: Summary as a transformation for Contrastive LearningRaghuveer Thirukovalluru, Xiaolan Wang, Jun Chen, Shuyang Li, Jie Lei, Rong Jin, Bhuwan Dhingra. 3577-3588 [doi]
- The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic TextYanzhu Guo, Guokan Shang, Michalis Vazirgiannis, Chloé Clavel. 3589-3604 [doi]
- PersonaLLM: Investigating the Ability of Large Language Models to Express Personality TraitsHang Jiang, Xiajie Zhang, Xubo Cao, Cynthia Breazeal, Deb Roy, Jad Kabbara. 3605-3627 [doi]
- FIRE: A Dataset for Financial Relation ExtractionHassan Hamad, Abhinav Kumar Thakur, Nijil Kolleri, Sujith Pulikodan, Keith M. Chugg. 3628-3642 [doi]
- MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query ResponseZihao Deng, Yinghao Ma, Yudong Liu, Rongchen Guo, Ge Zhang, Wenhu Chen, Wenhao Huang, Emmanouil Benetos. 3643-3655 [doi]
- Investigating Acceleration of LLaMA Inference by Enabling Intermediate Layer Decoding via Instruction Tuning with 'LITE'Neeraj Varshney, Agneet Chatterjee, Mihir Parmar, Chitta Baral. 3656-3677 [doi]
- Instruction-following Evaluation through Verbalizer ManipulationShiyang Li, Jun Yan, Hai Wang, Zheng Tang, Xiang Ren, Vijay Srinivasan, Hongxia Jin. 3678-3692 [doi]
- WebWISE: Unlocking Web Interface Control for LLMs via Sequential ExplorationHeyi Tao, Sethuraman TV, Michal Shlapentokh-Rothman, Tanmay Gupta, Heng Ji, Derek Hoiem. 3693-3711 [doi]
- CodecLM: Aligning Language Models with Tailored Synthetic DataZifeng Wang 0002, Chun-Liang Li, Vincent Perot, Long T. Le, Jin Miao, Zizhao Zhang, Chen-Yu Lee, Tomas Pfister. 3712-3729 [doi]
- Prompting Few-shot Multi-hop Question Generation via Comprehending Type-aware SemanticsZefeng Lin, Weidong Chen, Yan Song, Yongdong Zhang 0001. 3730-3740 [doi]
- When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language ModelsYanhong Li, Chenghao Yang, Allyson Ettinger. 3741-3753 [doi]
- CoDa: Constrained Generation based Data Augmentation for Low-Resource NLPChandra Kiran Reddy Evuru, Sreyan Ghosh, Sonal Kumar, Ramaneswaran S., Utkarsh Tyagi, Dinesh Manocha. 3754-3769 [doi]
- Synonym relations affect object detection learned on vision-language dataGiacomo Nebbia, Adriana Kovashka. 3770-3776 [doi]
- CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency ModelsXiang Li, FanBu FanBu, Ambuj Mehrish, Yingting Li, Jiale Han, Bo Cheng, Soujanya Poria. 3777-3794 [doi]
- RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive LearningJavad Rafiei-Asl, Prajwal Panzade, Eduardo Blanco 0002, Daniel Takabi, Zhipeng Cai 0001. 3795-3809 [doi]
- Characterizing Human and Zero-Shot GPT-3.5 Object-Similarity JudgmentsD. McKnight, Alona Fyshe. 3810-3828 [doi]
- Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language ModelsWei He, Shichun Liu, Jun Zhao 0019, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang 0001, Xuanjing Huang 0001. 3829-3845 [doi]
- Getting Sick After Seeing a Doctor? Diagnosing and Mitigating Knowledge Conflicts in Event Temporal ReasoningTianqing Fang, Zhaowei Wang 0003, Wenxuan Zhou, Hongming Zhang 0009, Yangqiu Song, Muhao Chen. 3846-3868 [doi]
- MCECR: A Novel Dataset for Multilingual Cross-Document Event Coreference ResolutionAmir Pouran Ben Veyseh, Viet Lai, Chien Nguyen, Franck Dernoncourt, Thien Huu Nguyen. 3869-3880 [doi]
- Sentiment Analysis in the Era of Large Language Models: A Reality CheckWenxuan Zhang, Yue Deng 0010, Bing Liu 0001, Sinno Jialin Pan, Lidong Bing. 3881-3906 [doi]
- Tokenizer Choice For LLM Training: Negligible or Crucial?Mehdi Ali, Michael Fromm 0001, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr. 3907-3924 [doi]
- Think Before You Speak: Cultivating Communication Skills of Large Language Models via Inner MonologueJunkai Zhou, Liang Pang, Huawei Shen, Xueqi Cheng. 3925-3951 [doi]
- The Impact of Differential Privacy on Group Disparity MitigationVictor Petrén Bach Hansen, Atula Tejaswi Neerkaje, Ramit Sawhney, Lucie Flek, Anders Søgaard. 3952-3965 [doi]
- Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement LearningShivam Mhaskar, Nirmesh Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah. 3966-3976 [doi]
- Read between the lines - Functionality Extraction From READMEsPrince Kumar, Srikanth Tamilselvam, Dinesh Garg. 3977-3990 [doi]
- AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment GraphZhaowei Wang 0003, Haochen Shi, Weiqi Wang, Tianqing Fang, Hongming Zhang 0009, Sehyun Choi, Xin Liu, Yangqiu Song. 3991-4010 [doi]
- Few-TK: A Dataset for Few-shot Scientific Typed Keyphrase RecognitionAvishek Lahiri, Pratyay Sarkar, Medha Sen, Debarshi Kumar Sanyal, Imon Mukherjee. 4011-4025 [doi]
- Language Models can be Deductive SolversJiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen, Dongyan Zhao 0001, Weizhu Chen. 4026-4042 [doi]
- Interpreting User Requests in the Context of Natural Language Standing InstructionsNikita Moghe, Patrick Xia 0002, Jacob Andreas, Jason Eisner, Benjamin Van Durme, Harsh Jhamtani. 4043-4060 [doi]
- Secure Your Model: An Effective Key Prompt Protection Mechanism for Large Language ModelsRuixiang Tang, Yu-Neng Chuang, Xuanting Cai, Mengnan Du, Xia Hu 0001. 4061-4073 [doi]
- Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language ModelsJiashuo Sun, Yi Luo, Yeyun Gong, Chen Lin 0001, Yelong Shen, Jian Guo, Nan Duan. 4074-4101 [doi]
- Do Prompt Positions Really Matter?Junyu Mao, Stuart E. Middleton, Mahesan Niranjan. 4102-4130 [doi]
- Natural Language Embedded Programs for Hybrid Language Symbolic ReasoningTianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong 0001, Yoon Kim, Xixin Wu, Helen Meng, Jim Glass 0001. 4131-4155 [doi]
- A Study on Scaling Up Multilingual News Framing AnalysisSyeda Sabrina Akter, Antonios Anastasopoulos. 4156-4173 [doi]
- ViGLUE: A Vietnamese General Language Understanding Benchmark and Analysis of Vietnamese Language ModelsMinh-Nam Tran, Phu-Vinh Nguyen, Long Nguyen, Dien Dinh. 4174-4189 [doi]
- Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human RationalesLucas E. Resck, Marcos M. Raimundo, Jorge Poco. 4190-4216 [doi]
- Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language TranslationTong Su, Xin Peng, Sarubi Thillainathan, David Guzmán, Surangika Ranathunga, En-Shiun Annie Lee. 4217-4225 [doi]
- ADaPT: As-Needed Decomposition and Planning with Language ModelsArchiki Prasad, Alexander Koller, Mareike Hartmann, Peter Clark, Ashish Sabharwal, Mohit Bansal, Tushar Khot. 4226-4252 [doi]
- Guiding Large Language Models to Post-Edit Machine Translation with Error AnnotationsDayeon Ki, Marine Carpuat. 4253-4273 [doi]
- Non-contrastive sentence representations via self-supervisionDuccio Pappadopulo, Marco Farina. 4274-4284 [doi]
- Semantically-Prompted Language Models Improve Visual DescriptionsMichael Ogezi, Bradley Hauer, Grzegorz Kondrak. 4285-4302 [doi]
- GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language ModelsRuotong Liao, Xu Jia, Yangzhe Li, Yunpu Ma, Volker Tresp. 4303-4317 [doi]
- A Transformer with Stack AttentionJiaoda Li, Jennifer C. White, Mrinmaya Sachan, Ryan Cotterell. 4318-4335 [doi]
- InstructEval: Systematic Evaluation of Instruction Selection MethodsAnirudh Ajith, Chris Pan, Mengzhou Xia, Ameet Deshpande, Karthik Narasimhan. 4336-4350 [doi]
- RecMind: Large Language Model Powered Agent For RecommendationYancheng Wang, Ziyan Jiang, Zheng Chen 0010, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Yanbin Lu, Xiaojiang Huang, Yingzhen Yang. 4351-4364 [doi]
- GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data GenerationMohsen Gholami, Mohammad Akbari, Tianxi Hu, Vaden Masrani, Z. Wang, Yong Zhang. 4365-4380 [doi]
- How Lexical is Bilingual Lexicon Induction?Harsh Kohli, Helian Feng, Nicholas Dronen, Calvin McCarter, Sina Moeini, Ali Kebarighotbi. 4381-4386 [doi]
- Fumbling in Babel: An Investigation into ChatGPT's Language Identification AbilityWei-Rui Chen, Ife Adebara, Khai Duy Doan, Qisheng Liao, Muhammad Abdul-Mageed. 4387-4413 [doi]
- Targeted Augmentation for Low-Resource Event ExtractionSijia Wang, Lifu Huang. 4414-4428 [doi]
- Asking More Informative Questions for Grounded RetrievalSedrick Keh, Justin T. Chiu, Daniel Fried. 4429-4442 [doi]
- Efficient Citer: Tuning Large Language Models for Enhanced Answer Quality and VerificationMarzieh S. Tahaei, Aref Jafari, Ahmad Rashid, David Alfonso-Hermelo, Khalil Bibi, Yimeng Wu, Ali Ghodsi 0001, Boxing Chen, Mehdi Rezagholizadeh. 4443-4450 [doi]
- Addressing Healthcare-related Racial and LGBTQ+ Biases in Pretrained Language ModelsSean Xie, Saeed Hassanpour, Soroush Vosoughi. 4451-4464 [doi]
- ATG: Benchmarking Automated Theorem Generation for Generative Language ModelsXiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang. 4465-4480 [doi]
- Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable SummarizationYixin Liu 0003, Alexander R. Fabbri, Jiawen Chen, Yilun Zhao 0001, Simeng Han, Shafiq Joty, Pengfei Liu 0003, Dragomir Radev, Chien-Sheng Wu, Arman Cohan. 4481-4501 [doi]
- NeuroComparatives: Neuro-Symbolic Distillation of Comparative KnowledgePhillip Howard, Junlin Wang, Vasudev Lal, Gadi Singer, Yejin Choi 0001, Swabha Swayamdipta. 4502-4520 [doi]
- Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in ConversationFangxu Yu, Junjie Guo, Zhen Wu, Xinyu Dai. 4521-4534 [doi]
- SUQL: Conversational Search over Structured and Unstructured Data with Large Language ModelsShicheng Liu, Jialiang Xu, Wesley Tjangnaka, Sina J. Semnani, Chen Jie Yu, Monica Lam 0001. 4535-4555 [doi]
- On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question AnsweringLinyong Nan, Ellen Zhang, Weijin Zou, Yilun Zhao 0001, Wenfei Zhou, Arman Cohan. 4556-4579 [doi]
- CARE: Extracting Experimental Findings From Clinical LiteratureAakanksha Naik, Bailey Kuehl, Erin Bransom, Doug Downey, Tom Hope. 4580-4596 [doi]
- Personalized Federated Learning for Text Classification with Gradient-Free Prompt TuningRui Wang 0088, Tong Yu 0001, Ruiyi Zhang, SungChul Kim, Ryan A. Rossi, Handong Zhao, Junda Wu, Subrata Mitra, Lina Yao 0001, Ricardo Henao. 4597-4612 [doi]
- SGSH: Stimulate Large Language Models with Skeleton Heuristics for Knowledge Base Question GenerationShasha Guo, Lizi Liao, Jing Zhang, Yanling Wang, Cuiping Li, Hong Chen. 4613-4625 [doi]
- Biomedical Entity Representation with Graph-Augmented Multi-Objective TransformerAndrey Sakhovskiy, Natalia Semenova, Artur Kadurin, Elena Tutubalina. 4626-4643 [doi]
- Cross-Lingual Summarization with Pseudo-Label RegularizationThang Le. 4644-4677 [doi]
- On the Way to Gentle AI Counselor: Politeness Cause Elicitation and Intensity Tagging in Code-mixed Hinglish Conversations for Social GoodPriyanshu Priya, Gopendra Vikram Singh, Mauajama Firdaus, Jyotsna Agrawal, Asif Ekbal. 4678-4696 [doi]
- Leveraging Summarization for Unsupervised Dialogue Topic SegmentationAleksei Artemiev, Daniil Parinov, Alexey Grishanov, Ivan Borisov, Alexey Vasilev, Daniil Muravetskii, Aleksey Rezvykh, Aleksei Goncharov, Andrey Savchenko. 4697-4704 [doi]
- LLaMA-Rider: Spurring Large Language Models to Explore the Open WorldYicheng Feng, Yuxuan Wang, Jiazheng Liu, Sipeng Zheng, Zongqing Lu. 4705-4724 [doi]
- Contrastive Learning as a Polarizer: Mitigating Gender Bias by Fair and Biased sentencesKyungmin Park, Sihyun Oh, Daehyun Kim, Juae Kim. 4725-4736 [doi]
- PoLLMgraph: Unraveling Hallucinations in Large Language Models via State Transition DynamicsDerui Zhu, Dingfan Chen, Qing Li 0038, Zongxiong Chen, Lei Ma 0003, Jens Grossklags, Mario Fritz. 4737-4751 [doi]
- Improving Health Question Answering with Reliable and Time-Aware Evidence RetrievalJuraj Vladika, Florian Matthes. 4752-4763 [doi]
- DecoderLens: Layerwise Interpretation of Encoder-Decoder TransformersAnna Langedijk, Hosein Mohebbi, Gabriele Sarti, Willem H. Zuidema, Jaap Jumelet. 4764-4780 [doi]