Abstract is missing.
- Frontmatter [doi]
- Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute ManipulationLetian Peng, Yuwei Zhang 0001, Jingbo Shang. 1-16 [doi]
- Match More, Extract Better! Hybrid Matching Model for Open Domain Web Keyphrase ExtractionMingYang Song, Liping Jing, Yi Feng. 17-27 [doi]
- AFPQ: Asymmetric Floating Point Quantization for LLMsYijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu. 28-36 [doi]
- End-to-End Emotion Semantic ParsingXiaotong Jiang, Zhongqing Wang, Guodong Zhou. 37-47 [doi]
- Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue SystemChen Chen 0075, Ruizhe Li 0001, Yuchen Hu, Yuanyuan Chen, Chengwei Qin, Qiang Zhang. 48-61 [doi]
- Unveiling Imitation Learning: Exploring the impact of Data Falsity to Large Language ModelHyunsoo Cho. 62-73 [doi]
- The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?Alex Gu, Wen-Ding Li, Naman Jain, Theo Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama. 74-117 [doi]
- CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review SupportChao-Chun Hsu, Erin Bransom, Jenna Sparks, Bailey Kuehl, Chenhao Tan, David Wadden, Lucy Lu Wang, Aakanksha Naik. 118-132 [doi]
- Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and EvaluationHao Li 0074, Yuping Wu, Viktor Schlegel, Riza Batista-Navarro, Tharindu Madusanka, Iqra Zahid, Jiayan Zeng, Xiaochi Wang, Xinran He, Yizhi Li, Goran Nenadic. 133-150 [doi]
- A Grounded Preference Model for LLM AlignmentTahira Naseem, Guangxuan Xu, Sarathkrishna Swaminathan, Asaf Yehudai, Subhajit Chaudhury, Radu Florian, Ramón Fernandez Astudillo, Asim Munawar. 151-162 [doi]
- Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on GraphsBowen Jin, Chulin Xie, Jiawei Zhang 0001, Kashob Kumar Roy, Yu Zhang 0044, Zheng Li 0018, Ruirui Li 0002, Xianfeng Tang, Suhang Wang, Yu Meng 0001, Jiawei Han 0001. 163-184 [doi]
- Text2DB: Integration-Aware Information Extraction with Large Language Model AgentsYizhu Jiao, Sha Li, Sizhe Zhou, Heng Ji, Jiawei Han 0001. 185-205 [doi]
- How Important is a Language Model for Low-resource ASR?Zoey Liu, Nitin Venkateswaran, Éric Le Ferrand, Emily Prud'hommeaux. 206-213 [doi]
- MediSwift: Efficient Sparse Pre-trained Biomedical Language ModelsVithursan Thangarasa, Mahmoud Salem, Shreyas Saxena, Chen-Yu Leong, Joel Hestness, Sean Lie. 214-230 [doi]
- Lexicon-Level Contrastive Visual-Grounding Improves Language ModelingChengxu Zhuang, Evelina Fedorenko, Jacob Andreas. 231-247 [doi]
- P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language ModelsShuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji Kasneci. 248-264 [doi]
- Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget ScenariosYuhang Zhou, Wei Ai 0002. 265-282 [doi]
- Small Models are Valuable Plug-ins for Large Language ModelsCanwen Xu, Yichong Xu, Shuohang Wang, Yang Liu 0124, Chenguang Zhu 0001, Julian J. McAuley. 283-294 [doi]
- Are self-explanations from Large Language Models faithful?Andreas Madsen, Sarath Chandar, Siva Reddy. 295-337 [doi]
- ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value ExtractionHenry Peng Zou, Vinay Samuel, Yue Zhou, Weizhi Zhang 0001, Liancheng Fang, Zihe Song, Philip S. Yu, Cornelia Caragea. 338-354 [doi]
- Prompt Engineering a Prompt EngineerQinyuan Ye, Mohamed Ahmed, Reid Pryzant, Fereshte Khani. 355-385 [doi]
- ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious CorrelationsSreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, S. Sakshi, Sanjoy Chowdhury, Dinesh Manocha. 386-406 [doi]
- Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMsNaihao Deng, Zhenjie Sun, Ruiqi He, Aman Sikka, Yulong Chen 0001, Lin Ma, Yue Zhang 0004, Rada Mihalcea. 407-426 [doi]
- Biasly: An Expert-Annotated Dataset for Subtle Misogyny Detection and MitigationBrooklyn Sheppard, Anna Richter, Allison Cohen, Elizabeth Allyn Smith, Tamara Kneese, Carolyne Pelletier, Ioana Baldini, Yue Dong. 427-452 [doi]
- BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational AlgebraParker Glenn, Parag Dakle, Liang Wang, Preethi Raghavan. 453-466 [doi]
- LLM-QAT: Data-Free Quantization Aware Training for Large Language ModelsZechun Liu, Barlas Oguz, Changsheng Zhao 0002, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra. 467-484 [doi]
- InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language ModelHaogeng Liu, Quanzeng You, Yiqi Wang, Xiaotian Han, Bohan Zhai, Yongfei Liu, Wentao Chen, Yiren Jian, Yunzhe Tao, Jianbo Yuan, Ran He 0001, Hongxia Yang. 485-492 [doi]
- Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model AttributionXinze Li, Yixin Cao 0002, Liangming Pan, Yubo Ma, Aixin Sun. 493-516 [doi]
- Benchmarking Cognitive Biases in Large Language Models as EvaluatorsRyan Koo, Minhwa Lee, Vipul Raheja, Jong Inn Park, Zae Myung Kim, Dongyeop Kang. 517-545 [doi]
- X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual InstructionsChong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong. 546-566 [doi]
- Muffin: Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI FeedbackJiashuo Wang, Chunpu Xu, Chak Tou Leong, Wenjie Li, Jing Li. 567-585 [doi]
- Resonance RoPE: Improving Context Length Generalization of Large Language ModelsSuyuchen Wang, Ivan Kobyzev, Peng Lu, Mehdi Rezagholizadeh, Bang Liu. 586-598 [doi]
- MedAgents: Large Language Models as Collaborators for Zero-shot Medical ReasoningXiangru Tang, Anni Zou, Zhuosheng Zhang 0001, Ziming Li, Yilun Zhao 0001, Xingyao Zhang, Arman Cohan, Mark Gerstein. 599-621 [doi]
- Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language ModelsYiming Wang, Zhuosheng Zhang 0001, Pei Zhang 0011, Baosong Yang, Rui Wang 0015. 622-643 [doi]
- DPDLLM: A Black-box Framework for Detecting Pre-training Data from Large Language ModelsBaohang Zhou, Zezhong Wang 0004, Lingzhi Wang, Hongru Wang 0003, Ying Zhang 0015, Kehui Song, Xuhui Sui, Kam-Fai Wong. 644-653 [doi]
- PACIT: Unlocking the Power of Examples for Better In-Context Instruction TuningTianci Xue, Ziqi Wang, Yixia Li, Yun Chen 0007, Guanhua Chen 0001. 654-665 [doi]
- Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language ModelsYuchen Hu, Chen Chen 0075, Chengwei Qin, Qiushi Zhu, Engsiong Chng, Ruizhe Li 0001. 666-679 [doi]
- Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction DebiasingHao Yue, Shaopeng Lai, Chengyi Yang, Liang Zhang, Junfeng Yao, Jinsong Su. 680-691 [doi]
- Large Language Models can Share Images, Too!Young-Jun Lee, Dokyong Lee, Joo Won Sung, Jonghwan Hyeon, Ho-Jin Choi. 692-713 [doi]
- CodeM: Less Data Yields More Versatility via Ability MatrixDaoguang Zan, Ailun Yu, Wei Liu 0007, Bo Shen, Shaoxin Lin, Yongshun Gong, Yafen Yao, Yan Liu, Bei Guan, Weihua Luo, Yongji Wang 0002, Qianxiang Wang, LiZhen Cui. 714-729 [doi]
- Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart CaptioningKung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi Fung 0001, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji. 730-749 [doi]
- BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting EvidenceJiajie Jin, Yutao Zhu 0001, Yujia Zhou 0002, Zhicheng Dou. 750-761 [doi]
- Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human IntentionsWenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu. 762-776 [doi]
- Incremental Sequence Labeling: A Tale of Two ShiftsShengjie Qiu, Junhao Zheng, Zhen Liu, Yicheng Luo, Qianli Ma 0001. 777-791 [doi]
- How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question AnsweringJinxin Liu, Shulin Cao, Jiaxin Shi, Tingjian Zhang, Lunyiu Nie, Linmei Hu, Lei Hou 0001, Juanzi Li. 792-815 [doi]
- MELOV: Multimodal Entity Linking with Optimized Visual Features in Latent SpaceXuhui Sui, Ying Zhang, Yu Zhao, Kehui Song, Baohang Zhou, Xiaojie Yuan. 816-826 [doi]
- Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive DecodingFanyi Qu, Hao Sun, Yunfang Wu. 827-838 [doi]
- Conversational Question Answering with Language Models Generated Reformulations over Knowledge GraphLihui Liu, Blaine Hill, Boxin Du, Fei Wang 0001, Hanghang Tong. 839-850 [doi]
- Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step by StepLi Zhong, Zilong Wang 0002, Jingbo Shang. 851-870 [doi]
- Effective In-Context Example Selection through Data CompressionZhongxiang Sun, Kepu Zhang, Haoyu Wang, Xiao Zhang, Jun Xu. 871-877 [doi]
- Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLMYang Chen, Chong Yang, Tu Hu, Xinhao Chen, Man Lan, Li Cai, Xinlin Zhuang, Xuan Lin, Xin Lu, Aimin Zhou. 878-890 [doi]
- Knowledgeable Preference Alignment for LLMs in Domain-specific Question AnsweringYichi Zhang, Zhuo Chen 0007, Yin Fang, Yanxi Lu, Fangming Li, Wen Zhang 0015, Huajun Chen. 891-904 [doi]
- MARIO: MAth Reasoning with code Interpreter Output - A Reproducible PipelineMinpeng Liao, Chengxi Li 0014, Wei Luo, Jing Wu, Kai Fan 0002. 905-924 [doi]
- DiffusPoll: Conditional Text Diffusion Model for Poll GenerationLe Cheng, Shuangyin Li. 925-935 [doi]
- Exploring Mathematical Extrapolation of Large Language Models with Synthetic DataHaolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie Chen. 936-946 [doi]
- Implanting LLM's Knowledge via Reading Comprehension Tree for Toxicity DetectionHankun Kang, Tieyun Qian. 947-962 [doi]
- LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt CompressionZhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang 0001, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang 0001. 963-981 [doi]
- EconNLI: Evaluating Large Language Models on Economics ReasoningYue Guo, Yi Yang. 982-994 [doi]
- Better Late Than Never: Model-Agnostic Hallucination Post-Processing Framework Towards Clinical Text SummarizationSongda Li, Yunqi Zhang, Chunyuan Deng, Yake Niu, Hui Zhao. 995-1011 [doi]
- Finding and Editing Multi-Modal Neurons in Pre-Trained TransformersHaowen Pan, Yixin Cao 0002, Xiaozhi Wang, Xun Yang, Meng Wang 0001. 1012-1037 [doi]
- Realistic Evaluation of Toxicity in Large Language ModelsTinh Luong, Thanh-Thien Le, Linh Ngo, Thien Nguyen. 1038-1047 [doi]
- Controllable Text Generation with Residual Memory TransformerHanqing Zhang, Si Sun, Haiming Wu, Dawei Song 0001. 1048-1066 [doi]
- Prompt-Based Length Controlled Generation with Multiple Control TypesRenlong Jie, Xiaojun Meng, Lifeng Shang, Xin Jiang 0002, Qun Liu 0001. 1067-1085 [doi]
- PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action ChainLiang Chen 0024, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, YuChi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu 0001, Baobao Chang. 1086-1104 [doi]
- Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation DatasetMinjin Kim, Minju Kim, Hana Kim, Beong-woo Kwak, SeongKu Kang, Youngjae Yu, Jinyoung Yeo, Dongha Lee. 1105-1120 [doi]
- CoLLaVO: Crayon Large Language and Vision mOdelByung kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro. 1121-1138 [doi]
- Modelling Variability in Human Annotator SimulationWen Wu, Wenlin Chen, Chao Zhang 0031, Philip C. Woodland. 1139-1157 [doi]
- BEnQA: A Question Answering Benchmark for Bengali and EnglishSheikh Shafayat, H. M. Quamran Hasan, Minhajur Rahman Chowdhury Mahim, Rifki Afina Putri, James Thorne, Alice Oh. 1158-1177 [doi]
- MORE: Multi-mOdal REtrieval Augmented Generative Commonsense ReasoningWanqing Cui, Keping Bi, Jiafeng Guo, Xueqi Cheng. 1178-1192 [doi]
- Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language ModelsZhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen 0001, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu 0001, Jun Zhao 0001. 1193-1215 [doi]
- BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task TuningQizhi Pei, Lijun Wu, Kaiyuan Gao, Xiaozhuan Liang, Yin Fang, Jinhua Zhu 0001, Shufang Xie 0003, Tao Qin 0001, Rui Yan 0001. 1216-1240 [doi]
- SIBO: A Simple Booster for Parameter-Efficient Fine-TuningZhihao Wen, Jie Zhang, Yuan Fang. 1241-1257 [doi]
- GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-SolvingJiaxin Zhang 0024, Zhongzhi Li, Ming-Liang Zhang 0005, Fei Yin, Cheng-Lin Liu 0001, Yashar Moshfeghi. 1258-1276 [doi]
- Boosting Textural NER with Synthetic Image and Instructive AlignmentJiahao Wang, Wenjun Ke, Peng Wang, Hang Zhang, Dong Nie, Jiajun Liu, Guozheng Li, Ziyu Shang. 1277-1287 [doi]
- Neurons in Large Language Models: Dead, N-gram, PositionalElena Voita, Javier Ferrando, Christoforos Nalmpantis. 1288-1301 [doi]
- LLMs as Bridges: Reformulating Grounded Multimodal Named Entity RecognitionJinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Pan 0002. 1302-1318 [doi]
- Learning Job Title Representation from Job Description Aggregation NetworkNapat Laosaengpha, Thanit Tativannarat, Chawan Piansaddhayanon, Attapol Rutherford, Ekapol Chuangsuwanich. 1319-1329 [doi]
- FlowVQA: Mapping Multimodal Logic in Visual Question Answering with FlowchartsShubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta 0001, Dan Roth. 1330-1350 [doi]
- Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity RecognitionYahan Yu, Duzhen Zhang, Xiuyi Chen, Chenhui Chu. 1351-1358 [doi]
- Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language ModelsYiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D'Haro, Robby T. Tan, Haizhou Li 0001. 1359-1375 [doi]
- Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language ModelsAdian Liusie, Yassir Fathullah, Mark J. F. Gales. 1376-1387 [doi]
- Uncovering Limitations of Large Language Models in Information Seeking from TablesChaoxu Pang, Yixuan Cao 0001, Chunhao Yang, Ping Luo 0001. 1388-1409 [doi]
- An Ensemble-of-Experts Framework for Rehearsal-free Continual Relation ExtractionShen Zhou, Yongqi Li 0002, Xin Miao, Tieyun Qian. 1410-1423 [doi]
- Temporal Validity Change PredictionGeorg Wenzel, Adam Jatowt. 1424-1446 [doi]
- RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language ModelsSaeed Najafi, Alona Fyshe. 1447-1466 [doi]
- Modelling Commonsense Commonalities with Multi-Facet Concept EmbeddingsHanane Kteich, Na Li, Usashi Chatterjee, Zied Bouraoui, Steven Schockaert. 1467-1480 [doi]
- Revisiting Multimodal Transformers for Tabular Data with Text FieldsThomas Bonnier. 1481-1500 [doi]
- An Empirical Study on the Characteristics of Bias upon Context Length Variation for BanglaJayanta Sadhu, Ayan Antik Khan, Abhik Bhattacharjee, Rifat Shahriyar. 1501-1520 [doi]
- ConTempo: A Unified Temporally Contrastive Framework for Temporal Relation ExtractionJingcheng Niu, Saifei Liao, Victoria Ng, Simon de Montigny, Gerald Penn. 1521-1533 [doi]
- CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue SystemsAbbas Ghaddar, David Alfonso-Hermelo, Philippe Langlais, Mehdi Rezagholizadeh, Boxing Chen, Prasanna Parthasarathi. 1534-1551 [doi]
- CriticBench: Benchmarking LLMs for Critique-Correct ReasoningZicheng Lin, Zhibin Gou, Tian Liang, Ruilin Luo, Haowei Liu, Yujiu Yang. 1552-1587 [doi]
- DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language ModelsTaolin Zhang, Qizhou Chen, Dongyang Li, Chengyu Wang 0001, Xiaofeng He, Longtao Huang, Hui Xue', Jun Huang 0007. 1588-1602 [doi]
- Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects - A SurveyAshok Urlana, Pruthwik Mishra, Tathagato Roy, Rahul Mishra. 1603-1623 [doi]
- Benchmarking Large Language Models on Communicative Medical Coaching: A Dataset and a Novel SystemHengguan Huang, Songtao Wang, Hongfu Liu 0002, Hao Wang 0014, Ye Wang 0007. 1624-1637 [doi]
- Everything of Thoughts: Defying the Law of Penrose Triangle for Thought GenerationRuomeng Ding, Chaoyun Zhang, Lu Wang 0008, Yong Xu 0010, Minghua Ma, Wei Zhang 0056, Si-qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang 0001. 1638-1662 [doi]
- SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic ParsingHeidi C. Zhang, Sina J. Semnani, Farhad Ghassemi, Jialiang Xu, Shicheng Liu, Monica S. Lam. 1663-1678 [doi]
- Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and ChallengesBosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu 0001, Anh Tuan Luu, Shafiq Joty. 1679-1705 [doi]
- k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated TextAbe Bohan Hou, Jingyu Zhang, Yichen Wang, Daniel Khashabi, Tianxing He. 1706-1715 [doi]
- ColorSwap: A Color and Word Order Dataset for Multimodal EvaluationJirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush. 1716-1726 [doi]
- Revisiting OPRO: The Limitations of Small-Scale LLMs as OptimizersTuo Zhang, Jinyue Yuan, Salman Avestimehr. 1727-1735 [doi]
- CeeBERT: Cross-Domain Inference in Early Exit BERTDivya Jyoti Bajpai, Manjesh K. Hanawal. 1736-1748 [doi]
- UNIWIZ: A Unified Large Language Model Orchestrated Wizard for Safe Knowledge Grounded ConversationsSouvik Das, Rohini K. Srihari. 1749-1762 [doi]
- A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way ParallelismBrian Thompson 0001, Mehak Preet Dhaliwal, Peter Frisch, Tobias Domhan, Marcello Federico. 1763-1775 [doi]
- RankMean: Module-Level Importance Score for Merging Fine-tuned LLM ModelsGabriel Perin, Xuxi Chen, Shusen Liu 0001, Bhavya Kailkhura, Zhangyang Wang, Brian Gallagher. 1776-1782 [doi]
- VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language ModelsHaoyi Qiu, Wenbo Hu 0006, Zi-Yi Dou, Nanyun Peng. 1783-1805 [doi]
- Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language UnderstandingXuxin Cheng, Zhihong Zhu, Bang Yang, Xianwei Zhuang, Hongxiang Li, Yuexian Zou. 1806-1816 [doi]
- Towards Safer Large Language Models through Machine UnlearningZheyuan Liu 0010, Guangyao Dou, Zhaoxuan Tan, Yijun Tian 0001, Meng Jiang 0001. 1817-1829 [doi]
- The Impact of Reasoning Step Length on Large Language ModelsMingyu Jin, Qinkai Yu, Dong Shu, Haiyan Zhao, Wenyue Hua, Yanda Meng, Yongfeng Zhang, Mengnan Du. 1830-1842 [doi]
- Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and ForgetfulnessGuangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh, Bidhan Bashyal, Rongrong Wang, Kristen Johnson. 1843-1856 [doi]
- SKGSum: Structured Knowledge-Guided Document SummarizationQiqi Wang 0005, Ruofan Wang, Kaiqi Zhao 0001, Robert Amor, Benjamin Liu, Jiamou Liu, Xianda Zheng, Zijian Huang 0003. 1857-1871 [doi]
- Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and ApproachesShilin Zhou, Zhenghua Li, Chen Gong 0004, Lei Zhang, Yu Hong, Min Zhang 0005. 1872-1884 [doi]
- DEBATE: Devil's Advocate-Based Assessment and Text EvaluationAlex Kim, Keonwoo Kim, Sangwon Yoon. 1885-1897 [doi]
- Can Large Multimodal Models Uncover Deep Semantics Behind Images?Yixin Yang, Zheng Li, Qingxiu Dong, Heming Xia, Zhifang Sui. 1898-1912 [doi]
- Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction ParadigmQiang Gao, Zixiang Meng, Bobo Li, Jun Zhou, Fei Li 0021, Chong Teng, Donghong Ji. 1913-1927 [doi]
- A Graph per Persona: Reasoning about Subjective Natural Language DescriptionsEunJeong Hwang, Vered Shwartz, Dan Gutfreund, Veronika Thost. 1928-1942 [doi]
- MolTC: Towards Molecular Relational Modeling In Language ModelsJunfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang 0007, Zhiyuan Liu 0001, Sihang Li, Kun Wang, Wenjie Du, Xiang Wang 0010. 1943-1958 [doi]
- KPEval: Towards Fine-Grained Semantic-Based Keyphrase EvaluationDi Wu, Da Yin, Kai-Wei Chang. 1959-1981 [doi]
- Learning Low-dimensional Multi-domain Knowledge Graph Embedding via Dual Archimedean SpiralsJiang Li, Xiangdong Su, Fujun Zhang, Guanglai Gao. 1982-1994 [doi]
- LoRA Meets Dropout under a Unified FrameworkSheng Wang, Liheng Chen, Jiyue Jiang, Boyang Xue, Lingpeng Kong, Chuan Wu. 1995-2008 [doi]
- Enhancing Text-to-SQL Parsing through Question Rewriting and Execution-Guided RefinementWenxin Mao, Ruiqi Wang, Jiyu Guo, Jichuan Zeng, Cuiyun Gao, Peiyi Han, Chuanyi Liu. 2009-2024 [doi]
- The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language ModelsShuo Zhang, Liangming Pan, Junzhou Zhao, William Yang Wang. 2025-2038 [doi]
- ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language ModelsHaoran Luo, Haihong E, Zichen Tang, Shiyao Peng, Yikai Guo, Wentai Zhang 0004, Chenghao Ma, Guanting Dong, Meina Song, Wei Lin, Yifan Zhu, Anh Tuan Luu. 2039-2056 [doi]
- Achilles-Bench: A Challenging Benchmark for Low-Resource EvaluationYudong Wang, Chang Ma, Qingxiu Dong, Zhifang Sui, Lingpeng Kong, Jingjing Xu. 2057-2080 [doi]
- INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of RepairHanbin Wang, Zhenghao Liu, Shuo Wang, Ganqu Cui, Ning Ding 0002, Zhiyuan Liu 0001, Ge Yu 0001. 2081-2107 [doi]
- SocialBench: Sociality Evaluation of Role-Playing Conversational AgentsHongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Gao Xing, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang 0011, Fei Huang 0004. 2108-2126 [doi]
- From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based ApplicationsYongqiang Ma, Lizhi Qing, Jiawei Liu, Yangyang Kang, Yue Zhang, Wei Lu, Xiaozhong Liu, Qikai Cheng. 2127-2137 [doi]
- Context-Aware Tracking and Dynamic Introduction for Incomplete Utterance Rewriting in Extended Multi-Turn DialoguesXinnan Guo, Qian Zhu, Qiuhui Shi, Xuan Lin, Liubin Wang, DaqianLi DaqianLi, Yongrui Chen 0002. 2138-2148 [doi]
- EmotionQueen: A Benchmark for Evaluating Empathy of Large Language ModelsYuyan Chen, Songzhou Yan, Sijia Liu, Yueze Li, Yanghua Xiao. 2149-2176 [doi]
- Plum: Prompt Learning using MetaheuristicsRui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, Kashun Shum, Jipeng Zhang, Renjie Pi, Tong Zhang. 2177-2197 [doi]
- HOTVCOM: Generating Buzzworthy Comments for VideosYuyan Chen, Songzhou Yan, Qingpei Guo, Jiyuan Jia, Zhixu Li, Yanghua Xiao. 2198-2224 [doi]
- Do Large Language Models have Problem-Solving Capability under Incomplete Information Scenarios?Yuyan Chen, Yueze Li, Songzhou Yan, Sijia Liu, Jiaqing Liang, Yanghua Xiao. 2225-2238 [doi]
- Distilling Robustness into Natural Language Inference Models with Domain-Targeted AugmentationJoe Stacey, Marek Rei. 2239-2258 [doi]
- Into the Unknown: Generating Geospatial Descriptions for New EnvironmentsTzuf Paz-Argaman, John Palowitch, Sayali Kulkarni, Reut Tsarfaty, Jason Baldridge. 2259-2273 [doi]
- Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model PerformanceOmer Goldman, Avi Caciularu, Matan Eyal, Kris Cao, Idan Szpektor, Reut Tsarfaty. 2274-2286 [doi]
- Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine TranslationJungseob Lee, Hyeonseok Moon, Seungjun Lee, Chanjun Park, Sugyeong Eo, Hyunwoong Ko, Jaehyung Seo, Seungyoon Lee, HeuiSeok Lim. 2287-2303 [doi]
- Multilingual Instruction Tuning With Just a Pinch of MultilingualityUri Shaham 0002, Jonathan Herzig, Roee Aharoni, Idan Szpektor, Reut Tsarfaty, Matan Eyal. 2304-2317 [doi]
- M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge DistillationJianlyu Chen, Shitao Xiao, Peitian Zhang, Kun Luo, Defu Lian, Zheng Liu 0011. 2318-2335 [doi]
- Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler FeedbackZhangqian Bi, Yao Wan 0001, Zheng Wang, Hongyu Zhang 0002, Batu Guan, Fangxin Lu, Zili Zhang, Yulei Sui, Hai Jin 0001, Xuanhua Shi. 2336-2353 [doi]
- An Element is Worth a Thousand Words: Enhancing Legal Case Retrieval by Incorporating Legal ElementsChenlong Deng, Zhicheng Dou, Yujia Zhou 0002, Peitian Zhang, Kelong Mao. 2354-2365 [doi]
- SoMeLVLM: A Large Vision Language Model for Social Media ProcessingXinnong Zhang, Haoyu Kuang, Xinyi Mou, Hanjia Lyu, Kun Wu, Siming Chen 0001, Jiebo Luo, Xuanjing Huang 0001, Zhongyu Wei. 2366-2389 [doi]
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language ModelsJaehyung Seo, Jaewook Lee, Chanjun Park, Seongtae Hong, Seungjun Lee, HeuiSeok Lim. 2390-2415 [doi]
- NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language ModelsAmit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurélie C. Lozano, Payel Das, Georgios Kollias. 2416-2430 [doi]
- Ranking Large Language Models without Ground TruthAmit Dhurandhar, Rahul Nair, Moninder Singh, Elizabeth Daly, Karthikeyan Natesan Ramamurthy. 2431-2452 [doi]
- Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process FeedbackChengfeng Dou, Ying Zhang, Zhi Jin, Wenpin Jiao, Haiyan Zhao, Yongqiang Zhao, Zhengwei Tao. 2453-2473 [doi]
- LM-Cocktail: Resilient Tuning of Language Models via Model MergingShitao Xiao, Zheng Liu, Peitian Zhang, Xingrun Xing. 2474-2488 [doi]
- Episodic Memory Retrieval from LLMs: A Neuromorphic Mechanism to Generate Commonsense Counterfactuals for Relation ExtractionXin Miao, Yongqi Li 0002, Shen Zhou, Tieyun Qian. 2489-2511 [doi]
- SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 LanguagesNedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine de Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava 0001, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Muhie Winata, Seid Yimam, Saif M. Mohammad. 2512-2530 [doi]
- Alirector: Alignment-Enhanced Chinese Grammatical Error CorrectorHaihui Yang, Xiaojun Quan. 2531-2546 [doi]
- VISPool: Enhancing Transformer Encoders with Vector Visibility Graph Neural NetworksTuna Alikasifoglu, Arda C. Aras, Aykut Koç. 2547-2556 [doi]
- The Emotion Dynamics of Literary NovelsKrishnapriya Vishnubhotla, Adam Hammond, Graeme Hirst, Saif Mohammad. 2557-2574 [doi]
- Accurate and Nuanced Open-QA Evaluation Through Textual EntailmentPeiran Yao, Denilson Barbosa 0001. 2575-2587 [doi]
- Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource LanguagesAntonios Dimakis, Stella Markantonatou, Antonios Anastasopoulos. 2588-2595 [doi]
- LANS: A Layout-Aware Neural Solver for Plane Geometry ProblemZhongzhi Li, Ming-Liang Zhang 0005, Fei Yin, Cheng-Lin Liu 0001. 2596-2608 [doi]
- Knowledge Crosswords: Geometric Knowledge Reasoning with Large Language ModelsWenxuan Ding 0001, Shangbin Feng, Yuhan Liu, Zhaoxuan Tan, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov. 2609-2636 [doi]
- DELL: Generating Reactions and Explanations for LLM-Based Misinformation DetectionHerun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang 0008, Yulia Tsvetkov, Minnan Luo. 2637-2667 [doi]
- The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual ContextsLingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi. 2668-2680 [doi]
- Self-Specialization: Uncovering Latent Expertise within Large Language ModelsJunmo Kang, Hongyin Luo, Yada Zhu, Jacob A. Hansen, James R. Glass, David D. Cox, Alan Ritter, Rogério Feris, Leonid Karlinsky. 2681-2706 [doi]
- FUSE: Measure-Theoretic Compact Fuzzy Set Representation for Taxonomy ExpansionFred Xu, Song Jiang 0002, Zijie Huang 0002, Xiao Luo 0001, Shichang Zhang, Yuanzhou Chen, Yizhou Sun. 2707-2720 [doi]
- Chain of Logic: Rule-Based Reasoning with Large Language ModelsSergio Servantez, Joe Barrow, Kristian J. Hammond, Rajiv Jain. 2721-2733 [doi]
- Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form GenerationsCheng-Han Chiang, Hung-yi Lee. 2734-2751 [doi]
- Can You Learn Semantics Through Next-Word Prediction? The Case of EntailmentWilliam Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim, Tal Linzen. 2752-2773 [doi]
- Simulated Misinformation Susceptibility (SMISTS): Enhancing Misinformation Research with Large Language Model SimulationsWeicheng Ma, Chunyuan Deng, Aram Moossavi, Lili Wang, Soroush Vosoughi, Diyi Yang. 2774-2788 [doi]
- Social Intelligence Data Infrastructure: Structuring the Present and Navigating the FutureMinzhi Li, Weiyan Shi, Caleb Ziems, Diyi Yang. 2789-2805 [doi]
- Selective Prefix Tuning for Pre-trained Language ModelsHongyi Zhang, Zuchao Li, Ping Wang, Hai Zhao 0001. 2806-2813 [doi]
- MODABS: Multi-Objective Learning for Dynamic Aspect-Based SummarizationXiaobo Guo, Soroush Vosoughi. 2814-2827 [doi]
- Non-compositional Expression Generation and its Continual LearningJianing Zhou, Suma Bhat. 2828-2839 [doi]
- Medical Dialogue System: A Survey of Categories, Methods, Evaluation and ChallengesXiaoming Shi, Zeming Liu, Li Du, Yuxuan Wang 0001, Hongru Wang 0003, Yuhang Guo 0001, Tong Ruan, Jie Xu, Xiaofan Zhang 0002, Shaoting Zhang 0001. 2840-2861 [doi]
- Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge GraphsThi Nguyen, Linhao Luo, Fatemeh Shiri, Dinh Phung 0001, Yuan-Fang Li, Thuy-Trang Vu, Gholamreza Haffari. 2862-2883 [doi]
- Comprehensive Abstractive Comment Summarization with Dynamic Clustering and Chain of ThoughtLongyin Zhang, Bowei Zou, Jacintha Yi, AiTi Aw. 2884-2896 [doi]
- Self-Supervised Position Debiasing for Large Language ModelsZHongkun Liu, Zheng Chen, Mengqi Zhang, Zhaochun Ren, Pengjie Ren, Zhumin Chen. 2897-2917 [doi]
- HyperCL: A Contrastive Learning Framework for Hyper-Relational Knowledge Graph Embedding with Hierarchical OntologyYuhuan Lu, Weijian Yu, Xin Jing, Dingqi Yang. 2918-2929 [doi]
- Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology DetectionSongtao Liu, Bang Wang, Wei Xiang 0005, Han Xu 0003, Minghua Xu 0001. 2930-2942 [doi]
- Character-Level Chinese Dependency Parsing via Modeling Latent Intra-Word StructureYang Hou, Zhenghua Li. 2943-2956 [doi]
- AlignRE: An Encoding and Semantic Alignment Approach for Zero-Shot Relation ExtractionZehan Li, Fu Zhang, Jingwei Cheng. 2957-2966 [doi]
- Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax ReductionTingchen Fu, Deng Cai 0002, Lemao Liu, Shuming Shi 0001, Rui Yan 0001. 2967-2985 [doi]
- Efficient Knowledge Infusion via KG-LLM AlignmentZhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, Shuhan Luo, Zhiqiang Zhang. 2986-2999 [doi]
- Towards Precise Localization of Critical Errors in Machine TranslationDahyun Jung, Sugyeong Eo, HeuiSeok Lim. 3000-3012 [doi]
- LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-TuningMingyang Zhang 0007, Hao Chen 0041, Chunhua Shen, Zhen Yang 0009, Linlin Ou, Xinyi Yu, Bohan Zhuang. 3013-3026 [doi]
- Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control MechanismJiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai. 3027-3043 [doi]
- Towards Better Utilization of Multi-Reference Training Data for Chinese Grammatical Error CorrectionYumeng Liu, Zhenghua Li, Haochen Jiang, Bo Zhang 0071, Chen Li 0001, Ji Zhang 0011. 3044-3052 [doi]
- AgentTuning: Enabling Generalized Agent Abilities for LLMsAohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu 0036, Yuxiao Dong, Jie Tang 0001. 3053-3077 [doi]
- Transition-based Opinion Generation for Aspect-based Sentiment AnalysisTianlai Ma, Zhongqing Wang, Guodong Zhou. 3078-3087 [doi]
- Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word ExclusionXiaobao Wu, Xinshuai Dong, Liangming Pan, Thong Nguyen, Anh Tuan Luu. 3088-3105 [doi]
- A Chinese Dataset for Evaluating the Safeguards in Large Language ModelsYuxia Wang, Zenan Zhai, Haonan Li 0002, Xudong Han, Shom Lin, Zhenxuan Zhang, Angela Zhao, Preslav Nakov, Timothy Baldwin. 3106-3119 [doi]
- LLMFactor: Extracting Profitable Factors through Prompts for Explainable Stock Movement PredictionMeiyun Wang, Kiyoshi Izumi, Hiroki Sakaji. 3120-3131 [doi]
- You Only Look at Screens: Multimodal Chain-of-Action AgentsZhuosheng Zhang 0001, Aston Zhang. 3132-3149 [doi]
- SP³: Enhancing Structured Pruning via PCA ProjectionYuxuan Hu, Jing Zhang 0001, Zhe Zhao, Chen Zhao, Xiaodong Chen, Cuiping Li 0001, Hong Chen 0001. 3150-3170 [doi]
- GENDEX: Generative Data Augmentation Strategy Leveraging External Data for Abstractive Dialogue SummarizationSangwon Park, Hongseok Choi, Dongha Choi, Hyunju Lee. 3171-3185 [doi]
- Concept-Best-Matching: Evaluating Compositionality In Emergent CommunicationBoaz Carmeli, Yonatan Belinkov, Ron Meir. 3186-3194 [doi]
- A Tale of Two Revisions: Summarizing Changes Across Document VersionsT. Y. S. S. Santosh, Natwar Modani, Apoorv Saxena. 3195-3211 [doi]
- Refine, Align, and Aggregate: Multi-view Linguistic Features Enhancement for Aspect Sentiment Triplet ExtractionGuixin Su, Mingmin Wu, Zhongqiang Huang, Yongcheng Zhang, Tongguan Wang, Yuxue Hu, Ying Sha. 3212-3228 [doi]
- Pro-Woman, Anti-Man? Identifying Gender Bias in Stance DetectionYingjie Li, Yue Zhang 0004. 3229-3236 [doi]
- Likelihood-based Mitigation of Evaluation Bias in Large Language ModelsMasanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki. 3237-3245 [doi]
- The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language ModelsJiajia Li, Lu Yang, Mingni Tang, Chenchong Chenchong, Zuchao Li, Ping Wang, Hai Zhao 0001. 3246-3257 [doi]
- PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM InferenceDongjie Yang, Xiaodong Han, Yan Gao, Yao Hu, Shilin Zhang, Hai Zhao 0001. 3258-3270 [doi]
- From Role-Play to Drama-Interaction: An LLM SolutionWeiqi Wu, Hongqiu Wu, Lai Jiang, Xingyuan Liu, Hai Zhao 0001, Min Zhang 0005. 3271-3290 [doi]
- TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language ModelsJaewoo Ahn, Taehyun Lee, Junyoung Lim, Jin-Hwa Kim, Sangdoo Yun, Hwaran Lee, Gunhee Kim. 3291-3325 [doi]
- Red Teaming Visual Language ModelsMukai Li, Lei Li 0039, Yuwei Yin, Masood Ahmed, Zhenguang Liu, Qi Liu 0049. 3326-3342 [doi]
- Enhancing Semantic Consistency of Large Language Models through Model Editing: An Interpretability-Oriented ApproachJingyuan Yang 0008, Dapeng Chen, Yajing Sun, Rongjun Li, Zhiyong Feng, Wei Peng. 3343-3353 [doi]
- Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain EnvironmentsSangwoo Shin, Seunghyun Kim, Youngsoo Jang, Moontae Lee, Honguk Woo. 3354-3376 [doi]
- LIRE: listwise reward enhancement for preference alignmentMingye Zhu, Yi Liu, Lei Zhang 0119, Junbo Guo, Zhendong Mao. 3377-3394 [doi]
- See It All: Contextualized Late Aggregation for 3D Dense CaptioningMinjung Kim 0001, Hyung Lim, Seung Hwan Kim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim. 3395-3405 [doi]
- DARA: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge GraphsHaishuo Fang, Xiaodan Zhu, Iryna Gurevych. 3406-3432 [doi]
- GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM DeploymentYao Yao, Zuchao Li, Hai Zhao 0001. 3433-3446 [doi]
- Compositional Generalization with Grounded Language ModelsSondre Wold, Étienne Simon, Lucas Georges Gabriel Charpentier, Egor V. Kostylev, Erik Velldal, Lilja Øvrelid. 3447-3460 [doi]
- Rethinking Negative Instances for Generative Named Entity RecognitionYuyang Ding, Juntao Li, Pinzheng Wang, Zecheng Tang, Yan Bowen, Min Zhang. 3461-3475 [doi]
- WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge EditingChenhui Hu, Pengfei Cao, Yubo Chen 0001, Kang Liu 0001, Jun Zhao 0001. 3476-3503 [doi]
- DINER: Debiasing Aspect-based Sentiment Analysis with Multi-variable Causal InferenceJialong Wu 0007, Linhai Zhang, Deyu Zhou, Guoqiang Xu. 3504-3518 [doi]
- STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language ModelsLinhai Zhang, Jialong Wu 0007, Deyu Zhou, Guoqiang Xu. 3519-3532 [doi]
- How Much Does Nonverbal Communication Conform to Entropy Rate Constancy?: A Case Study on Listener Gaze in InteractionYu Wang, Yang Xu, Gabriel Skantze, Hendrik Buschmeier. 3533-3545 [doi]
- Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine TranslationXu Huang, Zhirui Zhang, Xiang Geng, Yichao Du, Jiajun Chen, Shujian Huang. 3546-3562 [doi]
- Chain-of-Verification Reduces Hallucination in Large Language ModelsShehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, Roberta Raileanu, Xian Li, Asli Celikyilmaz, Jason Weston. 3563-3578 [doi]
- Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement MethodTian Xia, Zhiwei He 0002, Tong Ren, Yibo Miao, Zhuosheng Zhang 0001, Yang Yang 0030, Rui Wang 0015. 3579-3602 [doi]
- DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code RepositoriesJia Li, Ge Li 0001, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yuqi Zhu, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li, Bin Gu, Mengfei Yang. 3603-3614 [doi]
- LPNL: Scalable Link Prediction with Large Language ModelsBaolong Bi, Shenghua Liu, Yiwei Wang, Lingrui Mei, Xueqi Cheng. 3615-3625 [doi]
- Aligning Speech Segments Beyond Pure SemanticsKevin Heffernan, Artyom Kozhevnikov, Loïc Barrault, Alexandre Mourachko, Holger Schwenk. 3626-3635 [doi]
- Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data PerspectivesThong Nguyen, Yi Bin, Junbin Xiao, Leigang Qu, Yicong Li 0004, Jay Zhangjie Wu, Cong-Duy Nguyen, See-Kiong Ng, Anh Tuan Luu. 3636-3657 [doi]
- Generative Input: Towards Next-Generation Input Methods ParadigmKeyu Ding, Yongcan Wang, Zihang Xu, Zhenzhen Jia, Enhong Chen. 3658-3669 [doi]
- A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy PotentialWei Tang 0015, Yixin Cao 0006, Jiahao Ying, Bo Wang, Yuyue Zhao, Yong Liao, Peng Zhou. 3670-3685 [doi]
- Functional Overlap Reranking for Neural Code GenerationHung To, Minh Nguyen, Nghi Bui. 3686-3704 [doi]
- Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM GamePengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, Peixin Cao, Nan Du, Xiaolong Li. 3705-3716 [doi]
- Pinpointing Diffusion Grid Noise to Enhance Aspect Sentiment Quad PredictionLinan Zhu, Xiangfan Chen, Xiaolei Guo, Chenwei Zhang, Zhechao Zhu, Zehai Zhou, Xiangjie Kong 0001. 3717-3726 [doi]
- Continual Contrastive Spoken Language UnderstandingUmberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj. 3727-3741 [doi]
- LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge GraphsKai Wang, Yuwei Xu, Zhiyong Wu, Siqiang Luo. 3742-3759 [doi]
- Unsupervised Parsing by Searching for Frequent Word Sequences among Sentences with Equivalent Predicate-Argument StructuresJunjie Chen, Xiangheng He, Danushka Bollegala, Yusuke Miyao. 3760-3772 [doi]
- Data-Centric Explainable Debiasing for Improving Fairness in Pre-trained Language ModelsYingji Li, Mengnan Du, Rui Song 0008, Xin Wang, Ying Wang. 3773-3786 [doi]
- Knowledge-Driven Cross-Document Relation ExtractionMonika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru. 3787-3797 [doi]
- Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought ReasoningWen Chang, Yun-Nung Chen. 3798-3812 [doi]
- KG-Adapter: Enabling Knowledge Graph Integration in Large Language Models through Parameter-Efficient Fine-TuningShiyu Tian, Yangyang Luo, Tianze Xu, Caixia Yuan, Huixing Jiang, Chen Wei, Xiaojie Wang 0006. 3813-3828 [doi]
- Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All ScenariosLei Lin, Jia-Yi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang 0006, Di Zhang, Kun Gai. 3829-3852 [doi]
- Evaluating LLMs' Mathematical Reasoning in Financial Document Question AnsweringPragya Srivastava, Manuj Malik, Vivek Gupta 0001, Tanuja Ganu, Dan Roth. 3853-3878 [doi]
- Improving In-Context Learning with Prediction Feedback for Sentiment AnalysisHongling Xu, Qianlong Wang, Yice Zhang, Min Yang 0007, Xi Zeng, Bing Qin 0001, Ruifeng Xu. 3879-3890 [doi]
- Can Large Language Models Mine Interpretable Financial Factors More Effectively? A Neural-Symbolic Factor Mining Agent ModelZhiwei Li 0006, Ran Song, Caihong Sun, Wei Xu 0008, Zhengtao Yu 0001, Ji-Rong Wen. 3891-3902 [doi]
- Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy ConstraintXiaowei Yuan, Zhao Yang, Yequan Wang, Shengping Liu, Jun Zhao, Kang Liu. 3903-3922 [doi]
- SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language ModelsLijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao 0001, Jing Shao. 3923-3954 [doi]
- Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text RepresentationPablo Messina, René Vidal, Denis Parra, Alvaro Soto, Vladimir Araujo. 3955-3986 [doi]
- GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural NetworkShuzhou Yuan, Ercong Nie, Michael Färber 0001, Helmut Schmid, Hinrich Schütze. 3987-4001 [doi]
- M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question AnsweringAnand Subramanian 0004, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler 0001. 4002-4042 [doi]
- MovieSum: An Abstractive Summarization Dataset for Movie ScreenplaysRohit Saxena, Frank Keller. 4043-4050 [doi]
- Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed RealityJiahuan Pei, Irene Viola 0001, Haochen Huang, Junxiao Wang, Moonisa Ahsan, Fanghua Ye 0001, Jiang Yiming, Yao Sai, Di Wang, Zhumin Chen, Pengjie Ren, Pablo César. 4051-4066 [doi]
- Perceptions of Language Technology Failures from South Asian English SpeakersFaye Holt, William Held, Diyi Yang. 4067-4081 [doi]
- A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning TaskJannik Brinkmann, Abhay Sheshadri, Victor Levoso, Paul Swoboda, Christian Bartelt. 4082-4102 [doi]
- Optimal Transport Guided Correlation Assignment for Multimodal Entity LinkingZefeng Zhang, Jiawei Sheng, Chuang Zhang, Liangyunzhi Liangyunzhi, Wenyuan Zhang 0002, Siqi Wang, Tingwen Liu. 4103-4117 [doi]
- On Efficiently Representing Regular Languages as RNNsAnej Svete, Robin Chan, Ryan Cotterell. 4118-4135 [doi]
- A Survey on Modelling Morality for Text AnalysisInes Reinig, Maria Becker, Ines Rehbein, Simone Paolo Ponzetto. 4136-4155 [doi]
- Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data SelectionRuibo Chen, Yihan Wu, Lichang Chen, Guodong Liu, Qi He, Tianyi Xiong, Chenxi Liu, Junfeng Guo, Heng Huang. 4156-4172 [doi]
- DebugBench: Evaluating Debugging Capability of Large Language ModelsRunchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Yinxu Pan, Yesai Wu, Haotian Hui, Weichuan Liu, Zhiyuan Liu 0001, Maosong Sun 0001. 4173-4198 [doi]
- POP-CEE: Position-oriented Prompt-tuning Model for Causal Emotion EntailmentZhihan Zhou 0003, Xue Gu, Yujie Zhao, Hao Xu 0012. 4199-4210 [doi]
- Context Length Extension via Generalized Extrapolation ScaleLinhan Li, Huaping Zhang. 4211-4218 [doi]
- Selectively Answering Visual QuestionsJulian Eisenschlos, Hernán Maina, Guido Ivetta, Luciana Benotti. 4219-4229 [doi]
- Wav2SQL: Direct Generalizable Speech-To-SQL ParsingHuadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao. 4230-4242 [doi]
- E2-LLM: Efficient and Extreme Length Extension of Large Language ModelsJiaheng Liu, ZhiqiBai ZhiqiBai, Yuanxing Zhang, Chenchen Zhang, YuangZh YuangZh, Ge Zhang, JiakaiWang JiakaiWang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng 0007. 4243-4253 [doi]
- Are Female Carpenters like Blue Bananas? A Corpus Investigation of Occupation Gender TypicalityDa Ju, Karen Ullrich, Adina Williams. 4254-4274 [doi]
- Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured EnvironmentsSitao Cheng, Ziyuan Zhuang, Yong Xu 0010, Fangkai Yang, Chaoyun Zhang, Xiaoting Qin, Xiang Huang, Ling Chen, Qingwei Lin, Dongmei Zhang 0001, Saravan Rajmohan, Qi Zhang. 4275-4295 [doi]
- Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian CourtsShubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya 0001. 4296-4315 [doi]
- RulE: Knowledge Graph Reasoning with Rule EmbeddingXiaojuan Tang, Song Chun Zhu, Yitao Liang, Muhan Zhang. 4316-4335 [doi]
- Multi-Objective Linguistic Control of Large Language ModelsDang Nguyen, Jiuhai Chen, Tianyi Zhou 0001. 4336-4347 [doi]
- Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMsShang Zhou, Feng Yao, Chengyu Dong, Zihan Wang 0001, Jingbo Shang. 4348-4362 [doi]
- Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex ScenariosShijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu 0007, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang 0002, Ruifeng Xu, Qun Liu 0001. 4363-4400 [doi]
- Do Androids Know They're Only Dreaming of Electric Sheep?Sky CH-Wang, Benjamin Van Durme, Jason Eisner, Chris Kedzie. 4401-4420 [doi]
- URG: A Unified Ranking and Generation Method for Ensembling Language ModelsBo Lv, Chen Tang, Yanan Zhang, Xin Liu, Ping Luo, Yue Yu. 4421-4434 [doi]
- Multi-Modal Retrieval For Large Language Model Based Speech RecognitionAditya Gourav, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Grant P. Strimel, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko. 4435-4446 [doi]
- LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the WildZiyu Zhao, Leilei Gan, Guoyin Wang 0002, Wangchunshu Zhou, Hongxia Yang, Kun Kuang, Fei Wu 0001. 4447-4462 [doi]
- ELAD: Explanation-Guided Large Language Models Active DistillationYifei Zhang 0006, Bo Pan, Chen Ling 0003, Yuntong Hu, Liang Zhao 0002. 4463-4475 [doi]
- Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQCarolin Holtermann, Paul Röttger, Timm Dill, Anne Lauscher. 4476-4494 [doi]
- Semantics or spelling? Probing contextual word embeddings with orthographic noiseJacob Matthews, John Starr, Marten Van Schijndel. 4495-4504 [doi]
- The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)Shenglai Zeng, Jiankun Zhang, Pengfei He, Yiding Liu, Yue Xing, Han Xu 0002, Jie Ren 0019, Yi Chang 0001, Shuaiqiang Wang, Dawei Yin, Jiliang Tang. 4505-4524 [doi]
- EmpathicStories++: A Multimodal Dataset for Empathy Towards Personal ExperiencesJocelyn Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Park. 4525-4536 [doi]
- MRL Parsing Without Tears: The Case of HebrewShaltiel Shmidman, Avi Shmidman, Moshe Koppel, Reut Tsarfaty. 4537-4550 [doi]
- SyntaxShap: Syntax-aware Explainability Method for Text GenerationKenza Amara, Rita Sevastjanova, Mennatallah El-Assady. 4551-4566 [doi]
- Automated Detection and Analysis of Data Practices Using A Real-World CorpusMukund Srinath, Pranav Narayanan Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson. 4567-4574 [doi]
- Enhancing Hyperbolic Knowledge Graph Embeddings via Lorentz TransformationsXiran Fan, Minghua Xu 0003, Huiyuan Chen, Yuzhong Chen, Mahashweta Das, Hao Yang 0007. 4575-4589 [doi]
- Tell Me What's Next: Textual Foresight for Generic UI RepresentationsAndrea Burns, Kate Saenko, Bryan A. Plummer. 4590-4611 [doi]
- Probing the Uniquely Identifiable Linguistic Patterns of Conversational AI AgentsIqra Zahid, Tharindu Madusanka, Riza Batista-Navarro, Youcheng Sun. 4612-4628 [doi]
- The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model PerformanceAbel Salinas, Fred Morstatter. 4629-4651 [doi]
- X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in ClassificationHanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin 0001. 4652-4665 [doi]
- SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text ClassificationDifan Jiao, Yilun Liu 0002, Zhenwei Tang, Daniel Matter, Jürgen Pfeffer, Ashton Anderson. 4666-4682 [doi]
- Decomposing Co-occurrence Matrices into Interpretable Components as Formal ConceptsAkihiro Maeda, Takuma Torii, Shohei Hidaka. 4683-4700 [doi]
- Two-Pronged Human Evaluation of ChatGPT Self-Correction in Radiology Report SimplificationZiyu Yang, Santhosh Cherian, Slobodan Vucetic. 4701-4714 [doi]
- Planning First, Question Second: An LLM-Guided Method for Controllable Question GenerationKunze Li, Yu Zhang. 4715-4729 [doi]
- RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-FeedbackYanming Liu, Xinyue Peng, Xuhong Zhang 0002, Weihao Liu, Jianwei Yin, Jiannan Cao, Tianyu Du. 4730-4749 [doi]
- MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking ModelDanupat Khamnuansin, Tawunrat Chalothorn, Ekapol Chuangsuwanich. 4750-4762 [doi]
- Chain-of-Question: A Progressive Question Decomposition Approach for Complex Knowledge Base Question AnsweringYixing Peng, Quan Wang 0002, Licheng Zhang, Yi Liu, Zhendong Mao. 4763-4776 [doi]
- Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment AnalysisGuangmin Zheng, Jin Wang, Liang-Chih Yu, Xuejie Zhang. 4777-4788 [doi]
- Unveiling the Truth and Facilitating Change: Towards Agent-based Large-scale Social Movement SimulationXinyi Mou, Zhongyu Wei, Xuanjing Huang. 4789-4809 [doi]
- Incorporating Syntax and Lexical Knowledge to Multilingual Sentiment Classification on Large Language ModelsHiroshi Kanayama, Yang Zhao, Ran Iwamoto, Takuya Ohko. 4810-4817 [doi]
- Locating and Extracting Relational Concepts in Large Language ModelsZijian Wang, Britney White, Chang Xu. 4818-4832 [doi]
- Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language ModelsMingda Li, Xinyu Li, Yifan Chen, Wenfeng Xuan, Weinan Zhang 0003. 4833-4850 [doi]
- SenticVec: Toward Robust and Human-Centric Neurosymbolic Sentiment AnalysisXulang Zhang, Rui Mao 0010, Erik Cambria. 4851-4863 [doi]
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language ModelsChen Qian 0006, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao 0001, Yong Liu 0007, Jing Shao. 4864-4888 [doi]
- Language Models can Evaluate Themselves via Probability DiscrepancyTingyu Xia, Bowen Yu 0002, Yuan Wu, Yi Chang, Chang Zhou. 4889-4901 [doi]
- Evaluating the Validity of Word-level Adversarial Attacks with Large Language ModelsHuichi Zhou, Zhaoyang Wang, Hongtao Wang 0002, Dongping Chen, Wenhan Mu, Fangyuan Zhang. 4902-4922 [doi]
- On the Language Encoder of Contrastive Cross-modal ModelsMengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya 0001, Hiromi Wakaki, Yuki Mitsufuji. 4923-4940 [doi]
- Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks WorldGuande Wu, Chen Zhao, Cláudio T. Silva, He He 0001. 4941-4957 [doi]
- Anchor-based Large Language ModelsJianhui Pang, Fanghua Ye 0001, Derek F. Wong, Xin He, Wanshun Chen, Longyue Wang. 4958-4976 [doi]
- MLeVLM: Improve Multi-level Progressive Capabilities based on Multimodal Large Language Model for Medical Visual Question AnsweringDexuan Xu, Yanyuan Chen, Jieyi Wang, Yue Huang, Hanpin Wang, Zhi Jin, Hongxing Wang, Weihua Yue, Jing He, Hang Li, Yu Huang. 4977-4997 [doi]
- Disentangling Length from Quality in Direct Preference OptimizationRyan Park, Rafael Rafailov, Stefano Ermon, Chelsea Finn. 4998-5017 [doi]
- MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge EditingJiaqi Li, Miaozeng Du, Chuanyi Zhang, Yongrui Chen 0002, Nan Hu, Guilin Qi, Haiyun Jiang, Siyuan Cheng 0008, Bozhong Tian. 5018-5029 [doi]
- Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise: A Case Study on Chinese Legal DomainZhen Wan, Yating Zhang, Yexiang Wang, Fei Cheng, Sadao Kurohashi. 5030-5041 [doi]
- MemeMQA: Multimodal Question Answering for Memes via Rationale-Based InferencingSiddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty 0002. 5042-5078 [doi]
- Improving Attributed Text Generation of Large Language Models via Preference LearningDongfang Li, Zetian Sun, Baotian Hu, Zhenyu Liu, Xinshuo Hu, Xuebo Liu 0002, Min Zhang 0005. 5079-5101 [doi]
- KOMBO: Korean Character Representations Based on the Combination Rules of SubcharactersSungho Kim, Juhyeong Park, Yeachan Kim, SangKeun Lee 0001. 5102-5119 [doi]
- Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic SupervisionRyo Yoshida, Taiga Someya, Yohei Oseki. 5120-5134 [doi]
- Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit CluesZhiyuan Chang, Mingyang Li, Yi Liu, Junjie Wang, Qing Wang, Yang Liu. 5135-5147 [doi]
- Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical NotesSunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, Yoon Bin Jung, Yohan Jo, Edward Choi. 5148-5168 [doi]
- Extending Context Window of Large Language Models via Semantic CompressionWeizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai 0001, Lei Deng, Wei Han 0004. 5169-5181 [doi]
- Plausible Extractive Rationalization through Semi-Supervised Entailment SignalWei Jie Yeo, Ranjan Satapathy, Erik Cambria. 5182-5192 [doi]
- Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question AnsweringChaeHun Park, Koanho Lee, Hyesu Lim, Jaeseok Kim, JunMo Park, Yu-Jung Heo, Du-Seong Chang, Jaegul Choo. 5193-5221 [doi]
- Scented-EAE: Stage-Customized Entity Type Embedding for Event Argument ExtractionYu Yang, Jinyu Guo, Kai Shuang, Chenrui Mao. 5222-5235 [doi]
- Fast Randomized Low-Rank Adaptation of Pre-trained Language Models with PAC RegularizationZijian Lei, Dong Qian, William Cheung. 5236-5249 [doi]
- SDA: Semantic Discrepancy Alignment for Text-conditioned Image RetrievalYuchen Yang, Yu Wang 0027, Yanfeng Wang. 5250-5261 [doi]
- Se²: Sequential Example Selection for In-Context LearningHaoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun 0015, Weiwei Deng, Furu Wei, Qi Zhang 0066. 5262-5284 [doi]
- Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct DecodingHanling Yi, Feng Lin 0009, Hongbin Li, Peiyang Ning, Xiaotian Yu, Rong Xiao. 5285-5299 [doi]
- StructEval: Deepen and Broaden Large Language Model Assessment via Structured EvaluationBoxi Cao, Mengjie Ren, Hongyu Lin, Xianpei Han, Feng Zhang, Junfeng Zhan, Le Sun 0001. 5300-5318 [doi]
- Mitigating Privacy Seesaw in Large Language Models: Augmented Privacy Neuron Editing via Activation PatchingXinwei Wu, Weilong Dong, Shaoyang Xu, Deyi Xiong. 5319-5332 [doi]
- Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information DecompositionLaura Mascarell, Yan L'Homme, Majed El Helou. 5333-5338 [doi]
- BadActs: A Universal Backdoor Defense in the Activation SpaceBiao Yi, Sishuo Chen, Yiming Li, Tong Li 0011, Baolei Zhang, Zheli Liu. 5339-5352 [doi]
- ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text PretrainingZhiyuan Liu, Yaorui Shi, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua. 5353-5377 [doi]
- Multi-modal Concept Alignment Pre-training for Generative Medical Visual Question AnsweringQuan Yan, Junwen Duan, Jianxin Wang 0001. 5378-5389 [doi]
- Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit TechniquesSiva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Pattisapu Priyatam, Anish Bhanushali, Prasanna Srinivasa Murthy. 5390-5404 [doi]
- Evaluating Large Language Models on Wikipedia-Style Survey GenerationFan Gao, Hang Jiang, Rui Yang, Qingcheng Zeng, Jinghui Lu, Moritz Blum, Tianwei She, Yuang Jiang, Irene Li. 5405-5418 [doi]
- The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models CollapseWanli Yang, Fei Sun 0001, Xinyu Ma, Xun Liu, Dawei Yin, Xueqi Cheng. 5419-5437 [doi]
- Can We Continually Edit Language Models? On the Knowledge Attenuation in Sequential Model EditingQi Li, Xiaowen Chu. 5438-5455 [doi]
- Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL GenerationGe Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma 0001, Reynold Cheng. 5456-5471 [doi]
- Translatotron-V(ison): An End-to-End Model for In-Image Machine TranslationZhibin Lan, Liqiang Niu, Fandong Meng, Jie Zhou 0016, Min Zhang 0005, Jinsong Su. 5472-5485 [doi]
- StatBot.Swiss: Bilingual Open Data Exploration in Natural LanguageFarhad Nooralahzadeh, Yi Zhang, Ellery Smith, Sabine Maennel, Cyril Matthey-Doret, Raphaël de Fondeville, Kurt Stockinger. 5486-5507 [doi]
- Subtle Signatures, Strong Shields: Advancing Robust and Imperceptible Watermarking in Large Language ModelsYubing Ren, Ping Guo, Yanan Cao, Wei Ma. 5508-5519 [doi]
- Thinking about how to extract: Energizing LLMs' emergence capabilities for document-level event argument extractionKai Shuang, Zhouji Zhouji, Qiwei Wang, Jinyu Guo. 5520-5532 [doi]
- Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative LearningShuzheng Si, Helan Hu, Haozhe Zhao, Shuang Zeng, Kaikai An, Zefan Cai, Baobao Chang. 5533-5546 [doi]
- Predicting Narratives of Climate Obstruction in Social Media AdvertisingHarri Rowlands, Gaku Morio, Dylan Tanner, Christopher D. Manning. 5547-5558 [doi]
- SSS: Editing Factual Knowledge in Language Models towards Semantic Sparse SpaceHuazheng Wang, Haifeng Sun 0001, Jingyu Wang 0001, Qi Qi 0001, Zixuan Xia, Menghao Zhang, Jianxin Liao. 5559-5570 [doi]
- GeoHard: Towards Measuring Class-wise Hardness through Modelling Class SemanticsFengyu Cai, Xinran Zhao, Hongming Zhang 0009, Iryna Gurevych, Heinz Koeppl. 5571-5597 [doi]
- Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language ModelsSheng-Lun Wei, Cheng-Kuang Wu, Hen-Hsen Huang, Hsin-Hsi Chen. 5598-5621 [doi]
- ArabicMMLU: Assessing Massive Multitask Language Understanding in ArabicFajri Koto, Haonan Li 0002, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin. 5622-5640 [doi]
- On the Relationship Between RNN Hidden-State Vectors and Semantic StructuresEdi Muskardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock. 5641-5658 [doi]
- XMC-Agent : Dynamic Navigation over Scalable Hierarchical Index for Incremental Extreme Multi-label ClassificationYanjiang Liu, Tianyun Zhong, Yaojie Lu 0001, Hongyu Lin, Ben He, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun 0001. 5659-5672 [doi]
- Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation DatasetJie Zhu, Junhui Li, Yalong Wen, Lifan Guo. 5673-5693 [doi]
- Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing ConstraintZhipeng Chen, Kun Zhou 0002, Xin Zhao 0018, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen. 5694-5711 [doi]
- Definition generation for lexical semantic change detectionMariia Fedorova, Andrey Kutuzov, Yves Scherrer. 5712-5724 [doi]
- MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot DetectorMarta R. Costa-Jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alexandre Mourachko, Christophe Ropers, Carleigh Wood. 5725-5734 [doi]
- Phased Instruction Fine-Tuning for Large Language ModelsWei Pang, Chuan Zhou 0013, Xiao-Hua Zhou, Xiaojie Wang. 5735-5748 [doi]
- TOREE: Evaluating Topic Relevance of Student Essays for Chinese Primary and Middle School EducationXinlin Zhuang, Hongyi Wu, Xinshu Shen, Peimin Yu, Gaowei Yi, Xinhao Chen, Tu Hu, Yang Chen, Yupei Ren, Yadong Zhang, Youqi Song, Binxuan Liu, Man Lan. 5749-5765 [doi]
- Predicting the Unpredictable: Uncertainty-Aware Reasoning over Temporal Knowledge Graphs via Diffusion ProcessYuxiang Cai, Qiao Liu 0003, Yanglei Gan, Changlin Li, Xueyi Liu, Run Lin, Da Luo, JiayeYang JiayeYang. 5766-5778 [doi]
- Asymmetric Bias in Text-to-Image Generation with Adversarial AttacksHaz Sameen Shahgir, Xianghao Kong, Greg Ver Steeg, Yue Dong. 5779-5796 [doi]
- Controlled Text Generation for Large Language Model with Dynamic Attribute GraphsXun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu Li, Feiyu Xiong, Bo Tang. 5797-5814 [doi]
- Coconut: Contextualized Commonsense Unified Transformers for Graph-Based Commonsense Augmentation of Language ModelsJun-Hyung Park, Mingyu Lee, Junho Kim, SangKeun Lee 0001. 5815-5830 [doi]
- Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledgeDaniel Mela, Aitor Gonzalez-Agirre, Javier Hernando, Marta Villegas. 5831-5847 [doi]
- BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical DomainsYanis Labrak, Adrien Bazoge, Emmanuel Morin, Pierre-Antoine Gourraud, Mickael Rouvier, Richard Dufour. 5848-5864 [doi]
- All Languages Matter: On the Multilingual Safety of LLMsWenxuan Wang 0001, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang 0001, Wenxiang Jiao, Michael R. Lyu. 5865-5877 [doi]
- LJPCheck: Functional Tests for Legal Judgment PredictionYuan Zhang, Wanhong Huang 0003, Yi Feng 0005, Chuanyi Li, Zhiwei Fei, JiDong Ge, Bin Luo 0003, Vincent Ng 0001. 5878-5894 [doi]
- CMDL: A Large-Scale Chinese Multi-Defendant Legal Judgment Prediction DatasetWanhong Huang 0003, Yi Feng 0005, Chuanyi Li, Honghan Wu, JiDong Ge, Vincent Ng 0001. 5895-5906 [doi]
- Model Editing by Standard Fine-TuningGovind Krishnan Gangadhar, Karl Stratos. 5907-5913 [doi]
- Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical ReasoningQiming Bao 0001, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gaël Gendron, Timothy Pistotti, Neset Tan, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny 0001, Michael Witbrock, Jiamou Liu. 5914-5934 [doi]
- CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack OverflowNathanaël Beau, Benoît Crabbé. 5935-5947 [doi]
- ViHateT5: Enhancing Hate Speech Detection in Vietnamese With a Unified Text-to-Text Transformer ModelLuan Thanh Nguyen. 5948-5961 [doi]
- Bias in News Summarization: Measures, Pitfalls and CorporaJulius Steen, Katja Markert. 5962-5983 [doi]
- When to Trust LLMs: Aligning Confidence with Response QualityShuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun 0001, Jinyang Gao, Huawei Shen, Bolin Ding. 5984-5996 [doi]
- Zero-shot Cross-lingual Alignment for Embedding InitializationXi Ai, Zhiyong Huang. 5997-6007 [doi]
- Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD)Avshalom Manevich, Reut Tsarfaty. 6008-6022 [doi]
- It takes two to borrow: a donor and a recipient. Who's who?Liviu P. Dinu, Ana Sabina Uban, Anca Dinu, Ioan-Bogdan Iordache, Simona Georgescu, Laurentiu Zoicas. 6023-6035 [doi]
- Advancing Post-OCR Correction: A Comparative Study of Synthetic DataShuhao Guan, Derek Greene. 6036-6047 [doi]
- GeoAgent: To Empower LLMs using Geospatial Tools for Address StandardizationChenghua Huang, Shisong Chen, Zhixu Li, Jianfeng Qu, Yanghua Xiao, Jiaxin Liu, Zhigang Chen 0003. 6048-6063 [doi]
- HQP: A Human-Annotated Dataset for Detecting Online PropagandaAbdurahman Maarouf, Dominik Bär, Dominique Geissler, Stefan Feuerriegel. 6064-6089 [doi]
- Teaching Language Models to Self-Improve by Learning from Language FeedbackChi Hu, Yimin Hu, Hang Cao, Tong Xiao, Jingbo Zhu. 6090-6101 [doi]
- Exploring Spatial Schema Intuitions in Large Language and Vision ModelsPhilipp Wicke, Lennart Wachowiak. 6102-6117 [doi]
- Efficient Detection of LLM-generated Texts with a Bayesian Surrogate ModelYibo Miao, Hongcheng Gao, Hao Zhang, Zhijie Deng. 6118-6130 [doi]
- Decoding the Narratives: Analyzing Personal Drug Experiences Shared on RedditLayla Bouzoubaa, Elham Aghakhani, Max Song, Quang Trinh, Rezvaneh (Shadi) Rezapour. 6131-6148 [doi]
- Unveiling the Art of Heading Design: A Harmonious Blend of Summarization, Neology, and AlgorithmShaobo Cui 0006, Yiyang Feng, Yisong Mao, Yifan Hou, Boi Faltings. 6149-6174 [doi]
- Understanding Fine-grained Distortions in Reports of Scientific FindingsAmelie Wührl, Dustin Wright 0001, Roman Klinger, Isabelle Augenstein. 6175-6191 [doi]
- MM-SOC: Benchmarking Multimodal Large Language Models in Social Media PlatformsYiqiao Jin, Minje Choi, Gaurav Verma, Jindong Wang 0001, Srijan Kumar. 6192-6210 [doi]
- Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot PerformanceSaurabh Srivastava, Chengyue Huang, Weiguo Fan, Ziyu Yao. 6211-6232 [doi]
- Benchmarking Retrieval-Augmented Generation for MedicineGuangzhi Xiong, Qiao Jin 0001, Zhiyong Lu, Aidong Zhang. 6233-6251 [doi]
- ChatMusician: Understanding and Generating Music Intrinsically with LLMRuibin Yuan, Hanfeng Lin, Yi Wang 0033, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Liumeng Xue, Ziyang Ma, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Jie Fu, Emmanouil Benetos, Gus Xia, Roger B. Dannenberg, Wei Xue, Shiyin Kang, Yike Guo. 6252-6271 [doi]
- Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction TuningQingyu Tan, Hwee Tou Ng, Lidong Bing. 6272-6286 [doi]
- Mind Your Format: Towards Consistent Evaluation of In-Context Learning ImprovementsAnton Voronov, Lena Wolf, Max Ryabinin. 6287-6310 [doi]
- Knowledge Graph-Enhanced Large Language Models via Path SelectionHaochen Liu, Song Wang, Yaochen Zhu, Yushun Dong, Jundong Li. 6311-6321 [doi]
- OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors DetectionChenyang Huang 0001, Abbas Ghaddar, Ivan Kobyzev, Mehdi Rezagholizadeh, Osmar Zaïane, Boxing Chen. 6322-6334 [doi]
- ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction Based on Large Language ModelXuanqing Yu, Wangtao Sun, Jingwei Li, Kang Liu, Chengbao Liu, Jie Tan. 6335-6350 [doi]
- Speech-based Slot Filling using Large Language ModelsGuangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang 0031, Milica Gasic, Philip C. Woodland. 6351-6362 [doi]
- Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic AnomaliesChangye Li, Zhecheng Sheng, Trevor Cohen, Serguei Pakhomov. 6363-6377 [doi]
- HeSum: a Novel Dataset for Abstractive Text Summarization in HebrewTzuf Paz-Argaman, Itai Mondshine, Asaf Achi Mordechai, Reut Tsarfaty. 6378-6388 [doi]
- TRAM: Benchmarking Temporal Reasoning for Large Language ModelsYuqing Wang, Yun Zhao 0001. 6389-6415 [doi]
- Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language ModelsAlfonso Amayuelas, Kyle Wong, Liangming Pan, Wenhu Chen, William Yang Wang. 6416-6432 [doi]
- Exploring Defeasibility in Causal ReasoningShaobo Cui 0006, Lazar Milikic, Yiyang Feng, Mete Ismayilzada, Debjit Paul, Antoine Bosselut, Boi Faltings. 6433-6452 [doi]
- Better Synthetic Data by Retrieving and Transforming Existing DatasetsSaumya Gandhi, Ritu Gala, Vijay Viswanathan 0002, Tongshuang Wu, Graham Neubig. 6453-6466 [doi]
- Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language ModelsYanzheng Xiang, Hanqi Yan, Lin Gui 0003, Yulan He 0001. 6467-6481 [doi]
- Perspective Taking through Generating Responses to Conflict SituationsJoan Plepi, Charles Welch, Lucie Flek. 6482-6497 [doi]
- LLM2LLM: Boosting LLMs with Novel Iterative Data EnhancementNicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami. 6498-6526 [doi]
- The Power of Summary-Source AlignmentsOri Ernst, Ori Shapira, Aviv Slobodkin, Sharon Adar, Mohit Bansal, Jacob Goldberger, Ran Levy 0001, Ido Dagan. 6527-6548 [doi]
- An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language ModelsGantavya Bhatt, Yifang Chen, Arnav Mohanty Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeff A. Bilmes, Simon S. Du, Kevin G. Jamieson, Jordan T. Ash, Robert D. Nowak. 6549-6560 [doi]
- Learning Multimodal Contrast with Cross-modal Memory and Reinforced Contrast RecognitionYuanhe Tian, Fei Xia, Yan Song. 6561-6573 [doi]
- Text Simplification via Adaptive TeachingSeyed Ali Bahrainian, Jonathan Dou, Carsten Eickhoff. 6574-6584 [doi]
- A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical textsGokcen Gokceoglu, Devrim Cavusoglu, Emre Akbas, Özen Nergis Dolcerocca. 6585-6596 [doi]
- It is Simple Sometimes: A Study On Improving Aspect-Based Sentiment Analysis PerformanceLaura Cabello, Uchenna Akujuobi. 6597-6610 [doi]
- Whose Emotions and Moral Sentiments do Language Models Reflect?Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman. 6611-6631 [doi]
- LLM can Achieve Self-Regulation via Hyperparameter Aware GenerationSiyin Wang, Shimin Li, Tianxiang Sun, JinLan Fu, Qinyuan Cheng, Jiasheng Ye, Junjie Ye, Xipeng Qiu, Xuanjing Huang 0001. 6632-6646 [doi]
- Forward-Backward Reasoning in Large Language Models for Mathematical VerificationWeisen Jiang, Han Shi, Longhui Yu, Zhengying Liu, Yu Zhang 0006, Zhenguo Li, James T. Kwok. 6647-6661 [doi]
- Towards Uncertainty-Aware Language AgentJiuzhou Han, Wray L. Buntine, Ehsan Shareghi. 6662-6685 [doi]
- Detection and Positive Reconstruction of Cognitive Distortion Sentences: Mandarin Dataset and EvaluationShuya Lin, Yuxiong Wang, Jonathan Dong, Shiguang Ni. 6686-6701 [doi]
- PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMsJiuzhou Han, Nigel Collier, Wray L. Buntine, Ehsan Shareghi. 6702-6718 [doi]
- Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language ModelsYifu Gao, Linbo Qiao, Zhigang Kan, Zhihua Wen, Yongquan He, Dongsheng Li 0001. 6719-6734 [doi]
- VISREAS: Complex Visual Reasoning with Unanswerable QuestionsSyeda Nahida Akter, Sangwu Lee, Yingshan Chang, Yonatan Bisk, Eric Nyberg. 6735-6752 [doi]
- A Unified Generative Framework for Bilingual Euphemism Detection and IdentificationYuxue Hu, Junsong Li, Tongguan Wang, Dongyu Su, Guixin Su, Ying Sha. 6753-6766 [doi]
- StyleDubber: Towards Multi-Scale Style Learning for Movie DubbingGaoxiang Cong, Yuankai Qi, Liang Li 0003, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang 0001, Chenggang Yan 0001, Qingming Huang. 6767-6779 [doi]
- ETAS: Zero-Shot Transformer Architecture Search via Network Trainability and ExpressivityJiechao Yang, Yong Liu. 6780-6795 [doi]
- Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process AlignmentKaishuai Xu, Yi Cheng, Wenjun Hou, Qiaoyu Tan, Wenjie Li. 6796-6814 [doi]
- ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language ModelsYanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, ZhiqiBai ZhiqiBai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng 0007. 6815-6839 [doi]
- REInstruct: Building Instruction Data from Unlabeled CorpusShu Chen, Xinyan Guan, Yaojie Lu 0001, Hongyu Lin, Xianpei Han, Le Sun 0001. 6840-6856 [doi]
- Learning to Maximize Mutual Information for Chain-of-Thought DistillationXin Chen, Hanxian Huang, Yanjun Gao, Yi Wang 0031, Jishen Zhao, Ke Ding. 6857-6868 [doi]
- PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer LearningZhisheng Lin, Han Fu, Chenghao Liu, Zhuo Li, Jianling Sun. 6869-6883 [doi]
- MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics BenchmarkHongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen 0026. 6884-6915 [doi]
- Identifying Semantic Induction Heads to Understand In-Context LearningJie Ren 0018, Qipeng Guo, Hang Yan 0001, Dongrui Liu, Quanshi Zhang, Xipeng Qiu, Dahua Lin. 6916-6932 [doi]
- Chinese Spelling Corrector Is Just a Language LearnerLai Jiang, Hongqiu Wu, Hai Zhao 0001, Min Zhang 0005. 6933-6943 [doi]
- Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language ModelsJunfei Wu, Qiang Liu 0006, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang 0001, Tieniu Tan. 6944-6962 [doi]
- RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question AnsweringZihan Zhang, Meng Fang, Ling Chen. 6963-6975 [doi]
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language ModelsXi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura. 6976-6987 [doi]
- Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data AugmentationMing Gu, Yan Yang 0008. 6988-7005 [doi]
- DMoERM: Recipes of Mixture-of-Experts for Effective Reward ModelingShanghaoran Quan. 7006-7028 [doi]
- LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data AugmentationIkuya Yamada, Ryokan Ri. 7029-7039 [doi]
- Comments as Natural Logic Pivots: Improve Code Generation via Comment PerspectiveYijie Chen, Yijin Liu, Fandong Meng, Yufeng Chen 0005, Jinan Xu, Jie Zhou 0016. 7040-7051 [doi]
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents IntegrationSunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu 0001, Ji-Rong Wen. 7052-7074 [doi]
- Continual Dialogue State Tracking via Reason-of-Select DistillationYujie Feng, Bo Liu 0049, Xiaoyu Dong, Zexin Lu, Li-Ming Zhan, Xiao-Ming Wu 0003, Albert Y. S. Lam. 7075-7087 [doi]
- Spotting AI's Touch: Identifying LLM-Paraphrased Spans in TextYafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi 0001, Yue Zhang 0004. 7088-7107 [doi]
- SoFA: Shielded On-the-fly Alignment via Priority Rule FollowingXinyu Lu, Bowen Yu 0002, Yaojie Lu 0001, Hongyu Lin, Haiyang Yu, Le Sun 0001, Xianpei Han, Yongbin Li. 7108-7136 [doi]
- Do Zombies Understand? A Choose-Your-Own-Adventure Exploration of Machine CognitionAriel Goldstein, Gabriel Stanovsky. 7137-7143 [doi]
- Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised LearningLukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn W. Schuller. 7144-7159 [doi]
- RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated AdapterMeng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang 0001, Ruyang Liu, Long Chen 0016, Xiaodan Liang, Li Yuan, Ge Li 0002. 7160-7174 [doi]
- Benchmarking and Improving Long-Text Translation with Large Language ModelsLongyue Wang, Zefeng Du, Wenxiang Jiao, Chenyang Lyu, Jianhui Pang, Leyang Cui, Kaiqiang Song, Derek F. Wong, Shuming Shi 0001, Zhaopeng Tu. 7175-7187 [doi]
- Personalized Topic Selection Model for Topic-Grounded DialogueShixuan Fan, Wei Wei 0002, Xiaofei Wen, Xian-Ling Mao, Jixiong Chen, Dangyang Chen. 7188-7202 [doi]
- Debiasing In-Context Learning by Instructing LLMs How to Follow DemonstrationsLvxue Li, Jiaqi Chen, Xinyu Lu, Yaojie Lu 0001, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun 0001. 7203-7215 [doi]
- Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog SystemsChristos Vlachos, Themos Stafylakis, Ion Androutsopoulos. 7216-7240 [doi]
- MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language ProductionJian Ma, Wenguan Wang, Yi Yang 0001, Feng Zheng. 7241-7254 [doi]
- BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language ModelsXueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong. 7255-7279 [doi]
- PartialFormer: Modeling Part Instead of Whole for Machine TranslationTong Zheng, Bei Li, Huiwen Bao, Jiale Wang, Weiqiao Shan, Tong Xiao, Jingbo Zhu. 7280-7294 [doi]
- Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign StrategyJieyong Kim, Ryang Heo, Yongsik Seo, SeongKu Kang, Jinyoung Yeo, Dongha Lee. 7295-7303 [doi]
- PACE: Improving Prompt with Actor-Critic Editing for Large Language ModelYihong Dong, Kangcheng Luo, Xue Jiang, Zhi Jin, Ge Li. 7304-7323 [doi]
- Penetrative AI: Making LLMs Comprehend the Physical WorldHuatao Xu, Liying Han, Qirui Yang, Mo Li 0001, Mani B. Srivastava. 7324-7341 [doi]
- The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional AnalysisMiaoran Zhang, Vagrant Gautam, Mingyang Wang, Jesujoba Alabi, Xiaoyu Shen 0001, Dietrich Klakow, Marius Mosbach. 7342-7371 [doi]
- Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell CheckingMing Dong, Yujing Chen, Miao Zhang, Hao Sun, Tingting He. 7372-7383 [doi]
- An Empirical Study of In-context Learning in LLMs for Machine TranslationPranjal A. Chitale, Jay P. Gala, Raj Dabre. 7384-7406 [doi]
- "My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language ModelsXinpeng Wang 0003, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank. 7407-7416 [doi]
- ODA: Observation-Driven Agent for integrating LLMs and Knowledge GraphsLei Sun, Zhengwei Tao, Youdi Li, Hiroshi Arakawa. 7417-7431 [doi]
- A Comprehensive Study of Jailbreak Attack versus Defense for Large Language ModelsZihao Xu, Yi Liu, Gelei Deng, Yuekang Li, Stjepan Picek. 7432-7449 [doi]
- A Data-Driven Guided Decoding Mechanism for Diagnostic CaptioningPanagiotis Kaliosis, John Pavlopoulos, Foivos Charalampakos, Georgios Moschovis, Ion Androutsopoulos. 7450-7466 [doi]
- Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language ModelHengyuan Zhang, Yanru Wu, Dawei Li, Sak Yang, Rui Zhao, Yong Jiang, Fei Tan. 7467-7509 [doi]
- A Two-Agent Game for Zero-shot Relation Triplet ExtractionTing Xu 0003, Haiqin Yang, Fei Zhao, Zhen Wu, Xinyu Dai. 7510-7527 [doi]
- Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early PruningNaibin Gu, Peng Fu 0008, Xiyu Liu 0003, Bowen Shen, Zheng Lin 0001, Weiping Wang 0005. 7528-7541 [doi]
- Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into GermanManuel Lardelli, Giuseppe Attanasio, Anne Lauscher. 7542-7550 [doi]
- Prompt Chaining or Stepwise Prompt? Refinement in Text SummarizationShichao Sun, Ruifeng Yuan, Ziqiang Cao, Wenjie Li 0002, Pengfei Liu 0003. 7551-7558 [doi]
- Trust in Internal or External Knowledge? Generative Multi-Modal Entity Linking with Knowledge RetrieverXinwei Long, Jiali Zeng, Fandong Meng, Jie Zhou, Bowen Zhou. 7559-7569 [doi]
- A Semantic Distance Metric Learning approach for Lexical Semantic Change DetectionTaichi Aida, Danushka Bollegala. 7570-7584 [doi]
- What Have We Achieved on Non-autoregressive Translation?Yafu Li, Huajian Zhang, Jianhao Yan, Yongjing Yin, Yue Zhang 0004. 7585-7606 [doi]
- From Zero to Hero: Cold-Start Anomaly DetectionTal Reiss, George Kour, Naama Zwerdling, Ateret Anaby-Tavor, Yedid Hoshen. 7607-7617 [doi]
- Large Language Models Fall Short: Understanding Complex Relationships in Detective NarrativesRuncong Zhao, Qinglin Zhu, Hainiu Xu, Jiazheng Li 0002, Yuxiang Zhou, Yulan He 0001, Lin Gui 0003. 7618-7638 [doi]
- DistillMIKE: Editing Distillation of Massive In-Context Knowledge Editing in Large Language ModelsShanbao Qiao, Xuebing Liu, Seung-Hoon Na. 7639-7654 [doi]
- Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative DecodingHeming Xia, Zhe Yang, Qingxiu Dong, Peiyi Wang, Yongqi Li 0001, Tao Ge 0001, Tianyu Liu 0001, Wenjie Li 0002, Zhifang Sui. 7655-7671 [doi]
- Hierarchy-aware Biased Bound Margin Loss Function for Hierarchical Text ClassificationGibaeg Kim, Sanghun Im, Heung-Seon Oh. 7672-7682 [doi]
- Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized ContextsZhuo Chen, Xinyu Wang 0013, Yong Jiang 0001, Pengjun Xie, Fei Huang 0004, Kewei Tu. 7683-7694 [doi]
- CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk ClassificationKorbinian Randl, John Pavlopoulos, Aron Henriksson, Tony Lindgren. 7695-7715 [doi]
- IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens IntactRuikang Liu, Haoli Bai, Haokun Lin, Yuening Li, Han Gao, Zhengzhuo Xu, Lu Hou, Jun Yao, Chun Yuan. 7716-7741 [doi]
- Learning Adverbs with Spectral Mixture KernelsTomoe Taniguchi, Daichi Mochihashi, Ichiro Kobayashi. 7742-7752 [doi]
- E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language ModelsJinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu 0001, Ruifeng Xu, Shiwen Ni, Min Yang 0007. 7753-7774 [doi]
- ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction TuningFanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo. 7775-7803 [doi]
- Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question AnsweringXiang Li, Shizhu He, Fangyu Lei, JunYang JunYang, Tianhuang Su, Kang Liu 0001, Jun Zhao 0001. 7804-7816 [doi]
- ALaRM: Align Language Models via Hierarchical Rewards ModelingYuhang Lai, Siyuan Wang, Shujun Liu, Xuanjing Huang, Zhongyu Wei. 7817-7831 [doi]
- LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term PromptingHaoxin Liu, Zhiyuan Zhao, Jindong Wang, Harshavardhan Kamarthi, B. Aditya Prakash. 7832-7840 [doi]
- Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language ModelsZhenyi Lu, Jie Tian, Wei Wei 0002, Xiaoye Qu, Yu Cheng 0001, Wenfeng Xie, Dangyang Chen. 7841-7864 [doi]
- UOR: Universal Backdoor Attacks on Pre-trained Language ModelsWei Du, Peixuan Li, Haodong Zhao, Tianjie Ju, Ge Ren, Gongshen Liu. 7865-7877 [doi]
- Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differencesPatrick Haller 0001, Lena S. Bolliger, Lena Ann Jäger. 7878-7892 [doi]
- The State of Relation Extraction Data Quality: Is Bigger Always Better?Erica Cai, Brendan T. O'Connor 0001. 7893-7906 [doi]
- NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User QueriesShudan Zhang, Hanlin Zhao, Xiao Liu 0036, Qinkai Zheng, Zehan Qi, Xiaotao Gu, Yuxiao Dong, Jie Tang. 7907-7928 [doi]
- LLMCrit: Teaching Large Language Models to Use CriteriaWeizhe Yuan, Pengfei Liu 0003, Matthias Gallé. 7929-7960 [doi]
- Empowering cross-lingual abilities of instruction-tuned large language models by translation-following demonstrationsLeonardo Ranaldi, Giulia Pucci