- Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
- FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models. Zhuohao Yu 0001, Chang Gao, Wenjin Yao, Yidong Wang 0003, Zhengran Zeng, Wei Ye 0004, Jindong Wang 0001, Yue Zhang 0004, Shikun Zhang. 1-13
- i-Code Studio: A Configurable and Composable Framework for Integrative AI. Yuwei Fang, Mahmoud Khademi, Chenguang Zhu 0001, Ziyi Yang 0011, Reid Pryzant, Yichong Xu, Yao Qian, Takuya Yoshioka, Lu Yuan, Michael Zeng 0001, Xuedong Huang 0001. 14-24
- Evalverse: Unified and Accessible Library for Large Language Model Evaluation. Jihoo Kim, Wonho Song, Dahyun Kim 0001, Yunsu Kim 0004, Yungi Kim, Chanjun Park. 25-33
- Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion. Xinping Zhao, Jindi Yu, Zhenyu Liu, Jifang Wang, Dongfang Li 0002, Yibin Chen, Baotian Hu, Min Zhang 0005. 34-45
- OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents. Qiang Sun 0006, Yuanyi Luo, Sirui Li, Wenxiao Zhang, Wei Liu 0006. 46-52
- Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection. Taichi Nishimura, Shota Nakada, Hokuto Munakata, Tatsuya Komatsu. 53-60
- MarkLLM: An Open-Source Toolkit for LLM Watermarking. Leyi Pan, Aiwei Liu, Zhiwei He 0002, Zitian Gao, Xuandong Zhao, Yijian Lu, Binglin Zhou, Shuliang Liu, Xuming Hu, Lijie Wen 0001, Irwin King, Philip S. Yu. 61-71
- AUTOGEN STUDIO: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems. Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang 0001, Saleema Amershi. 72-79
- TinyAgent: Function Calling at the Edge. Lutfi Eren Erdogan, Nicholas Lee, Siddharth Jha, Sehoon Kim 0001, Ryan Tabrizi, Suhong Moon, Coleman Richard Charles Hooper, Gopala Anumanchipalli, Kurt Keutzer, Amir Gholami. 80-88
- TruthReader: Towards Trustworthy Document Assistant Chatbot with Reliable Attribution. Dongfang Li 0002, Xinshuo Hu, Zetian Sun, Baotian Hu, Shaolin Ye, Zifei Shan, Qian Chen 0003, Min Zhang 0005. 89-100
- Commentator: A Code-mixed Multilingual Text Annotation Framework. Rajvee Sheth, Shubh Nisar, Heenaben Prajapati, Himanshu Beniwal, Mayank Singh 0001. 101-109
- Integrating INCEpTION into larger annotation processes. Richard Eckart de Castilho, Jan-Christoph Klie, Iryna Gurevych. 110-121
- Arxiv Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance. Guanyu Lin, Tao Feng, Pengrui Han, Ge Liu, Jiaxuan You. 122-130
- TransAgents: Build Your Translation Company with Language Agents. Minghao Wu, Jiahao Xu, Longyue Wang. 131-141
- Monitoring Hate Speech in Indonesia: An NLP-based Classification of Social Media Texts. Musa Izzanardi Wijanarko, Lucky Susanto, Prasetia Anugrah Pratama, Ika Karlina Idris, Traci Hong, Derry Tanti Wijaya. 142-152
- CAVA: A Tool for Cultural Alignment Visualization & Analysis. Nevan Giuliani, Cheng Charles Ma, Prakruthi Pradeep, Daphne Ippolito. 153-161
- ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems. Andrew Zhu, Liam Dugan, Chris Callison-Burch. 162-171
- BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis. Shuhang Lin, Wenyue Hua, Lingyao Li, Che-Jui Chang, Lizhou Fan, Jianchao Ji, Hang Hua, Mingyu Jin, Jiebo Luo 0001, Yongfeng Zhang 0003. 172-181
- sign.mt: Real-Time Multilingual Sign Language Translation Application. Amit Moryossef. 182-186
- WebOlympus: An Open Platform for Web Agents on Live Websites. Boyuan Zheng, Boyu Gou, Scott Salisbury, Zheng Du, Huan Sun 0001, Yu Su 0001. 187-197
- TAIL: A Toolkit for Automatic and Realistic Long-Context Large Language Model Evaluation. Gefei Gu, Yilun Zhao 0001, Ruoxi Ning, Yanan Zheng, Arman Cohan. 198-208
- OpenResearcher: Unleashing AI for Accelerated Scientific Research. Yuxiang Zheng, Shichao Sun, Lin Qiu, Dongyu Ru, Cheng Jiayang, Xuefeng Li 0003, Jifan Lin, Binjie Wang, Yun Luo, Renjie Pan 0001, Yang Xu 0010, Qingkai Min, Zizhao Zhang, Yiwen Wang, Wenjie Li 0002, Pengfei Liu 0003. 209-218
- OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs. Hasan Iqbal, Yuxia Wang 0003, Minghan Wang, Georgi Nenkov Georgiev, Jiahui Geng, Iryna Gurevych, Preslav Nakov. 219-229
- ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning. Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Thien Huu Nguyen. 230-239
- To the Globe (TTG): Towards Language-Driven Guaranteed Travel Planning. Da Ju, Song Jiang, Andrew Cohen, Aaron Foss, Sasha Mitts, Arman Zharmagambetov, Brandon Amos, Xian Li 0003, Justine T. Kao, Maryam Fazel-Zarandi, Yuandong Tian. 240-249
- MATSA: Multi-Agent Table Structure Attribution. Puneet Mathur, Alexa Siu, Nedim Lipka, Tong Sun 0005. 250-258
- OpenT2T: An Open-Source Toolkit for Table-to-Text Generation. Haowei Zhang 0002, Shengyun Si, Yilun Zhao 0001, Lujing Xie, Zhijian Xu, Lyuhao Chen, Linyong Nan, Pengcheng Wang, Xiangru Tang, Arman Cohan. 259-269
- ChatHF: Collecting Rich Human Feedback from Real-time Conversations. Andrew Li, Zhenduo Wang, Ethan Mendes, Duong Minh Le, Wei Xu 0004, Alan Ritter. 270-279
- KMatrix: A Flexible Heterogeneous Knowledge Enhancement Toolkit for Large Language Model. Shun Wu, Di Wu, Kun Luo, Xueyou Zhang, Jun Zhao 0001, Kang Liu 0001. 280-290
- Xinference: Making Large Model Serving Easy. Weizheng Lu, Lingfeng Xiong, Feng Zhang 0001, Xuye Qin, Yueguo Chen. 291-300
- RETAIN: Interactive Tool for Regression Testing Guided LLM Migration. Tanay Dixit, Daniel Lee, Sally Fang, Sai Sree Harsha, Anirudh Sureshan, Akash V. Maharaj, Yunyao Li 0001. 301-310
- ClaimLens: Automated, Explainable Fact-Checking on Voting Claims Using Frame-Semantics. Jacob Daniel Devasier, Rishabh Mediratta, Phuong Anh Le, David Huang, Chengkai Li 0001. 311-319
- RAGViz: Diagnose and Visualize Retrieval-Augmented Generation. Tevin Wang, Jingyuan He, Chenyan Xiong. 320-327
- PyMarian: Fast Neural Machine Translation and Evaluation in Python. Thamme Gowda, Roman Grundkiewicz, Elijah Rippeth, Matt Post, Marcin Junczys-Dowmunt. 328-335
- LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection. Mervat Abassy, Kareem Ashraf Elozeiri, Alexander Aziz, Minh Ngoc Ta, Raj Vardhan Tomar, Bimarsha Adhikari, Saad El Dine Ahmed, Yuxia Wang 0003, Osama Mohammed Afzal, Zhuohan Xie, Jonibek Mansurov, Ekaterina Artemova, Vladislav Mikhailov, Rui Xing 0002, Jiahui Geng, Hasan Iqbal, Zain Muhammad Mujahid, Tarek Mahmoud, Akim Tsvigun, Alham Fikri Aji, Artem Shelmanov, Nizar Habash, Iryna Gurevych, Preslav Nakov. 336-343
- Translation Canvas: An Explainable Interface to Pinpoint and Analyze Translation Systems. Chinmay Dandekar, Wenda Xu, Xi Xu, Siqi Ouyang, Lei Li 0005. 344-350
- mbrs: A Library for Minimum Bayes Risk Decoding. Hiroyuki Deguchi 0002, Yusuke Sakai 0010, Hidetaka Kamigaito, Taro Watanabe. 351-362
- Debug Smarter, Not Harder: AI Agents for Error Resolution in Computational Notebooks. Konstantin Grotov, Artem Borzilov, Maksim Krivobok, Timofey Bryksin, Yaroslav Zharov. 363-371
- Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play. Sha Li, Revanth Gangi Reddy, Khanh-Duy Nguyen, Qingyun Wang 0005, Yi Fung 0001, Chi Han, Jiawei Han 0001, Kartik Natarajan, Clare R. Voss, Heng Ji 0001. 372-381
- SparkRA: A Retrieval-Augmented Knowledge Service System Based on Spark Large Language Model. Dayong Wu, Jiaqi Li 0004, Baoxin Wang, Honghong Zhao, Siyuan Xue, Yanjie Yang, Zhijun Chang, Rui Zhang, Li Qian, Bo Wang, Shijin Wang 0001, Zhixiong Zhang, Guoping Hu. 382-389
- Generative Dictionary: Improving Language Learner Understanding with Contextual Definitions. Kai-Wen Tuan, Hai-Lun Tu, Jason S. Chang. 390-396
- WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models. Prannaya Gupta, Le Qi Yau, Hao Han Low, I-Shiang Lee, Hugo Maximus Lim, Yu Xin Teoh, Jia Hng Koh, Dar Win Liew, Rishabh Bhardwaj, Rajat Bhardwaj, Soujanya Poria. 397-407
- RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation. Xuanwang Zhang, Yunze Song, Yidong Wang 0003, Shuyun Tang, Xinfeng Li, Zhengran Zeng, Zhen Wu 0002, Wei Ye 0004, Wenyuan Xu 0001, Yue Zhang 0004, Xinyu Dai, Shikun Zhang, Qingsong Wen. 408-418
- AutoTrain: No-code training for state-of-the-art models. Abhishek Thakur. 419-423
- Sailor: Open Language Models for South-East Asia. Longxu Dou, Qian Liu 0033, Guangtao Zeng, Jia Guo, Jiahui Zhou, Xin Mao, Ziqi Jin, Wei Lu 0011, Min Lin. 424-435
- RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation. Qinyu Luo, Yining Ye, Shihao Liang, Zhong Zhang 0004, Yujia Qin, Yaxi Lu, Yesai Wu, Xin Cong, Yankai Lin 0001, Yingli Zhang, Xiaoyin Che, Zhiyuan Liu 0001, Maosong Sun 0001. 436-464
- DeepPavlov 1.0: Your Gateway to Advanced NLP Models Backed by Transformers and Transfer Learning. Maksim Savkin, Anastasia Voznyuk, Fedor Ignatov, Anna Korzanova, Dmitry Karpov, Alexander Popov, Vasily Konovalov. 465-474
- Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework. Vladimir Arkhipkin, Viacheslav Vasilev, Andrei Filatov, Igor Pavlov, Julia Agafonova, Nikolai Gerasimenko, Anna Averchenkova, Evelina Mironova, Anton Bukashkin, Konstantin Kulikov, Andrey Kuznetsov, Denis Dimitrov. 475-485
- MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications. Xiangru Tang, Chunyuan Deng, Hanming Wang, Haoran Wang 0005, Yilun Zhao 0001, Wenqi Shi, May Fung, Wangchunshu Zhou, Jiannan Cao, Heng Ji 0001, Arman Cohan, Mark Gerstein. 486-496
- WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild. Yuntian Deng, Wenting Zhao, Jack Hessel, Xiang Ren 0001, Claire Cardie, Yejin Choi 0001. 497-506
- Instruction-Driven Game Engine: A Poker Case Study. Hongqiu Wu, Xingyuan Liu, Yan Wang, Hai Zhao 0001. 507-519
- LM-Interview: An Easy-to-use Smart Interviewer System via Knowledge-guided Language Model Exploitation. Hanming Li, Jifan Yu, Ruimiao Li, Zhanxin Hao, Yan Xuan, Jiaxi Yuan, Bin Xu 0001, Juanzi Li, Zhiyuan Liu 0001. 520-528