CrossBench: Reproducible LLM Memory Evaluation Across Conversational and Coding Workloads

Valentin Radu, Myles Foley, Maryam Vahdat Pour, Massimo Pegoraro, Frank Faricy, Mark Girolami. CrossBench: Reproducible LLM Memory Evaluation Across Conversational and Coding Workloads. In Sourav Battacharya, Shijia Pan, editors, Proceedings of the 24th Annual International Conference on Mobile Systems, Applications and Services Workshops, MobiSys Workshops 2026, University of Cambridge, Cambridge, United Kingdom, June 21-25, 2026. pages 87-92, ACM, 2026. [doi]

Abstract

Abstract is missing.