- Rui Yang, Rajiv Gupta 0001. DREAM: Distributed Regional Efficient Agent Management with LLMs for Online Multi-Agent Pathfinding. Operating Systems Review, 59(1):24-33, July 2025.
- Payman Behnam, Yaosheng Fu, Ritchie Zhao, Po-An Tsai, Zhiding Yu, Alexey Tumanov. EMPIRIC: Exploring Missing Pieces in KV Cache Compression for Reducing Computation, Storage, and Latency in Long-Context LLM Inference. Operating Systems Review, 59(1):46-54, July 2025.
- Payman Behnam, Alind Khare, Dhruv Garg, Alexey Tumanov. Toward Weight Sharing Paradigm for Efficient AI: Training and Inference Serving. Operating Systems Review, 59(1):34-45, July 2025.
- Robin Vonk, Joost Hoozemans, Zaid Al-Ars. GSST: Parallel string decompression at 191 GB/s on GPU. Operating Systems Review, 59(1):55-61, July 2025.
- Arney Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee. Efficient LLM Inference via Chunked Prefills. Operating Systems Review, 59(1):9-16, July 2025.
- Zhenning Yang, Archit Bhatnagar, Yiming Qiu, Tongyuan Miao, Patrick Tser Jern Kon, Yunming Xiao, Yibo Huang 0005, Martin Casado, Ang Chen 0001. Cloud Infrastructure Management in the Age of AI Agents. Operating Systems Review, 59(1):1-8, July 2025.
- Shadi Ibrahim, Jad Darrous. Erasure Coding Aware Block Placement for Data-Intensive Applications. Operating Systems Review, 59(1):62-69, July 2025.
- Yuan Wang, Zhenyuan Yang, Zhanbo Wang, Mingyu Li, Zhilin Wu, Haibo Chen. Towards Large Language Model-Friendly APls. Operating Systems Review, 59(1):17-23, July 2025.
- Pedro Henrique B. Las-Casas, Alok Gautum Kumbhare, Rodrigo Fonseca, Sharad Agarwal. LLexus: an AI agent system for incident management. Operating Systems Review, 58(1):23-36, June 2024.
- Luke Logan, Jay F. Lofstead, Xian-He Sun, Anthony Kougkas. An Evaluation of DAOS for Simulation and Deep Learning HPCWorkloads. Operating Systems Review, 58(1):37-44, June 2024.