- Sabre: Hardware-Accelerated Snapshot Compression for Serverless MicroVMs. Nikita Lazarev, Varun Gohil, James Tsai, Andy Anderson, Bhushan Chitlur, Zhiru Zhang, Christina Delimitrou. 1-18
- Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration. Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu 0001, Jia Rao, Yifan Yuan, Ren Wang 0001. 19-35
- Managing Memory Tiers with CXL in Virtualized Environments. Yuhong Zhong, Daniel S. Berger, Carl A. Waldspurger, Ryan Wee, Ishwar Agarwal, Rajat Agarwal, Frank Hady, Karthik Kumar, Mark D. Hill, Mosharaf Chowdhury, Asaf Cidon. 37-56
- Harvesting Memory-bound CPU Stall Cycles in Software with MSH. Zhihong Luo, Sam Son, Sylvia Ratnasamy, Scott Shenker. 57-75
- A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications. Lei Chen, Shi Liu, Chenxi Wang, Haoran Ma, Yifan Qiao, Zhe Wang 0017, Chenggang Wu 0002, Youyou Lu, Xiaobing Feng 0002, Huimin Cui, Shan Lu 0001, Harry Xu 0001. 77-95
- DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency. Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang 0005, Miryung Kim, Harry Xu 0001. 97-115
- Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve. Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee. 117-134
- ServerlessLLM: Low-Latency Serverless Inference for Large Language Models. Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai. 135-153
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management. Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim. 155-172
- Llumnix: Dynamic Scheduling for Large Language Model Serving. Biao Sun, Ziming Huang, Hanyu Zhao, Wencong Xiao, Xinyi Zhang, Yong Li 0020, Wei Lin 0016. 173-191
- DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving. Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin 0008, Hao Zhang 0108. 193-210
- ACCL+: an FPGA-Based Collective Engine for Distributed Applications. Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso. 211-231
- Beaver: Practical Partial Snapshots for Distributed Cloud Services. Liangcheng Yu, Xiao Zhang, Haoran Zhang, John Sonchack, Dan R. K. Ports, Vincent Liu 0001. 233-249
- Fast and Scalable In-network Lock Management Using Lock Fission. Hanze Zhang, Ke Cheng, Rong Chen 0001, Haibo Chen 0001. 251-268
- Chop Chop: Byzantine Atomic Broadcast to the Network Limit. Martina Camaioni, Rachid Guerraoui, Matteo Monti, Pierre-Louis Roman, Manuel Vidigueira, Gauthier Voron. 269-287
- Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning. Yi Zhai, Sijia Yang, Keyu Pan, Renwei Zhang, Shuo Liu, Chao Liu, ZiChun Ye, Jianmin Ji, Jie Zhao, Yu Zhang 0086, Yanyong Zhang. 289-305
- Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation. Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi 0001, Ningxin Zheng, Ziming Miao, Fan Yang 0024, Ting Cao, Yuqing Yang 0001, Mao Yang. 307-323
- Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents. Qizheng Zhang, Ali Imran 0005, Enkeleda Bardhi, Tushar Swamy, Nathan Zhang, Muhammad Shahbaz, Kunle Olukotun. 325-345
- nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training. Zhiqi Lin, Youshan Miao, Quanlu Zhang, Fan Yang 0024, Yi Zhu, Cheng Li 0001, Saeed Maleki, Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, Mao Yang, Lintao Zhang, Lidong Zhou. 347-363
- ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications. Yuhan Liu, Chengcheng Wan 0001, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu 0001, Michael Maire. 365-386
- SquirrelFS: using the Rust compiler to check file-system crash consistency. Hayley LeBlanc, Nathan Taylor, James Bornholt, Vijay Chidambaram. 387-404
- High-throughput and Flexible Host Networking for Accelerated Computing. Athinagoras Skiadopoulos, Zhiqiang Xie, Mark Zhao, Qizhe Cai, Saksham Agarwal, Jacob Adelmann, David Ahern, Carlo Contavalli, Michael D. Goldflam, Vitaly Mayatskikh, Raghu Raja, Daniel Walton, Rachit Agarwal 0001, Shrijeet Mukherjee, Christos Kozyrakis. 405-423
- IntOS: Persistent Embedded Operating System and Language Support for Multi-threaded Intermittent Computing. Yilun Wu, Byounguk Min, Mohannad Ismail, Wenjie Xiong 0001, Changhee Jung, Dongyoon Lee. 425-443
- Data-flow Availability: Achieving Timing Assurance in Autonomous Systems. Ao Li 0006, Ning Zhang 0017. 445-463
- Microkernel Goes General: Performance and Compatibility in the HongMeng Production Microkernel. Haibo Chen 0001, Xie Miao, Ning Jia, Nan Wang, Yu Li, Nian Liu, Yutao Liu, Fei Wang, Qiang Huang, Kun Li, Hongyang Yang, Hui Wang, Jie Yin, Yu Peng, Fengwei Xu. 465-485
- When will my ML Job finish? Toward providing Completion Time Estimates through Predictability-Centric Scheduling. Abdullah Bin Faisal, Noah Martin, Hafiz Mohsin Bashir, Swaminathan Lamelas, Fahad R. Dogar. 487-505
- Optimizing Resource Allocation in Hyperscale Datacenters: Scalability, Usability, and Experiences. Neeraj Kumar, Pol Mauri Ruiz, Vijay Menon, Igor Kabiljo, Mayank Pundir, Andrew Newell, Daniel Lee, Liyuan Wang, Chunqiang Tang. 507-528
- μSlope: High Compression and Fast Search on Semi-Structured Logs. Rui Wang, Devin Gibson, Kirk Rodrigues, Yu Luo 0006, Yun Zhang, Kaibo Wang, Yupeng Fu, Ting Chen, Ding Yuan 0004. 529-544
- ServiceLab: Preventing Tiny Performance Regressions at Hyperscale through Pre-Production Testing. Mike Chow, Yang Wang 0009, William Wang, Ayichew Hailu, Rohan Bopardikar, Bin Zhang, Jialiang Qu, David Meisner, Santosh Sonawane, Yunqi Zhang, Rodrigo Paim, Mack Ward, Ivor Huang, Matt McNally, Daniel Hodges, Zoltan Farkas, Caner Gocmen, Elvis Huang, Chunqiang Tang. 545-562
- MAST: Global Scheduling of ML Training across Geo-Distributed Datacenters at Hyperscale. Arnab Choudhury, Yang Wang 0009, Tuomas Pelkonen, Kutta Srinivasan, Abha Jain, Shenghao Lin, Delia David, Siavash Soleimanifard, Michael Chen, Abhishek Yadav, Ritesh Tijoriwala, Denis Samoylov, Chunqiang Tang. 563-580
- Automatically Reasoning About How Systems Code Uses the CPU Cache. Rishabh R. Iyer, Katerina J. Argyraki, George Candea. 581-598
- VeriSMo: A Verified Security Module for Confidential VMs. Ziqiao Zhou, Anjali, Weiteng Chen, Sishuai Gong, Chris Hawblitzel, Weidong Cui. 599-614
- Validating the eBPF Verifier via State Embedding. Hao Sun, Zhendong Su 0001. 615-628
- Using Dynamically Layered Definite Releases for Verifying the RefFS File System. Mo Zou, Dong Du 0003, Mingkai Dong 0002, Haibo Chen 0001. 629-648
- Anvil: Verifying Liveness of Cluster Management Controllers. Xudong Sun 0013, Wenjie Ma, Jiawei Tyler Gu, Zicheng Ma, Tej Chajed, Jon Howell, Andrea Lattuada 0001, Oded Padon, Lalith Suresh, Adriana Szekeres, Tianyin Xu. 649-666
- DSig: Breaking the Barrier of Signatures in Data Centers. Marcos K. Aguilera, Clément Burgelin, Rachid Guerraoui, Antoine Murat, Athanasios Xygkis, Igor Zablotchi. 667-685
- Ransom Access Memories: Achieving Practical Ransomware Protection in Cloud with DeftPunk. Zhongyu Wang, Yaheng Song, Erci Xu, Haonan Wu, Guangxun Tong, Shizhuo Sun, Haoran Li, Jincheng Liu, Lijun Ding, Rong Liu, Jiaji Zhu, Jiesheng Wu. 687-702
- Secret Key Recovery in a Global-Scale End-to-End Encryption System. Graeme Connell, Vivian Fang, Rolfe Schmidt, Emma Dauterman, Raluca Ada Popa. 703-719
- Flock: A Framework for Deploying On-Demand Distributed Trust. Darya Kaviani, Sijun Tan, Pravein Govindan Kannan, Raluca Ada Popa. 721-743
- FairyWREN: A Sustainable Cache for Emerging Write-Read-Erase Flash Interfaces. Sara McAllister, Yucong Wang, Benjamin Berg, Daniel S. Berger, George Amvrosiadis, Nathan Beckmann, Gregory R. Ganger. 745-764
- Massively Parallel Multi-Versioned Transaction Processing. Shujian Qian, Ashvin Goel. 765-781
- Burstable Cloud Block Storage with Data Processing Units. Junyi Shu, Kun Qian, Ennan Zhai, Xuanzhe Liu, Xin Jin 0008. 783-799
- Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory. Ming Zhang, Yu Hua 0001, Zhijun Yang. 801-819
- Detecting Logic Bugs in Database Engines via Equivalent Expression Transformation. Zu-Ming Jiang, Zhendong Su 0001. 821-835
- Inductive Invariants That Spark Joy: Using Invariant Taxonomies to Streamline Distributed Protocol Proofs. Tony Nuda Zhang, Travis Hance, Manos Kapritsos, Tej Chajed, Bryan Parno. 837-853
- Performance Interfaces for Hardware Accelerators. Jiacheng Ma 0002, Rishabh R. Iyer, Sahand Kashani, Mahyar Emami, Thomas Bourgeat, George Candea. 855-874
- IronSpec: Increasing the Reliability of Formal Specifications. Eli Goldweber, Weixin Yu, Seyed Armin Vakil-Ghahani, Manos Kapritsos. 875-891
- Identifying On-/Off-CPU Bottlenecks Together with Blocked Samples. Minwoo Ahn, Jeongmin Han, Youngjin Kwon, Jinkyu Jeong. 893-910
- dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving. Bingyang Wu, Ruidong Zhu, Zili Zhang, Peng Sun, Xuanzhe Liu, Xin Jin 0008. 911-927
- Parrot: Efficient Serving of LLM-based Applications with Semantic Variable. Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang 0001, Fan Yang 0024, Chen Chen, Lili Qiu. 929-945
- USHER: Holistic Interference Avoidance for Resource Optimized ML Inference. Sudipta Saha Shubha, Haiying Shen, Anand Iyer. 947-964
- Fairness in Serving Large Language Models. Ying Sheng 0007, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li 0001, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica. 965-988
- MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures. Donglin Zhuang, Zhen Zheng, Haojun Xia, Xiafei Qiu, Junjie Bai, Wei Lin 0016, Shuaiwen Leon Song. 989-1005