Abstract is missing.
- Hops: Fine-grained heterogeneous sensing, efficient and fair Deep Learning cluster scheduling systemQinghe Wang, Futian Wang, Xinwei Zheng. 1-17 [doi]
- Queue Management for SLO-Oriented Large Language Model ServingArchit Patke, Dhemath Reddy, Saurabh Jha, Haoran Qiu, Christian Pinto, Chandra Narayanaswami, Zbigniew Kalbarczyk, Ravishankar K. Iyer. 18-35 [doi]
- Kale: Elastic GPU Scheduling for Online DL Model TrainingZiyang Liu, Renyu Yang, Jin Ouyang, Weihan Jiang, Tianyu Ye, Menghao Zhang 0001, Sui Huang, Jiaming Huang, Chengru Song, Di Zhang, Tianyu Wo, Chunming Hu. 36-51 [doi]
- FedCaSe: Enhancing Federated Learning with Heterogeneity-aware Caching and SchedulingRedwan Ibne Seraj Khan, Arnab K. Paul, Yue Cheng 0001, Xun Steve Jian, Ali Reza Butt. 52-68 [doi]
- SQLStateGuard: Statement-Level SQL Injection Defense Based on Learning-Driven MiddlewareXin Liu 0050, Yuanyuan Huang, Tianyi Wang, Song Li 0006, Weina Niu, Jun Shen 0001, Qingguo Zhou, Xiaokang Zhou. 69-82 [doi]
- Vista: Machine Learning based Database Performance Troubleshooting Framework in Amazon RDSVikramank Y. Singh, Zhao Song, Balakrishnan (Murali) Narayanaswamy, Kapil Eknath Vaidya, Tim Kraska. 83-98 [doi]
- Building AI Agents for Autonomous Clouds: Challenges and Design PrinciplesManish Shetty, Yinfang Chen, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Xuchao Zhang, Jonathan Mace, Dax Vandevoorde, Pedro Las-Casas, Shachee Mishra Gupta, Suman Nath, Chetan Bansal, Saravan Rajmohan. 99-110 [doi]
- Zero-SAD: Zero-Shot Learning Using Synthetic Abnormal Data for Abnormal Behavior Detection on Private CloudJae Seok Kim, Joonho Seo, SeonJin Hwang, Jin-Myeong Shin, Yoon Ho Choi. 111-125 [doi]
- Forecasting Algorithms for Intelligent Resource Scaling: An Experimental AnalysisYanlei Diao, Dominik Horn, Andreas Kipf, Oleksandr Shchur, Ines Benito, Wenjian Dong, Davide Pagano, Pascal Pfeil, Vikram Nathan, Balakrishnan Narayanaswamy, Tim Kraska. 126-143 [doi]
- Snapipeline: Accelerating Snapshot Startup for FaaS ContainersYuqiao Lan, Xiaohui Peng, Yifan Wang. 144-159 [doi]
- En4S: Enabling SLOs in Serverless Storage SystemsMinghao Xie, Chen Qian 0001, Heiner Litz. 160-177 [doi]
- Pre-Warming is Not Enough: Accelerating Serverless Inference With Opportunistic Pre-LoadingYifan Sui, Hanfei Yu, Yitao Hu, Jianxun Li, Hao Wang. 178-195 [doi]
- Faascale: Scaling MicroVM Vertically for Serverless Computing with Memory ElasticityXinmin Zhang, Qiang He, Hao Fan 0006, Song Wu 0001. 196-212 [doi]
- Rethinking the Networking Stack for Serverless Environments: A Sidecar ApproachVishwanath Seshagiri, Abhinav Gupta, Vahab Jabrayilov, Avani Wildani, Kostis Kaffes. 213-222 [doi]
- Process-as-a-Service: Unifying Elastic and Stateful Clouds with Serverless ProcessesMarcin Copik, Alexandru Calotoiu, Gyorgy Réthy, Roman Böhringer, Rodrigo Bruno, Torsten Hoefler. 223-242 [doi]
- AutoBurst: Autoscaling Burstable Instances for Cost-effective Latency SLOsRubaba Hasan, Timothy Zhu, Bhuvan Urgaonkar. 243-258 [doi]
- Is It Time To Put Cold Starts In The Deep Freeze?Carlos Segarra, Ivan Durev, Peter R. Pietzuch. 259-268 [doi]
- Towards Swap-Free, Continuous Ballooning for Fast, Cloud-Based Virtual Machine MigrationsKevin Alarcón Negy, Tycho Nightingale, Hakim Weatherspoon, Zhiming Shen. 269-283 [doi]
- PCLive: Pipelined Restoration of Application Containers for Reduced Service DowntimeShiv Bhushan Tripathi, Debadatta Mishra. 284-301 [doi]
- Scheduling for Reduced Tail Task Latencies in Highly Utilized DatacentersSmita Vijayakumar, Anil Madhavapeddy, Evangelia Kalyvianaki. 302-321 [doi]
- Krios: Scheduling Abstractions and Mechanisms for Enabling a LEO Compute CloudVaibhav Bhosale, Ada Gavrilovska, Ketan Bhardwaj. 322-340 [doi]
- Demystifying the Fight Against Complexity: A Comprehensive Study of Live Debugging Activities in Production Cloud SystemsP. C. Sruthi, Zinan Guo, Deming Chu, Zhengyan Chen, Yongle Zhang. 341-360 [doi]
- Deoxys: A Causal Inference Engine for Unhealthy Node Mitigation in Large-scale Cloud InfrastructureChaoyun Zhang, Randolph Yao, Si-qin, Ze Li, Shekhar Agrawal, Binit R. Mishra, Tri Tran, Minghua Ma, Qingwei Lin, Murali Chintalapati, Dongmei Zhang 0001. 361-379 [doi]
- INS: Identifying and Mitigating Performance Interference in Clouds via Interference-Sensitive PathsZiwei Huang, Mengyao Xie, Shibo Tang, Zihao Chang, Zhicheng Yao, Yungang Bao, Sa Wang. 380-397 [doi]
- TailClipper: Reducing Tail Response Time of Distributed Services Through System-Wide SchedulingNathan Ng, Abel Souza, Ahmed Ali-Eldin, David E. Irwin, Don Towsley, Prashant J. Shenoy. 398-414 [doi]
- On-demand and Parallel Checkpoint/Restore for GPU ApplicationsYanning Yang, Dong Du 0003, Haitao Song 0001, Yubin Xia. 415-433 [doi]
- MoEsaic: Shared Mixture of ExpertsUmesh Deshpande, Travis Janssen, Mudhakar Srivatsa, Swaminathan Sundararaman. 434-442 [doi]
- FaPES: Enabling Efficient Elastic Scaling for Serverless Machine Learning PlatformsXiaoyang Zhao, Siran Yang, Jiamang Wang, Lansong Diao, Lin Qu, Chuan Wu 0001. 443-459 [doi]
- KACE: Kernel-Aware Colocation for Efficient GPU Spatial SharingBing-Shiun Han, Tathagata Paul, Zhenhua Liu 0002, Anshul Gandhi. 460-469 [doi]
- Pack: Towards Communication-Efficient Homomorphic Encryption in Federated LearningZeyuan Zuo, Ningxin Su, Baochun Li, Teng Zhang. 470-486 [doi]
- InferCool: Enhancing AI Inference Cooling through Transparent, Non-Intrusive Task ReassignmentQiangyu Pei, Lin Wang, Dong Zhang, Bingheng Yan, Chen Yu 0003, Fangming Liu. 487-504 [doi]
- CDN-Shifter: Leveraging Spatial Workload Shifting to Decarbonize Content Delivery NetworksJorge Murillo, Walid A. Hanafy, David E. Irwin, Ramesh K. Sitaraman, Prashant J. Shenoy. 505-521 [doi]
- Accountable Carbon Footprints and Energy Profiling For Serverless FunctionsPrateek Sharma, Alexander Fuerst. 522-541 [doi]
- The Sunk Carbon Fallacy: Rethinking Carbon Footprint Metrics for Effective Carbon-Aware SchedulingNoman Bashir, Varun Gohil, Anagha Belavadi Subramanya, Mohammad Shahrad, David Irwin, Elsa Olivetti, Christina Delimitrou. 542-551 [doi]
- Exploring the Efficiency of Renewable Energy-based Modular Data Centers at ScaleJinghan Sun, Zibo Gong, Anup Agarwal, Shadi A. Noghabi, Ranveer Chandra, Marc Snir, Jian Huang 0006. 552-569 [doi]
- The Hidden Carbon Footprint of Serverless ComputingRohan Basu Roy, Raghavendra Kanakagiri, Yankai Jiang 0002, Devesh Tiwari. 570-579 [doi]
- uIO: Lightweight and Extensible UnikernelsMasanori Misono, Peter Okelmann, Charalampos Mainas, Pramod Bhatotia. 580-599 [doi]
- Racos: Improving Erasure Coding State Machine Replication using Leaderless ConsensusJonathan Zarnstorff, Lucas Lebow, Christopher Siems, Dillon Remuck, Colin Ruiz, Lewis Tseng. 600-617 [doi]
- Occam's Razor for Distributed ProtocolsZiliang Lai, Fan Cui, Hua Fan 0002, Eric Lo 0001, Wenchao Zhou, Feifei Li 0001. 618-636 [doi]
- VWeiST: A Scalable and Efficient Proof-of-Stake Blockchain ConsensusHang Xiong, Cheng Qu, Jing Li. 637-649 [doi]
- Securing a Multiprocessor KVM Hypervisor with RustYu-Hsun Chiang, Wei-Lin Chang, Shih-wei Li, Jan-Ting Tu. 650-667 [doi]
- SURE: Secure Unikernels Make Serverless Computing Rapid and EfficientFederico Parola, Shixiong Qi, Anvaya B. Narappa, K. K. Ramakrishnan, Fulvio Risso. 668-688 [doi]
- TianMen: a DPU-based storage network offloading structure for disaggregated datacentersWeiyue Zhao, Jingya Wu, Wenyan Lu, Xiaowei Li 0001, Guihai Yan. 689-703 [doi]
- H2C-Dedup: Reducing I/O and GC Amplification for QLC SSDs from the Deduplication Metadata PerspectiveYunsheng Dong, Boju Chen, Yanqi Pan, Xiangyu Zou, Wen Xia. 704-719 [doi]
- RomeFS: A CXL-SSD Aware File System Exploiting Synergy of Memory-Block Dual PathsYekang Zhan, Haichuan Hu, Xiangrui Yang, Shaohua Wang, Qiang Cao, Hong Jiang, Jie Yao. 720-736 [doi]
- SmartGraph: A Framework for Graph Processing in Computational StorageSoheil Khadirsharbiyani, Nima Elyasi, Armin Haj Aboutalebi, Chun-Yi Liu 0002, Changho Choi, Mahmut Taylan Kandemir. 737-754 [doi]
- ConMonitor: Lightweight Container Protection with Virtualization and VM FunctionsShaowen Xu, Qihang Zhou, Zhicong Zhang, Xiaoqi Jia, Donglin Liu, Heqing Huang, Haichao Du, Zhenyu Song. 755-773 [doi]
- ByteMQ: A Cloud-native Streaming Data Layer in ByteDanceYancan Mao, Ruohang Yin, Liyuan Lei, Peng Ye, Shengfu Zou, Shizheng Tang, Yunzhe Guo, Ye Yuan, Xiaochen Yu, Bo Wan, Yunfei Gong, Changli Gao, Guanghui Zhang, Jian Shen, Rui Shi, Richard T. B. Ma. 774-791 [doi]
- Dynamic Idle Resource Leasing To Safely Oversubscribe Capacity At MetaNishant Gupta, Iyswarya Narayanan, Shivam Handa, Sayak Chakraborti, Pankit Thapar, Baohua Shan, Ariel Rao, Yuanlai Liu, Pengyuan Wang, Yuqing Wu, Qingyi Gao, Chris Chao-Chun Cheng, Sihan You, Louis Huang, Jingyuan Fan, Kenny Yu, Kevin Lin, Tengfei Mu, Parth Malani, Haiying Wang, Trey Lu, Peter Zhang. 792-810 [doi]
- Byways: High-Performance, Isolated Network Functions for Multi-Tenant Cloud ServersXinyu Han, Yuan Gao, Gabriel Parmer, Timothy Wood 0001. 811-829 [doi]
- Cloud-native Workflow Scheduling using a Hybrid Priority Rule, Dynamic Resource Allocation, and Dynamic Task PartitionJungeun Shin, Diana Arroyo, Asser N. Tantawi, Chen Wang 0039, Alaa Youssef, Rakesh Nagi. 830-846 [doi]
- Streamlining Cloud-Native Application Development and Deployment with Robust EncapsulationPawissanutt Lertpongrujikorn, Hai Duc Nguyen 0005, Mohsen Amini Salehi. 847-865 [doi]
- Komet: A Serverless Platform for Low-Earth Orbit Edge ServicesTobias Pfandzelter, David Bermbach. 866-882 [doi]
- A Data Optimizer for Region-Aware Self-describing Files in Scientific ComputingYanjie Song, Tianyuan Wu, Yuanhao Li, Guancheng Li, Yuchen Liu, Shu Yin 0001, Wei Xue, Junchao Wang. 883-897 [doi]
- Rethinking State Management in Actor Systems for Cloud-Native ApplicationsYijian Liu, Rodrigo Laigner, Yongluan Zhou. 898-914 [doi]
- IncBoost: Scaling Incremental Graph Processing for Edge Deletions and Weight UpdatesXizhe Yin, Zhijia Zhao 0001, Rajiv Gupta 0001. 915-932 [doi]
- Memory Management in Complex Join Queries: A Re-evaluation StudyShiva Jahangiri, Michael J. Carey 0001, Johann Christoph Freytag. 933-942 [doi]
- FAAStloop: Optimizing Loop-Based Applications for Serverless ComputingShruti Mohanty, Vivek M. Bhasi, Myungjun Son, Mahmut Taylan Kandemir, Chita Das. 943-960 [doi]
- Distributed Training of Large Language Models on AWS TrainiumXinwei Fu, Zhen Zhang, Haozheng Fan, Guangtai Huang, Mohammad El-Shabani, Randy Huang, Rahul Solanki, Fei Wu, Ron Diamant, Yida Wang 0003. 961-976 [doi]
- Near-Lossless Gradient Compression for Data-Parallel Distributed DNN TrainingXue Li, Cheng Guo, Kun Qian, Menghao Zhang 0001, Mengyu Yang, Mingwei Xu. 977-994 [doi]
- Accelerating Transfer Learning with Near-Data Computation on Cloud Object StoresDiana Petrescu, Arsany Guirguis, Do Le Quoc, Javier Picorel, Rachid Guerraoui, Florin Dinu. 995-1011 [doi]
- Inshrinkerator: Compressing Deep Learning Training Checkpoints via Dynamic QuantizationAmey Agrawal, Sameer Reddy, Satwik Bhattamishra, Venkata Prabhakara Sarath Nookala, Vidushi Vashishth, Kexin Rong 0001, Alexey Tumanov. 1012-1031 [doi]
- ParaGAN: A Scalable Distributed Training Framework for Generative Adversarial NetworksZiji Shi, Jialin Li, Yang You 0001. 1032-1044 [doi]