Abstract is missing.
- Accelerating RL-Based Scheduler Adaptation with Transfer Learning in Evolving HPC ArchitecturesLingfei Wang, Maria A. Rodriguez 0001, Nir Lipovetzky. 1-11 [doi]
- LLM-Powered Automated Cloud Forensics: From Log Analysis to InvestigationDalal Alharthi, Rozhin Yasaei. 12-22 [doi]
- Korel: Mitigating Stragglers via Real-Time Automatic Mixed Precision in Distributed Deep Learning EnvironmentsHyunseung Jung, HyungJun Kim, HeonChang Yu. 23-31 [doi]
- Multi-Agent Reinforcement Learning-Based In-Place Scaling Engine for Edge-Cloud SystemsJovan Prodanov, Blaz Bertalanic, Carolina Fortuna, Shih-Kai Chou, Matjaz B. Juric, Ramon Sanchez-Iborra, Jernej Hribar. 32-42 [doi]
- Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design FrameworkJulien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez, Paul Théron. 43-53 [doi]
- The IoT Whisperer: A Framework for Intelligent IoT Service Composition Through LLMsEwan Warburton, Abdessalam Elhabbash, Saad Ezzini, Yehia Elkhatib. 54-64 [doi]
- Dynamic In-node Group-Aware Scheduling for Multi-Tenant Machine Learning Services on KubernetesPeini Liu, Jordi Guitart. 65-74 [doi]
- ESTHER: Application-First Hardware-Level QoS-Enforcement for Cloud Native EnvironmentsOliver Larsson, Thijs Metsch, Cristian Klein, Erik Elmroth. 75-85 [doi]
- Towards Secure Cloud-Native Computing: Unveiling Kubernetes Misconfigurations with Large Language ModelsMostafa Anouar Ghorab, Mohamed Aymen Saied. 86-96 [doi]
- Is Your Cluster Truly Fully Loaded? Exploring Shadow Resources in Host State SynchronizationJiawen Liu, Yuehao Xu, Zhijun Ding. 97-108 [doi]
- Helm-ET: Reducing Exposure to Lateral Movement in Kubernetes ArtifactsJacopo Bufalino, Jose Luiz Martin Navarro, Aleksi Peltonen, Tuomas Aura. 109-120 [doi]
- HeteroScheduler: Dynamic Task Scheduling for CPU-GPU Optimization and Contention Mitigation in Cloud Data CentersSeokwon Choi, Hyeonsang Eom. 121-131 [doi]
- MOBOS: Co-Optimizing Cost and Execution Time in Serverless Workflow with Multi-Objective Bayesian OptimizationMinjae Kang, HeonChang Yu. 132-142 [doi]
- Causal Latency Modelling for Cloud MicroservicesChristopher Lohse, Diego Tsutsumi, Amadou Ba, Pavithra Harsha, Chitra Subramanian, Martin Straesser, Marco Ruffini. 143-151 [doi]
- HotSwap: Enabling Live Dependency Sharing in Serverless ComputingRui Li, Devesh Tiwari, Gene Cooperman. 152-162 [doi]
- Speeding up Model Loading with FastsafetensorsTakeshi Yoshimura, Tatsuhiro Chiba, Manish Sethi, Daniel G. Waddington, Swaminathan Sundararaman. 163-174 [doi]
- Cost-Efficient VM Selection for Cloud-Based LLM Inference with KV Cache OffloadingKihyun Kim, Jinwoo Kim, Hyunsun Chung, Myung-Hoon Cha, Hong-Yeon Kim, Youngjae Kim. 175-185 [doi]
- ZipNN: Lossless Compression for AI ModelsMoshik Hershcovitch, Andrew Wood, Leshem Choshen, Guy Girmonsky, Roy Leibovitz, Or Ozeri, Ilias Ennmouri, Michal Malka, Sang (Peter) Chin, Swaminathan Sundararaman, Danny Harnik. 186-198 [doi]
- Disk-Based Shared KV Cache Management for Fast Inference in Multi-Instance LLM RAG SystemsHyungwoo Lee, Kihyun Kim, Jinwoo Kim, Jungmin So, Myung-Hoon Cha, Hong-Yeon Kim, James J. Kim, Youngjae Kim. 199-209 [doi]
- ClusterLink: Redefining Application Connectivity for the Multi-cloud EraKfir Toledo, Pravein Govindan Kannan, Michal Malka, Etai Lev-Ran, Or Ozeri, Vita Bortnikov, Ziv Nevo, Kathy Barabash. 210-222 [doi]
- Precomputation-Optimized Lakehouse Architecture for Online Analytical Processing TasksHaida Zhang, Lin Sun, Zhengtong Zhang, Jiayang Xia, Ziang Huang, Jiansi Wang, Haopeng Chen, Yan Jiao, Yongming Xu. 223-232 [doi]
- Energy-Aware Resource Allocation and Container Migration in Distributed Data Centers Under Variable Energy Pricing: A Genetic Programming Hyper-Heuristic ApproachMathew Falloon, Hui Ma, Gang Chen. 233-242 [doi]
- EnergyLess: An Energy-Aware Serverless Workflow Batch Orchestration on the Computing ContinuumReza Farahani, Radu Prodan. 243-254 [doi]
- Carbon-Aware Temporal Data Transfer Scheduling Across Cloud DatacentersElvis Rodrigues, Jacob Goldverg, Tevfik Kosar. 255-264 [doi]
- TraceWizard: End-to-End Distributed Tracing Across Host and Network Devices in CloudKuangyuan Li, Jingrun Zhang, Pengfei Chen, Hongyang Chen, Ruipeng Hong, Wanqi Yang, Chen Sun. 265-276 [doi]
- Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM InferencePol G. Recasens, Ferran Agullo, Yue Zhu, Chen Wang 0039, Eun-Kyung Lee, Olivier Tardieu, Jordi Torres, Josep Lluís Berral. 277-287 [doi]
- Efficient Microservice Monitoring Via Kernel Transformation and FFT ForecastingMarianna Ojanen, Maryam Sabzevari, Sándor Szedmák. 288-295 [doi]
- Efficient Versioning for UnikernelsGaulthier Gain, Benoit Knott, Laurent Mathy. 296-307 [doi]
- Real-Time Interference-Aware CPU and I/O Capping Mechanism for Multi-Tenant ContainersMohammadReza HoseinyFarahabady, Albert Y. Zomaya. 308-317 [doi]
- SLO-Aware Container Orchestration on Kubernetes ClustersAngelo Marchese, Orazio Tomarchio. 318-327 [doi]
- ReSACO: A Meta Reinforcement Learning Method for Fast Offloading in Mobile Edge ComputingMyeongjun Kim, HeonChang Yu. 328-338 [doi]
- MSTH-Former: Optimizing Workload Prediction in Edge-Cloud Continuum with Multi-Scale Temporal and Hierarchical Knowledge Convergence and DistillationSharmen Akhter, Eui-nam Huh. 339-350 [doi]
- PROBA: Enhancing Serverless Edge Computing via Adaptive Task Scheduling and Probabilistic Resource SharingManish Pandey, Byungchul Tak, Young-Woo Kwon 0001. 351-361 [doi]
- RACS-SADL: Robust and Understandable Randomized Consensus in the CloudPasindu Tennage, Antoine Desjardins, Lefteris Kokoris-Kogias. 362-373 [doi]
- An Experimental Validation of Architectural Measures for Cloud-Native Quality EvaluationsRobin Lichtenthäler, Guido Wirtz. 374-384 [doi]
- Routing Strategies for RoCE Networks in AI CloudsAbdul Alim, Ali Sydney, Liran Schour, Abdullah Kayi, Laurent Schares, Pavlos Maniotis, Anand Singh, Bengi Karacali. 385-396 [doi]
- QPS- Fit: An Efficient and Performant Parallel Algorithm for Hybrid Optical and Packet SwitchingDongzhao Song, Jingfan Meng, Qianru Yu, Jun Jim Xu. 397-408 [doi]
- HEART: Heterogeneous-Aware Traffic Allocation in Multi-Replica Deployments on KubernetesHokun Park, Donggyun Kim, HyungJun Kim, Gyujeong Lim, HeonChang Yu. 409-419 [doi]
- Optimizing Receive Flow Steering for Mixed Traffic in High-Performance Cloud DatacentersJunseo Jang, Jaehyun Hwang. 420-429 [doi]
- Avoiding Pitfalls in Networked Key-Value Store for Tiered MemorySeungmin Shin, Leeiu Kim, Wookyung Lee, Eyee Hyun Nam, Seungmin Kim, Bryan S. Kim, Sungjin Lee 0001, Eunji Lee. 430-441 [doi]
- Universal Workers: A Vision for Eliminating Cold Starts in Serverless ComputingSaman Akbari, Manfred Hauswirth. 442-444 [doi]
- Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM InferenceYue Zhu, Hao Yu, Chen Wang, Zhuoran Liu, Eun-Kyung Lee. 445-447 [doi]
- Automated LLM Deployment and Evaluation: A Cloud-Native Approach Using LLM-as-a-JudgeAnsar Rafique, Brian D. Marsden. 448-450 [doi]
- DNN-Adapt: Reinforcement Learning-Based Hybrid Batching for Efficient DNN ServingMilind Varma, Sai Venkat Malreddy, Liting Hu. 451-453 [doi]
- Game-Theoretic Reinforcement Learning for Task Optimization Under Time-Sensitive ConstraintsEmanuele Carlini 0001, Patrizio Dazzi, Matteo Mordacchini. 454-456 [doi]
- Revisiting SQL Statement Logging for SQLite on AWS S3Yewon Shin, Jonghyeok Park. 457-459 [doi]
- Serverless Data Analytics (Finally) Bridging the Gap: Introducing the Ortzi DataFrameGermán T. Eizaguirre, Marc Hostau, Marc Sáanchez-Artigas. 460-467 [doi]
- Temporal Fusion Transformer Based Vertical Scaling Management for KubernetesKemalcan Bora, Elli Kartsakli, Eduardo Quiñones Moreno. 468-473 [doi]