- Welcome Message from the IEEE Cluster 2024 General Co-Chairs. Satoshi Matsuoka, James Lin.
- Welcome Message from the IEEE Cluster 2024 Workshop chair. Yohei Miki 0001. 1
- PowerSched - Managing Power Consumption in Overprovisioned Systems. Christian Simmendinger, Marcel Marquardt, Jan Mäder, Ralf Schneider. 1-8
- Welcome Message from LLMxHPC Workshop. Kevin A. Brown, Tanwi Mallick, Juliane Mueller 0002, Aleksandr Drozd. 1
- "How-to" Guide for Transitioning from Air to Liquid-Cooled High Performance Computing Systems. Dave Martinez, David Sickinger, Aaron Andersen. 9-18
- Optimizing Idle Power of HPC Systems: Practical Insights and Methods. Thomas Ilsche, Sebastian Schräder, Robert Schöne. 19-25
- Calculating User-Centric Carbon Footprints for HPC. Christian Wassermann, Mario Bielert, Gert Vanberg, Daniel Hackenberg, Christian Terboven, Matthias S. Müller. 26-35
- Evolving Large Scale HPC Monitoring & Analysis to Track Modern Dynamic Environments. Kathleen Shoga, Jim M. Brandt, Benjamin Schwaller, Thomas W. Tucker. 36-43
- Microgrid Integration with High Performance Computing Systems for Microreactor Operation. Matthew Anderson, Matthew Sgambati. 44-54
- Power-Efficiency Variation on A64FX Supercomputers and its Application to System Operation. Tomoya Kusaba, Yusuke Awaki, Kohei Yoshida, Shinobu Miwa, Hayato Yamaki, Toshihiro Hanawa, Hiroki Honda. 55-65
- Towards Improving Resource Allocation for Multi-Tenant HPC Systems: An Exploratory HPC Cluster Utilization Case Study. Robert Keßler, Simon Volpert, Stefan Wesner. 66-75
- 16 Years of SPEC Power: An Analysis of x86 Energy Efficiency Trends. Hannes Tröpgen, Robert Schöne, Thomas Ilsche, Daniel Hackenberg. 76-80
- Advanced Visualization of Power, Temperature, and Energy Metrics in HPE Cray EX Systems. Lavanya L, Stefan Ceballos. 81-85
- Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring. Jeremy J. Williams, Daniel Medeiros 0002, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis. 86-95
- Understanding Highly Configurable Storage for Diverse Workloads. Olga Kogiou, Hariharan Devarajan, Chen Wang 0004, Weikuan Yu, Kathryn M. Mohror. 96-103
- Object-Centric Data Management in HPC Workflows - A Case Study. Chen Wang, Houjun Tang, Jean Luca Bez, Suren Byna. 104-108
- Studying the Effects of Asynchronous I/O on HPC I/O Patterns. Arnav Gupta, Druva Dhakshinamoorthy, Arnab K. Paul. 109-112
- Challenges in Understanding Metadata Performance: A Case of Metadata Analysis Using Score-P. Boris Kosmynin, Radita Liem. 113-117
- RAPID: A Rapid Automatic Parallelizer for Immense Deep Neural Networks. Thibaut Tachon, Haoran Wang, Chong Li. 118-126
- Automatic Parallelization with CodeT5+: A Model for Generating OpenMP Directives. Soratouch Pornmaneerattanatri, Keichi Takahashi, Yutaro Kashiwa, Kohei Ichikawa, Hajimu Iida. 127-135
- LASSI: An LLM-Based Automated Self-Correcting Pipeline for Translating Parallel Scientific Codes. Matthew T. Dearing, Yiheng Tao, Xingfu Wu, Zhiling Lan, Valerie Taylor 0001. 136-143
- Design, Implementation and Deployment of Sunrise Integrated Health Care Cluster. Tao Yu, Zhifeng Gu, Yizheng Sun, Xiaofei Wang. 144-145
- An Optimization Pass for Training Speed-Up and Strategy Search in 3D Parallelism. Ryubu Hosoki, Kento Sato, Toshio Endo, Julien Bigot, Edouard Audit. 146-147
- Beyond Training: A Zero-Shot Framework to Neural Architecture and Accelerator Co-Exploration. Wei Fu, Wenqi Lou, Lei Gong, Chao Wang, Xuehai Zhou. 148-149
- Implementing Fast Modal Filtering of SCALE-DG. Xuanzhengbo Ren, Yuta Kawai, Hirofumi Tomita, Seiya Nishizawa, Takahiro Katagiri, Masatoshi Kawai, Tetsuya Hoshino, Toru Nagai. 150-151
- Enhancing Large Scale Brain Simulation with Optimized Parallel Algorithms on Fugaku Supercomputer. Tianxiang Lyu, Mitsuhisa Sato, Shigeki Aoki, Ryutaro Himeno, Zhe Sun. 152-153
- Innovative Computational Science by Integration of Simulation/Data/Learning on Heterogeneous Supercomputers. Kengo Nakajima, Takashi Furumura, France Boillod-Cerneux, Edoardo Di Napoli, Estela Suarez, Takashi Arakawa, Shinji Sumimoto, Hisashi Yashiro. 154-155
- Neko: A Modern, Portable, and Scalable Framework for Extreme-Scale Computational Fluid Dynamics. Niclas Jansson, Martin Karp, Stefano Markidis, Philipp Schlatter. 156-157
- vBoost: A Lock-free Distributed Index Based on vEB Tree for Disaggregated Memory. Yuting Li, Yun Xu, Pengcheng Wang, Yonghui Xu, Weiguang Wang. 158-159
- Communication Optimization for Distributed GCN Training on ABCI Supercomputer. Chen Zhuang, Peng Chen 0035, Xin Liu, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib. 160-161
- Optimizing Star Aligner for High Throughput Computing in the Cloud. Piotr Kica, Sabina Licholai, Michal Orzechowski, Maciej Malawski. 162-163
- A Lossless-Ethernet-Based Interconnect for FPGA Clusters Toward FTQC. Yoshito Higa, Yasunori Osana. 164-165
- Post-Route Power Estimation: A Case Study of RIKEN-CGRA. Chenlin Shi, Boma A. Adhi, Shinobu Miwa, Kentaro Sano. 166-167
- Scalable Connection of Qubits to Quantum Error Correction Systems Using Ethernet. Jan-Erik R. Wichmann, Kentaro Sano. 168-169
- Workload Analytics of LLMs Training on ABCI. Yusuke Tanimura, Naoki Onishi, Shin'ichiro Takizawa. 170-171
- Evaluating MPI Performance on SGX and Gramine. Kota Shimojima, Shinobu Miwa, Hayato Yamaki, Hiroki Honda. 172-173
- Investigating Nvidia GPU Architecture Trends via Microbenchmarks. Lingqi Zhang 0001, Ryan Barton, Peng Chen 0035, Xiao Wang 0004, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib. 174-175
- Leveraging Portals4 Microbenchmarks to Enhance GASPI Performance on BXI Networks. Niklas Bartelheimer, Sarah Neuwirth. 176-177
- Evaluation of Vectorization Methods on Arm SVE Using the Exo Language. Rin Iwai, Emil Vatai, Jens Domke, Yukinori Sato. 178-179
- Introduction of WHEEL: An Analysis Workflow Tool for Industrial Users and its Use Case on Supercomputer Fugaku. Tomohiro Kawanabe, Naoyuki Sogo, Kenji Ono. 180-181
- Heterogeneous Application Coupling Library for Center-Wide QC-HPC Hybrid Computing. Shinji Sumimoto, Kazuya Yamazaki, Yao Hu, Kengo Nakajima. 182-183
- Preliminary Performance Evaluation of Grace-Hopper GH200. Toshihiro Hanawa, Kengo Nakajima, Yohei Miki 0001, Takashi Shimokawabe, Kazuya Yamazaki, Shinji Sumimoto, Osamu Tatebe, Taisuke Boku, Daisuke Takahashi, Akira Nukada, Norihisa Fujita, Ryohei Kobayashi 0001, Hiroto Tadano, Akira Naruse. 184-185
- Performance Insights Into Supporting Kokkos Views in the Kokkos Comm MPI Library. C. Nicole Avans, Jan Ciesko, Carl Pearson, Evan Drake Suggs, Stephen L. Olivier, Anthony Skjellum. 186-187
- Toward Providing Root Privilege to Flagship HPC Users with Thin-Hypervisor. Takaaki Fukai, Manami Mori, Keiji Yamamoto, Takahiro Hirofuchi, Takuya Asaka. 188-189
- Cheetah: An Efficient Deterministic Concurrency Control Scheme with Non-Visible Write Elimination and Re-Designed Garbage Collection. Haowen Li, Rina Onishi, Hideyuki Kawashima. 190-191
- Using SYCLomatic to Migrate CUDA Code to oneAPI Adapting NVIDIA GPU. Wentao Liang, Norihisa Fujita, Ryohei Kobayashi 0001, Taisuke Boku. 192-193
- Preliminary Evaluation of Kyokko for Inter-FPGA Communication Framework CIRCUS. Kaito Kitazume, Norihisa Fujita, Ryohei Kobayashi 0001, Taisuke Boku. 194-195
- Asynchronous I/O Optimization for X-Ray Imaging via GPUDirect Storage. Du Wu, Peng Chen 0035, Yiyu Tan, Yusuke Tanimura, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib. 196-197
- FDPVirt: Flexible Data Placement SSD Emulator. Joonyeop Park, Haeram Kim, Jiwon Ha, Hyungsoo Jung 0001, Hyeonsang Eom. 198-199
- Enhanced Simulation and Analysis of Air Pollutants Using Multi-Platform HPC and In-Situ Visualization. Chongke Bi, Fumiyoshi Shoji, Kenji Ono, Naohisa Sakamoto, Jorji Nonaka, Honggang Yin, Wenjuan Cui. 200-201
- On the Building of a Common In-Situ Visualization Environment for Arm A64FX Supercomputers. Jorji Nonaka, Masahiro Nakao, Hitoshi Murai, Keiji Yamamoto, Masaaki Terai, Tomohiro Kawanabe, Toshihiko Kai, Fumiyoshi Shoji, Daichi Obinata, Hiroyuki Ito, Shunji Uno, Takanori Haga, Manabu Motokawa, Atsushi Fujino, Naoyuki Fujita, Seiji Tsutsumi, Atsushi Toyoda, Naohisa Sakamoto. 202-203
- Refining Compaction Offloading I/O Stack for LSM-Based Key-Value Stores with SPDK. Honghyeon Yoo, Hongsu Byun, Sungyong Park. 204-205