- Scaling molecular dynamics for large-scale simulation of biological systems on AMD CPU/GPU supercomputers: Lessons from LUMI: Optimizing GENESIS for maximizing the computational efficiency of CPU and GPU kernels on the LUMI supercomputer. Diego Ugarte La Torre, Jaewoon Jung, Yuji Sugita. 1-12
- Dimensionality Reduction-based Interactive Visual Analytics Approach for Investigating Ensemble Weather Simulations. Go Tamura, Sena Kobayashi, Naohisa Sakamoto, Yasumitsu Maejima, Jorji Nonaka. 13-22
- PBHS: A Prediction-Based Scheduler for Hyperparameter Tuning. Hong-Feng Yu, Cheng-Hsun Chang, Jerry Chou. 23-32
- Revisiting Memory Swapping for Big-Memory Applications. Shun Kida, Satoshi Imamura, Kenji Kono. 33-42
- PCIe Bandwidth-Aware Scheduling for Multi-Instance GPUs. Yan Mei Tang, Wei-Fang Sun, Hsu-Tzu Ting, Ming-Hung Chen, I-Hsin Chung, Jerry Chou. 43-51
- Scalable Dual Coordinate Descent for Kernel Methods. Zishan Shao, Aditya Devarakonda. 52-63
- Large Scale Ensemble Coupling of Non-hydrostatic Atmospheric Model NICAM. Takashi Arakawa, Hisashi Yashiro, Shinji Sumimoto, Kengo Nakajima. 64-71
- Accelerating General Relativistic Radiation Magnetohydrodynamic Simulations with GPUs. Ryohei Kobayashi, Hiroyuki R. Takahashi, Akira Nukada, Yuta Asahina, Taisuke Boku, Ken Ohsuga. 72-79
- Lattice QCD code on GPUs: Implementation and performance comparison with OpenACC and CUDA. Wei-Lun Chen, Issaku Kanamori, Hideo Matsufuru. 80-89
- ITTPD: In-place Tensor Transposition with Permutation Decomposition on GPUs. Kai-Jung Cheng, Che-Rung Lee. 90-98
- When HPC Scheduling Meets Active Learning: Maximizing The Performance with Minimal Data. Jiheon Choi, JaeHyun Lee, Minsol Choo, Taeyoung Yoon, Oh-Kyoung Kwon, Sangyoon Oh. 99-109
- A Full-Path Priority Based Workflow Scheduling Approach. Wei-Cheng Tseng, Kai-Yang Rong, Kuo-Chan Huang. 110-119
- Qsync: Extending Simplified SMR Protocol with Partial Network Partition Tolerance. Aoi Kida, Hideyuki Kawashima. 120-130
- Using a Large Language Model as a Building Block to Generate Usable Validation and Verification Suite for OpenMP. Swaroop Pophale, Wael R. Elwasif, David E. Bernholdt. 131-141
- Libra: A Python-Level Tensor Re-Materialization Strategy for Reducing Deep Learning GPU Memory Usage. Ling-Sung Wang, Sao-Hsuan Lin, Jerry Chou. 142-152
- Fast Malicious Packets Inspection Framework Using Converged Accelerator. Chuan-Ming Ou, Yong-Xuan Huang, Ming-Hung Chen, I-Hsin Chung, Jerry Chou. 153-161
- Lossy Compressed Collective Inter-FPGA Communications. Michihiro Koibuchi, Yoshinobu Ishida, Shoichi Hirasawa, Yao Hu, Takumi Honda, Yusuke Nagasaka, Naoto Fukumoto. 162-172