Abstract is missing.
- ML-based Performance Portability for Time-Dependent Density Functional Theory in HPC EnvironmentsAdrián Pérez Diéguez, Min Choi, Xinran Zhu, Bryan M. Wong, Khaled Z. Ibrahim. 1-12 [doi]
- A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning WorkloadsMurali Emani, Zhen Xie, Siddhisanket Raskar, Varuni Sastry, William Arnold, Bruce Wilson, Rajeev Thakur, Venkatram Vishwanath, Zhengchun Liu, Michael E. Papka, Cindy Orozco Bohorquez, Rick Weisner, Karen Li, Yongning Sheng, Yun Du, Jian Zhang, Alexander Tsyplikhin, Gurdaman Khaira, Jeremy Fowers, Ramakrishnan Sivakumar, Victoria Godsoe, Adrian Macias, Chetan Tekur, Matthew Boyd. 13-25 [doi]
- Frontier vs the Exascale Report: Why so long? and Are We Really There Yet?Peter M. Kogge, William J. Dally. 26-35 [doi]
- Evaluating ISO C++ Parallel Algorithms on Heterogeneous HPC SystemsWei-Chen Lin, Tom Deakin, Simon McIntosh-Smith. 36-47 [doi]
- Going green: optimizing GPUs for energy efficiency through model-steered auto-tuningRichard Schoonhoven, Bram Veenboer, Ben van Werkhoven, Kees Joost Batenburg. 48-59 [doi]
- Performance Analysis with Unified Hardware Counter MetricsBrian J. Gravelle, William David Nystrom, Boyana Norris. 60-70 [doi]
- A Methodology for Evaluating Tightly-integrated and Disaggregated Accelerated ArchitecturesTaylor Groves, Christopher S. Daley, Rahulkumar Gayatri, Hai Ah Nam, Nan Ding 0006, Lenny Oliker, Nicholas J. Wright, Samuel Williams 0001. 71-81 [doi]
- Benchmarking Fortran DO CONCURRENT on CPUs and GPUs Using BabelStreamJeff R. Hammond, Tom Deakin, James Cownie, Simon McIntosh-Smith. 82-99 [doi]
- WfBench: Automated Generation of Scientific Workflow BenchmarksTainã Coleman, Henri Casanova, Ketan Maheshwari, Loïc Pottier, Sean R. Wilkinson, Justin M. Wozniak, Frédéric Suter, Mallikarjun Shankar, Rafael Ferreira da Silva. 100-111 [doi]
- High-Performance GMRES Multi-Precision Benchmark: Design, Performance, and ChallengesIchitaro Yamazaki, Christian A. Glusa, Jennifer A. Loe, Piotr Luszczek, Sivasankaran Rajamanickam, Jack J. Dongarra. 112-122 [doi]
- OMPICollTune: Autotuning MPI Collectives by Incremental Online LearningSascha Hunold, Sebastian Steiner. 123-128 [doi]
- AppEKG: A Simple Unifying View of HPC Applications in ProductionMohammad Al-Tahat, Strahinja Trecakov, Jonathan Cook. 129-134 [doi]
- An Initial Evaluation of Arm's Scalable Matrix ExtensionFinn Wilkinson, Simon McIntosh-Smith. 135-140 [doi]
- Time-series ML-regression on Graphcore IPU-M2000 and Nvidia A100Jan Balewski, Zhenying Liu, Alexander Tsyplikhin, Manuel Lopez Roland, Kristofer E. Bouchard. 141-146 [doi]