- Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX. Christie L. Alappat, Jan Laukemann, Thomas Gruber, Georg Hager, Gerhard Wellein, Nils Meyer, Tilo Wettig. 1-7
- The Performance and Energy Efficiency Potential of FPGAs in Scientific Computing. Tan Nguyen, Samuel Williams, Marco Siracusa, Colin MacLean, Douglas Doerfler, Nicholas J. Wright. 8-19
- Benchmarking Julia's Communication Performance: Is Julia HPC ready or Full HPC? Sascha Hunold, Sebastian Steiner. 20-25
- Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations. Hartwig Anzt, Yuhsiang M. Tsai, Ahmad Abdelfattah, Terry Cojean, Jack J. Dongarra. 26-38
- Exploiting the Potentials of the Second Generation SX-Aurora TSUBASA. Ryusuke Egawa, Souya Fujimoto, Tsuyoshi Yamashita, Daisuke Sasaki, Yoko Isobe, Yoichi Shimomura, Hiroyuki Takizawa. 39-49
- Lightweight Measurement and Analysis of HPC Performance Variability. Jered Dominguez-Trujillo, Keira Haskins, Soheila Jafari Khouzani, Christopher Leap, Sahba Tashakkori, Quincy Wofford, Trilce Estrada, Patrick G. Bridges, Patrick M. Widener. 50-60
- Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization. Xingfu Wu, Michael Kruse, Prasanna Balaprakash, Hal Finkel, Paul D. Hovland, Valerie E. Taylor, Mary W. Hall. 61-70
- Warwick Data Store: A Data Structure Abstraction Library. Richard O. Kirk, Martin Nolten, Robert Kevis, Timothy R. Law, Satheesh Maheswaran, Steven A. Wright, Seimon Powell, Gihan R. Mudalige, Stephen A. Jarvis. 71-85
- Accelerating High-Order Stencils on GPUs. Ryuichi Sai, John M. Mellor-Crummey, Xiaozhu Meng, Mauricio Araya-Polo, Jie Meng. 86-108
- Developing Models for the Runtime of Programs With Exponential Runtime Behavior. Michael Burger, Giang Nam Nguyen, Christian Bischof. 109-125
- Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches. Taylor Groves, Ben Brock, Yuxin Chen, Khaled Z. Ibrahim, Lenny Oliker, Nicholas J. Wright, Samuel Williams, Katherine A. Yelick. 126-137
- Evaluation of the Communication Motif for a Distributed Eigensolver using the SST Network Simulation Tool. Md. Afibuzzaman, Pieter Maris, Taylor Groves, Dossay Oryspayev, Brandon Cook, Chao Yang, Hasan Metin Aktulga. 138-148