- Memory Demands in Disaggregated HPC: How Accurate Do We Need to Be? Felippe Vieira Zacarias, Paul M. Carpenter, Vinicius Petrucci. 1-6
- Architectural Requirements for Deep Learning Workloads in HPC Environments. Khaled Z. Ibrahim, Tan Nguyen, Hai Ah Nam, Wahid Bhimji, Steven Farrell, Leonid Oliker, Michael Rowan, Nicholas J. Wright, Samuel Williams. 7-17
- Multilevel simulation-based co-design of next generation HPC microprocessors. Lilia Zaourar, Mohamed Benazouz, Ayoub Mouhagir, Fatma Jebali, Tanguy Sassolas, Jean-Christophe Weill, Carlos Falquez, Nam Ho, Dirk Pleiter, Antoni Portero, Estela Suarez, Polydoros Petrakis, Vassilis Papaefstathiou, Manolis Marazakis, Milan Radulovic, Francesc Martínez, Adrià Armejach, Marc Casas, Alejandro Nocua, Romain Dolbeau. 18-29
- An Extended Roofline Performance Model with PCI-E and Network Ceilings. Amanda S. Dufek, Jack R. Deslippe, Paul T. Lin, Charlene J. Yang, Brandon G. Cook, Jonathan Madsen. 30-39
- Exploration of Congestion Control Techniques on Dragonfly-class HPC Networks Through Simulation. Neil McGlohon, Christopher D. Carothers, K. Scott Hemmert, Michael Levenhagen, Kevin A. Brown, Sudheer Chunduri, Robert B. Ross. 40-50
- Understanding power variation and its implications on performance optimization on the Cori supercomputer. Sridutt Bhalachandra, Brian Austin, Nicholas J. Wright. 51-62
- Using the Semi-Stencil Algorithm to Accelerate High-Order Stencils on GPUs. Ryuichi Sai, John M. Mellor-Crummey, Xiaozhu Meng, Mauricio Araya-Polo, Jie Meng. 63-68
- MicroBench Maker: Reproduce, Reuse, Improve. Sascha Hunold, Jordy I. Ajanohoun, Alexandra Carpen-Amarie. 69-74
- Enabling Cache Aware Roofline analysis with Portable Hardware Counter Metrics. Brian J. Gravelle, William David Nystrom, Dewi Yokelson, Boyana Norris. 75-81
- Customized Monte Carlo Tree Search for LLVM/Polly's Composable Loop Optimization Transformations. Jaehoon Koo, Prasanna Balaprakash, Michael Kruse, Xingfu Wu, Paul D. Hovland, Mary W. Hall. 82-93
- Comparing Julia to Performance Portable Parallel Programming Models for HPC. Wei-Chen Lin, Simon McIntosh-Smith. 94-105
- Bayesian Optimization for auto-tuning GPU kernels. Floris-Jan Willemsen, Rob van Nieuwpoort, Ben van Werkhoven. 106-117
- Narrowing the Search Space of Applications Mapping on Hierarchical Topologies. Nicolas Denoyelle, Swann Perarnau, Brice Videau, Pete Beckman, Emmanuel Jeannot. 118-128