Abstract is missing.
- Flexible batched sparse matrix-vector product on GPUsHartwig Anzt, Gary Collins, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí. [doi]
- Analyzing the criticality of transient faults-induced SDCS on GPU applicationsFernando Fernandes dos Santos, Paolo Rech. [doi]
- Dynamic task discovery in PaRSEC: a data-flow task-based runtimeReazul Hoque, Thomas Hérault, George Bosilca, Jack J. Dongarra. [doi]
- Dynamic load balancing of massively parallel unstructured meshesGerrett Diamond, Cameron W. Smith, Mark S. Shephard. [doi]
- Leveraging NVLINK and asynchronous data transfer to scale beyond the memory capacity of GPUsDavid Appelhans, Bob Walkup. [doi]
- Investigating half precision arithmetic to accelerate dense linear system solversAzzam Haidar, Panruo Wu, Stanimire Tomov, Jack J. Dongarra. [doi]
- A highly scalable, algorithm-based fault-tolerant solver for gyrokinetic plasma simulationsMichael Obersteiner, Alfredo Parra-Hinojosa, Mario Heene, Hans-Joachim Bungartz, Dirk Pflüger. [doi]
- Parallel jaccard and related graph clustering techniquesAlexandre Fender, Nahid Emad, Serge G. Petiton, Joe Eaton, Maxim Naumov. [doi]
- Application of a communication-avoiding generalized minimal residual method to a gyrokinetic five dimensional eulerian code on many core platformsYasuhiro Idomura, Takuya Ina, Akie Mayumi, S. Yamada, K. Matsumoto, Yuuichi Asahi, Toshiyuki Imamura. [doi]
- Snowpack: efficient parameter choice for GPU kernels via static analysis and statistical predictionRanvijay Singh, Paul Wood, Ravi Gupta, Saurabh Bagchi, Ignacio Laguna. [doi]