Abstract is missing.
- A Communication Optimization Scheme for Basis Computation of Krylov Subspace Methods on Multi-GPUsLangshi Chen, Serge G. Petiton, Leroy A. Drummond, Maxime R. Hugues. 3-16 [doi]
- Mixed-Precision Orthogonalization Scheme and Adaptive Step Size for Improving the Stability and Performance of CA-GMRES on GPUsIchitaro Yamazaki, Stanimire Tomov, Tingxing Dong, Jack Dongarra. 17-30 [doi]
- Heterogenous Acceleration for Linear Algebra in Multi-coprocessor EnvironmentsAzzam Haidar, Piotr Luszczek, Stanimire Tomov, Jack Dongarra. 31-42 [doi]
- A Study of SpMV Implementation Using MPI and OpenMP on Intel Many-Core ArchitectureFan Ye, Christophe Calvin, Serge G. Petiton. 43-56 [doi]
- SIMD Implementation of a Multiplicative Schwarz Smoother for a Multigrid Poisson Solver on an Intel Xeon Phi CoprocessorMasatoshi Kawai, Takeshi Iwashita, Hiroshi Nakashima. 57-65 [doi]
- Performance Optimization of the 3D FDM Simulation of Seismic Wave Propagation on the Intel Xeon Phi Coprocessor Using the ppOpen-APPL/FDM LibraryFutoshi Mori, Masaharu Matsumoto, Takashi Furumura. 66-76 [doi]
- Machine-Learning-Based Load Balancing for Community Ice Code Component in CESMPrasanna Balaprakash, Yuri Alexeev, Sheri A. Mickelson, Sven Leyffer, Robert L. Jacob, Anthony P. Craig. 79-91 [doi]
- Domain Decomposition for Heterojunction Problems in SemiconductorsTimothy Costa, David Foster, Malgorzata Peszynska. 92-101 [doi]
- A Hybrid Approach for Parallel Transistor-Level Full-Chip Circuit SimulationHeidi K. Thornquist, Sivasankaran Rajamanickam. 102-111 [doi]
- Self-adaptive Multiprecision Preconditioners on Multicore and Manycore ArchitecturesHartwig Anzt, Dimitar Lukarski, Stanimire Tomov, Jack Dongarra. 115-123 [doi]
- Fault Tolerance in an Inner-Outer Solver: A GVR-Enabled Case StudyZiming Zheng, Andrew A. Chien, Keita Teranishi. 124-132 [doi]
- Using Random Butterfly Transformations to Avoid Pivoting in Sparse Direct MethodsMarc Baboulin, Xiaoye S. Li, François-Henry Rouet. 135-144 [doi]
- Hybrid Sparse Linear Solutions with Substituted FactorizationJoshua Dennis Booth, Padma Raghavan. 145-155 [doi]
- Modeling 1D Distributed-Memory Dense Kernels for an Asynchronous Multifrontal Sparse SolverPatrick Amestoy, Jean-Yves L'Excellent, François-Henry Rouet, Wissam M. Sid-Lakhdar. 156-169 [doi]
- Performance Characteristics of HYDRA - A Multi-physics Simulation Code from LLNLSteven H. Langer, Ian Karlin, Michael M. Marinak. 173-181 [doi]
- Accelerating Computation of Eigenvectors in the Dense Nonsymmetric Eigenvalue ProblemMark Gates, Azzam Haidar, Jack Dongarra. 182-191 [doi]
- Low Byte/Flop Implementation of Iterative Solver for Sparse Matrices Derived from Stencil ComputationsKenji Ono, Shuichi Chiba, Shunsuke Inoue, Kazuo Minami. 192-205 [doi]
- Environment-Sensitive Performance Tuning for Distributed Service OrchestrationYu Lin, Franjo Ivancic, Pallavi Joshi, Gogul Balakrishnan, Malay K. Ganai, Aarti Gupta. 209-223 [doi]
- Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific ApplicationsShahzeb Siddiqui, Fatemah AlZayer, Saber Feki. 224-235 [doi]
- Capturing the Expert: Generating Fast Matrix-Multiply Kernels with SpiralRichard Veras, Franz Franchetti. 236-244 [doi]
- A Study on the Influence of Caching: Sequences of Dense Linear Algebra KernelsElmar Peise, Paolo Bientinesi. 245-258 [doi]
- Toward Restarting Strategies Tuning for a Krylov Eigenvalue SolverFrance Boillod-Cerneux, Serge G. Petiton, Christophe Calvin, Leroy A. Drummond. 259-268 [doi]
- Performance Analysis of the Householder-Type Parallel Tall-Skinny QR Factorizations Toward Automatic Algorithm SelectionTakeshi Fukaya, Toshiyuki Imamura, Yusaku Yamamoto. 269-283 [doi]
- Automatic Parameter Tuning of Three-Dimensional Tiled FDTD KernelTakeshi Minami, Motoharu Hibino, Tasuku Hiraishi, Takeshi Iwashita, Hiroshi Nakashima. 284-297 [doi]
- Automatic Parameter Tuning of Hierarchical Incremental CheckpointingAlfian Amrizal, Shoichi Hirasawa, Hiroyuki Takizawa, Hiroaki Kobayashi. 298-309 [doi]