143 | -- | 144 | Julian M. Kunkel, Thomas Ludwig 0002, Hans Werner Meuer. ISC 11 research paper sessions proceedings |
145 | -- | 151 | R. Spurzem, P. Berczik, I. Berentzen, K. Nitadori, T. Hamada, G. Marcus, Andreas Kugel, Reinhard Männer, J. Fiestas, R. Banerjee, R. Klessen. Astrophysical particle simulations with large custom GPU clusters on three continents |
153 | -- | 164 | Matthias Bach, Matthias Kretz, Volker Lindenstruth, David Rohr. Optimized HPL for AMD GPU and multi-core CPU usage |
165 | -- | 174 | Sandra Wienke, Dmytro Plotnikov, Dieter an Mey, Christian H. Bischof, Ario Hardjosuwito, Christof Gorgels, Christian Brecher. Simulation of bevel gear cutting with GPGPUs - performance and productivity |
175 | -- | 185 | J. A. Davis, Gihan R. Mudalige, Simon D. Hammond, J. A. Herdman, I. Miller, Stephen A. Jarvis. Predictive analysis of a hydrodynamics application on large-scale CMP clusters |
187 | -- | 195 | Adrian Jackson, M. Sergio Campobasso. Shared-memory, distributed-memory, and mixed-mode parallelisation of a CFD simulation code |
197 | -- | 203 | L. H. Han, T. Indinger, X. Y. Hu, N. A. Adams. Wavelet-based adaptive multi-resolution solver on heterogeneous parallel architecture for computational fluid dynamics |
205 | -- | 210 | Matthias Christen, Olaf Schenk, Helmar Burkhart. Automatic code generation and tuning for stencil kernels on modern shared memory architectures |
211 | -- | 220 | Michael Deisher, Mikhail Smelyanskiy, Brian Nickerson, Victor W. Lee, Michael Chuvelev, Pradeep Dubey. Designing and dynamically load balancing hybrid LU for multi/many-core |
221 | -- | 228 | Malte Förster, Jiri Kraus. Scalable parallel AMG on ccNUMA machines with OpenMP |
229 | -- | 236 | Rui Machado, Carsten Lojewski, Salvador Abreu, Franz-Josef Pfreundt. Unbalanced tree search on a manycore system using the GPI programming model |
237 | -- | 246 | Krishna Chaitanya Kandalla, Hari Subramoni, Karen A. Tomko, Dmitry Pekurovsky, Sayantan Sur, Dhabaleswar K. Panda. High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT |
247 | -- | 256 | Pavan Balaji, Rinku Gupta, Abhinav Vishnu, Peter H. Beckman. Mapping communication layouts to network hardware characteristics on massive-scale blue gene systems |
257 | -- | 266 | Hao Wang, Sreeram Potluri, Miao Luo, Ashish Kumar Singh, Sayantan Sur, Dhabaleswar K. Panda. MVAPICH2-GPU: optimized GPU to GPU communication for InfiniBand clusters |
267 | -- | 273 | Gilad Shainer, Ali Ayoub, Pak Lui, Tong Liu, Michael Kagan, Christian Trott, Greg Scantlen, Paul S. Crozier. The development of Mellanox/NVIDIA GPUDirect over InfiniBand - a new model for GPU to GPU communications |
275 | -- | 283 | Wolfgang Frings, Michael Hennecke. A system level view of Petascale I/O on IBM Blue Gene/P |
285 | -- | 295 | Narate Taerat, Jim M. Brandt, Ann C. Gentile, Matthew Wong, Chokchai Leangsuksun. Baler: deterministic, lossless log message clustering tool |
297 | -- | 305 | Yevgeniy Vorobeychik, Jackson Mayo, Robert C. Armstrong, Ronald Minnich, Don W. Rudish. Fault oblivious high performance computing with dynamic task replication and substitution |
307 | -- | 315 | Davide Pasetto, Karol Lynch, Robert Tucker, Brendan Maguire, Fabrizio Petrini, Hubertus Franke. Ultra low latency market data feed on IBM PowerEN:::TM::: |
317 | -- | 324 | Michael Oberg, Matthew Woitaszek, Theron Voran, Henry M. Tufo. A system architecture supporting high-performance and cloud computing in an academic consortium environment |
325 | -- | 337 | Jack B. Dennis, Guang R. Gao, Xiao X. Meng. Experiments with the Fresh Breeze tree-based memory model |