Abstract is missing.
- Extreme Data Science at the National Energy Research Scientific Computing (NERSC) CenterSudip S. Dosanjh, Shane Canon, Jack Deslippe, Kjiersten Fagnan, Richard Gerber, Lisa Gerhardt, Jason Hick, Douglas Jacobsen, David Skinner, Nicholas J. Wright. 3-18 [doi]
- Performance Analysis Techniques for the Exascale Co-Design ProcessMartin Schulz, Jim Belak, Abhinav Bhatele, Peer-Timo Bremer, Greg Bronevetsky, Marc Casas, Todd Gamblin, Katherine E. Isaacs, Ignacio Laguna, Joshua A. Levine, Valerio Pascucci, David Richards, Barry Rountree. 19-32 [doi]
- XMP-IO function and its application to MapReduce on the K computerTomotake Nakamura, Mitsuhisa Sato. 35-42 [doi]
- POLCA - A Programming Model for Large Scale, Strongly Heterogeneous InfrastructuresLutz Schubert, Jan Kuper, José Gracia. 43-52 [doi]
- Exploitation of Quality/Throughput Tradeoffs in Image Processing through Invasive ComputingAlexandru Tanase, Vahid Lari, Frank Hannig, Jürgen Teich. 53-62 [doi]
- An Efficient Thread Mapping Strategy for Multiprogramming on Manycore ProcessorsAshkan Tousimojarad, Wim Vanderbauwhede. 63-71 [doi]
- A Scalable Farm Skeleton for Heterogeneous Parallel ProgrammingSteffen Ernsting, Herbert Kuchen. 72-81 [doi]
- Towards Truly Boolean Arrays in Data-Parallel Array ProcessingClemens Grelck, Hraban Luyat. 82-91 [doi]
- Deep Packet Inspection on Commodity Hardware using FastFlowMarco Danelutto, Luca Deri, D. De Sensi, Massimo Torquati. 92-99 [doi]
- Formalizing Bottlenecks in Task-Based OpenMP ApplicationsShajulin Benedict, Michael Gerndt, Diana-Mihaela Gudu. 103-112 [doi]
- Characterizing Performance of Applications on Blue Gene/QPaul F. Baumeister, Hans Boettiger, Thorsten Hater, Michael Knobloch, Thilo Maurer, Andrea Nobile, Dirk Pleiter, Nicolas Vandenbergen. 113-122 [doi]
- Specification of Periscope Tuning Framework PluginsRobert Mijakovic, Antonio Pimenta Soto, Isaías A. Comprés Ureña, Michael Gerndt, Anna Sikora, Eduardo César. 123-132 [doi]
- On Using Speculative Computations for Parallel Reduction to Tridiagonal FormSergey Kuznetsov. 135-142 [doi]
- Fast Approximate Solution of the Non-Symmetric Generalized Eigenvalue Problem on Multicore ArchitecturesPeter Benner, Martin Köhler, Jens Saak. 143-152 [doi]
- Locality Optimization on a NUMA Architecture for Hybrid LU FactorizationAdrien Rémy, Marc Baboulin, Masha Sosonkina, Brigitte Rozoy. 153-162 [doi]
- Variable Block Algebraic Recursive Multilevel Solver (VBARMS) for Sparse Linear SystemsBruno Carpentieri, Jia Liao, Masha Sosonkina. 163-172 [doi]
- A Proposal of a Single-Synchronized Solver Suited to Large Scale Linear Systems on Parallel Computers with Distributed MemorySeiji Fujino, Keiichi Murakami, Kosuke Iwasato. 173-182 [doi]
- Approximate Inverse Preconditioners for Krylov Methods on Heterogeneous Parallel ComputersDaniele Bertaccini, Salvatore Filippone. 183-192 [doi]
- Cache and Energy Efficiency of Sparse Matrix-Vector Multiplication for Different BLAS Numerical Types with the RSB FormatMichele Martone. 193-202 [doi]
- Heterogeneous Sparse Matrix Computations on Hybrid GPU/CPU PlatformsValeria Cardellini, Alessandro Fanfarillo, Salvatore Filippone. 203-212 [doi]
- MapReduce Streaming Algorithms for Laplace Relaxation on the CloudAtanas Radenski, Boyana Norris. 215-224 [doi]
- Space Exploration using Parallel Orbits: a Study in Parallel Symbolic ComputingVladimir Janjic, Christopher Brown, Max Neunhöffer, Kevin Hammond, Steve Linton, Hans-Wolfgang Loidl. 225-232 [doi]
- SFC-based Communication Metadata Encoding for Adaptive Mesh RefinementMartin Schreiber, Tobias Weinzierl, Hans-Joachim Bungartz. 233-242 [doi]
- Graph Repartitioning with both Dynamic Load and Dynamic Processor AllocationClement Vuchener, Aurélien Esnard. 243-252 [doi]
- ForestClaw: Hybrid forest-of-octrees AMR for hyperbolic conservation lawsCarsten Burstedde, Donna Calhoun, Kyle T. Mandli, Andy R. Terrel. 253-262 [doi]
- A space-time parallel solver for the three-dimensional heat equationRobert Speck, Daniel Ruprecht, Matthew Emmett, Matthias Bolten, Rolf Krause. 263-272 [doi]
- An Efficient Pipelined Implementation of Space-Time Parallel ApplicationsToshiya Takami, Daiki Fukudome. 273-281 [doi]
- Efficient GPU-based Optimization of Volume MeshesEric Shaffer, Zuofu Cheng, Raine Yeh, George Zagaris, Luke Olson. 285-294 [doi]
- Fast Uniform Grid Construction on GPGPUs Using Atomic OperationsDavide Barbieri, Valeria Cardellini, Salvatore Filippone. 295-304 [doi]
- Porting Large HPC Applications to GPU Clusters: The Codes GENE and VERTEXTilman Dannert, Andreas Marek, Markus Rampp. 305-314 [doi]
- Numerical Simulation of the Low Compressible Viscous Gas Flows on GPU-based Hybrid SupercomputersAlexander A. Davydov, Evgeny V. Shilnikov. 315-323 [doi]
- Simulation of Multiphase Flows in the Subsurface on GPU-based SupercomputersMarina A. Trapeznikova, Natalia G. Churbanova, Anastasiya Lyupa, Dmitry Morozov. 324-333 [doi]
- Atomic computing - a different perspective on massively parallel problemsAndrew D. Brown, Rob Mills, Jeffrey S. Reeve, Kier Dugan, Steve Furber. 334-343 [doi]
- Accelerating SeisSol by Generating Vectorized Code for Sparse Matrix OperatorsAlexander Breuer, Alexander Heinecke, Michael Bader, Christian Pelties. 347-356 [doi]
- Experience with the MPI/STARSS programming model on a large production codeDirk Brömmel, Paul Gibbon, Marta Garcia, Víctor López, Vladimir Marjanovic, Jesús Labarta. 357-366 [doi]
- Exploiting Data- and Task-Parallelism in the Solution of Riccati Equations on Multicore Servers and GPUsPeter Benner, Pablo Ezzatti, Enrique S. Quintana-Ortí, Alfredo Remón. 367-374 [doi]
- Testing and Implementing Some New Algorithms Using the FFTW Library on Massively Parallel SupercomputersMassimiliano Guarrasi, Ning Li, Sandro Frigio, Andrew P. J. Emerson, Giovanni Erbacci. 375-386 [doi]
- Performance Measurements of MHD Simulation for Planetary Magnetosphere on Peta-Scale Computer FX10Keiichiro Fukazawa, Takeshi Nanri, Takayuki Umeda. 387-394 [doi]
- Parallel Simulations of Self-propelled MicroorganismsKristina Pickl, Matthias Hofmann, Tobias Preclik, Harald Köstler, Ana-Suncana Smith, Ulrich Rüde. 395-404 [doi]
- Improving Communication Performance of Sparse Linear Algebra for an Atomistic Simulation ApplicationChristiane Pousa Ribeiro, Jürg Hutter, Joost VandeVondele. 405-414 [doi]
- NEMORB's Fourier Filter and Distributed Matrix Transposition on Petaflop SystemsTiago Ribeiro, Matthieu Haefele. 415-426 [doi]
- Parallel Computing Design for Exact Diagonalization Scheme on Multi-band Hubbard Cluster ModelsSusumu Yamada, Toshiyuki Imamura, Masahiko Machida. 427-436 [doi]
- ParCo 2013 PhD SymposiumJosef Weidendorfer, Michael Bader. 439-440 [doi]
- Numerical Experiments with New Algorithms for Parallel Decomposition of Large Computational MeshesEvdokia Golovchenko, Elizaveta Dorofeeva, Irina Gasilova, Alexei S. Boldarev. 441-450 [doi]
- A distributed algorithm for the Permutation Flow Shop Problem - An empirical analysisSamia Kouki, Mohamed Jemni, Talel Ladhari. 451-460 [doi]
- GPI2 for GPUs: A PGAS framework for efficient communication in hybrid clustersLena Oden. 461-470 [doi]
- A fault tolerant implementation of Multi-Level Monte Carlo methodsStefan Pauli, Manuel Kohler, Peter Arbenz. 471-480 [doi]
- High performance CPU/GPU multiresolution Poisson solverWim M. van Rees, Diego Rossinelli, Panagiotis E. Hadjidoukas, Petros Koumoutsakos. 481-490 [doi]
- ParaFPGA 2013: Harnessing Programs, Power and Performance in Parallel FPGA applicationsErik H. D'Hollander, Dirk Stroobandt, Abdellah Touhafi. 493-496 [doi]
- High-Level Synthesis Revised - Generation of FPGA Accelerators from a Domain-Specific Language using the Polyhedron ModelMoritz Schmid, Frank Hannig, Alexandru Tanase, Jürgen Teich. 497-506 [doi]
- Compiling a Dataflow-based Language Abstraction onto an FPGAEva Burrows. 507-514 [doi]
- Timing Driven C-Slow Retiming on RTL for MultiCores on FPGAsTobias Strauch. 515-522 [doi]
- Performance and Resource Modeling for FPGAs using High-Level Synthesis toolsBruno da Silva, An Braeken, Erik H. D'Hollander, Abdellah Touhafi. 523-531 [doi]
- Interactive Graph Cuts using FPGADaichi Kobori, Tsutomu Maruyama. 532-539 [doi]
- An Image Filter System based on dynamic partial reconfiguration on FPGAHisaaki Kurita, Tsutomu Maruyama. 540-547 [doi]
- Investigating Energy Consumption of an SRAM-based FPGA for Duty-Cycle ApplicationsKhurram Shahzad, Bengt Oelmann. 548-559 [doi]
- High-Dimensional Meets Parallel: Algorithms and ApplicationsHans-Joachim Bungartz, Dirk Pflüger, Markus Hegland. 563-563 [doi]
- Global Communication Schemes for the Sparse Grid Combination TechniquePhilipp Hupp, Riko Jacob, Mario Heene, Dirk Pflüger, Markus Hegland. 564-573 [doi]
- Load Balancing for Massively Parallel Computations with the Sparse Grid Combination TechniqueMario Heene, Christoph Kowitz, Dirk Pflüger. 574-583 [doi]
- A Parallel Fault Tolerant Combination TechniqueBrendan Harding, Markus Hegland. 584-592 [doi]
- Managing Complexity in the Parallel Sparse Grid Combination TechniqueJay W. Larson, Peter E. Strazdins, Markus Hegland, Brendan Harding, Stephen Roberts, Linda Stals, Alistair P. Rendell, M. M. Ali, James Southern. 593-602 [doi]
- Scalability and Fault Tolerance of the Alternating Direction Method of Multipliers for Sparse GridsValeriy Khakhutskyy, Dirk Pflüger, Markus Hegland. 603-612 [doi]
- Mini-Symposium on Application Autotuning for HPCSiegfried Benkner, Matthias Brehm, Michael Gerndt, Wolfram Hesse, Anna Sikora. 615-615 [doi]
- Investigating Performance Benefits from OpenACC Kernel DirectivesBenjamin Eagan, Gilles Civario, Renato Miceli. 616-625 [doi]
- Application-independent Autotuning for GPUsMartin Tillmann, Thomas Karcher, Carsten Dachsbacher, Walter F. Tichy. 626-635 [doi]
- Autotuning of Pattern Runtimes for Accelerated Parallel SystemsEnes Bajrovic, Siegfried Benkner, Jirí Dokulil, Martin Sandrieser. 636-645 [doi]
- Empirical performance modeling of GPU kernels using active learningPrasanna Balaprakash, Karl Rupp, Azamat Mametjanov, Robert B. Gramacy, Paul D. Hovland, Stefan M. Wild. 646-655 [doi]
- Crowdtuning: systematizing auto-tuning using predictive modeling and crowdsourcingAbdul Wahid Memon, Grigori Fursin. 656-667 [doi]
- Autotuning the energy consumptionCarmen B. Navarrete, Carla Guillen, Wolfram Hesse, Matthias Brehm. 668-677 [doi]
- Potentials and Limitations for Energy Efficiency Auto-TuningRobert Schöne, Andreas Knüpfer, Daniel Molka. 678-687 [doi]
- Extreme Scaling Workshop at the LRZMomme Allalen, Gurvan Bazin, Christoph Bernau, Arndt Bode, David Brayford, Matthias Brehm, Jürg Diemand, Klaus Dolag, Jan Engels, Nicolay Hammer, Herbert Huber, Ferdinand Jamitzky, Anupam Kamakar, Carsten Kutzner, Andreas Marek, Carmen B. Navarrete, Helmut Satzger, Wolfram Schmidt, Philipp Trisjono. 691-697 [doi]
- Extreme Scaling of Lattice Quantum ChromodynamicsDavid Brayford, Momme Allalen, Volker Weinberg. 698-702 [doi]
- End-to-end Parallel Simulations with APESHarald Klimach, Kartik Jain, Sabine Roller. 703-711 [doi]
- Towards Petaflops Capability of the VERTEX Supernova CodeAndreas Marek, Markus Rampp, Florian Hanke, Hans-Thomas Janka. 712-721 [doi]
- Scaling of the GROMACS 4.6 molecular dynamics code on SuperMUCCarsten Kutzner, Rossen Apostolov, Berk Hess, Helmut Grubmüller. 722-727 [doi]
- Parallel Programming for Heterogeneous ArchitecturesBettina Krammer, Hartmut Mix, Markus Geimer. 731-732 [doi]
- Execution Schemes for the NPB-MZ Benchmarks on Hybrid Architectures: A Comparative StudyJörg Dümmler, Gudula Rünger. 733-742 [doi]
- Scilab on a hybrid platformVictor Lomüller, Sylvestre Ledru, Henri-Pierre Charles. 743-752 [doi]
- Divide and Conquer Parallelization of Finite Element Method AssemblyLoïc Thébault, Eric Petit, Marc Tchiboukdjian, Quang Dinh, William Jalby. 753-762 [doi]
- Cudagrind: A Valgrind Extension for CUDAThomas M. Baumann, José Gracia. 763-772 [doi]
- Profiling Hybrid HMPP Applications with Score-P on Heterogeneous HardwareMarc Schlütter, Peter Philippen, Laurent Morin, Markus Geimer, Bernd Mohr. 773-782 [doi]
- Binary Instrumentation for Scalable Performance Measurement of OpenMP ApplicationsJulien Jaeger, Peter Philippen, Eric Petit, Andres Charif Rubial, Christian Rössel, William Jalby, Bernd Mohr. 783-792 [doi]
- A Case Study: Holistic Performance Analysis on Heterogeneous Architectures using the Vampir ToolchainRobert Dietrich, Frank Winkler, Thomas William, Jonas Stolle, Robert Henschel, Donald K. Berry. 793-802 [doi]
- PRACE DECI (Distributed European Computing Initiative) MinisymposiumChris Johnson, Anastasia V. Bochenkova, Alexander A. Granovsky, Peter J. Bond, Teresa Paramo, Tristan Glatard, William A. Romero R., Denis Friboulet, Stefan J. Zasada, Peter V. Coveney. 805-812 [doi]
- A Generic Prototype to Benchmark Algorithms and Data Structures for Hierarchical Hybrid GridsSebastian Kuckuk, Björn Gmeiner, Harald Köstler, Ulrich Rüde. 813-822 [doi]
- Towards a Performance Engineering Workflow for OpenMP 4.0Dirk Schmidl, Christian Iwainsky, Christian Terboven, Christian H. Bischof, Matthias S. Müller. 823-832 [doi]
- Theoretical Measures of Cache Efficiency for Tetrahedral Adaptive Meshes - A Case Study with a Quasi Space-Filling Curve OrderOliver Kunst, Jörn Behrens. 833-842 [doi]