Abstract is missing.
- Bio-Inspired Massively-Parallel ComputationSteve Furber. 3-10 [doi]
- Automatic Tuning of Task Scheduling Policies on Multicore ArchitecturesAkshatha Bhat, Andrew Lenharth, Donald Nguyen, Qing Yi, Keshav Pingali. 11-21 [doi]
- Algorithmic scheme for hybrid computing with CPU, Xeon-Phi/MIC and GPU devices on a single machineSylvain Contassot-Vivier, Stéphane Vialle. 25-34 [doi]
- A Many-Core Machine Model for Designing Algorithms with Minimum Parallelism OverheadsSardar Anisul Haque, Marc Moreno Maza, Ning Xie 0001. 35-44 [doi]
- CPU performance analysis using Score-P on PRIMEHPC FX100 SupercomputerTomotake Nakamura. 45-51 [doi]
- Performance improvements of polydisperse DEM simulations using a loose octree approachG. Stein, S. Wirtz, V. Scherer. 53-62 [doi]
- Execution Performance Analysis of the ABySS Genome Sequence Assembler using Scalasca on the K ComputerItaru Kitayama, Brian J. N. Wylie, Toshiyuki Maeda. 63-72 [doi]
- Performance model based on memory footprint for OpenMP memory bound applicationsCésar Allande, Josep Jorba, Anna Sikora, Eduardo César. 73-82 [doi]
- Evaluating OpenMP Performance on Thousands of Cores on the Numascale ArchitectureDirk Schmidl, Atle Vesterkjær, Matthias S. Müller. 83-92 [doi]
- Acceleration of Large Scale OpenFOAM Simulations on Distributed Systems with Multicore CPUs and GPUsBoris Krasnopolsky, Alexey Medvedev. 93-102 [doi]
- Optimized variant-selection code generation for loops on heterogeneous multicore systemsErik Hansson, Christoph W. Kessler. 103-112 [doi]
- MPI communication on MPPA Many-core NoC: design, modeling and performance issuesMinh Quan Ho, Bernard Tourancheau, Christian Obrecht, Benoît Dupont de Dinechin, Jérôme Reybert. 113-122 [doi]
- Drivers for Device to Device StreamingDominic Eschweiler, Volker Lindenstruth. 123-132 [doi]
- Portable Parallelization of the EDGE CFD Application for GPU-based Systems using the SkePU Skeleton Programming LibraryOskar Sjöström, Soon-Heum Ko, Usman Dastgeer, Lu Li, Christoph W. Kessler. 135-144 [doi]
- Structured parallel implementation of Tree Echo State Network model selectionMarco Danelutto, Claudio Gallicchio, Alessio Micheli, Massimo Torquati, Daniele Virgilio. 145-154 [doi]
- Java Implementation of Data Parallel Skeletons on GPUsSteffen Ernsting, Herbert Kuchen. 155-164 [doi]
- Data parallel patterns in Erlang/OpenCLUgo Albanese, Marco Danelutto. 165-174 [doi]
- Hybrid Coarrays: a PGAS Feature for Many-Core ArchitecturesValeria Cardellini, Alessandro Fanfarillo, Salvatore Filippone, Damian W. I. Rouson. 175-184 [doi]
- Lapedo: Hybrid Skeletons for Programming Heterogeneous Multicore Machines in ErlangVladimir Janjic, Christopher Brown, Kevin Hammond. 185-195 [doi]
- Evaluation of 3-D Stencil Codes on the Intel Xeon Phi CoprocessorMario Hernández, Juan M. Cebrian, José M. Cecilia, José M. García 0001. 197-206 [doi]
- Hierarchical Parallelism in a Physical Modelling Synthesis CodeJames Perry, Stefan Bilbao, Alberto Torin. 207-216 [doi]
- Harnessing CUDA Dynamic Parallelism for the Solution of Sparse Linear SystemsJosé Ignacio Aliaga, Davor Davidovic, Joaquín Pérez Ortega, Enrique S. Quintana-Ortí. 217-226 [doi]
- Model-Driven Development of GPU ApplicationsChristoph Winter, Jan Dünnweber. 227-236 [doi]
- Exploring the Offload Execution Model in the Intel Xeon Phi via Matrix InversionPeter Benner, Pablo Ezzatti, Enrique S. Quintana-Ortí, Alfredo Remón. 237-246 [doi]
- Programming GPUs with C++14 and Just-In-Time CompilationMichael Haidl, Bastian Hagedorn, Sergei Gorlatch. 247-256 [doi]
- Active Packet Pacing as a Congestion Avoidance Technique in Interconnection NetworkHidetomo Shibamura. 257-264 [doi]
- Hybrid Parallelization of Hyper-Dimensional Vlasov Code with OpenMP Loop Collapse DirectiveTakayuki Umeda, Keiichiro Fukazawa. 265-274 [doi]
- Active Resource Management for Multi-Core Runtime Systems Serving Malleable ApplicationsClemens Grelck. 275-284 [doi]
- Improving Energy-Efficiency of Static Schedules by Core Consolidation and Switching Off Unused CoresNicolas Melot, Christoph W. Kessler, Jörg Keller 0001. 285-294 [doi]
- Efficient Parallel Linked List ProcessingAshkan Tousimojarad, Wim Vanderbauwhede. 295-304 [doi]
- Streams as an alternative to halo exchangeDaniel J. Holmes, Caoimhín Laoide-Kemp. 305-316 [doi]
- An Embedded C++ Domain-Specific Language for Stream ParallelismDalvan Griebler, Marco Danelutto, Massimo Torquati, Luiz Gustavo Fernandes. 317-326 [doi]
- Pipeline Template for Streaming Applications on Heterogeneous ChipsAndrés Rodríguez, Angeles G. Navarro, Rafael Asenjo, Francisco Corbera, Antonio Vilches, María Jesús Garzarán. 327-336 [doi]
- Efficient and scalable distributed-memory hierarchization algorithms for the sparse grid combination techniqueMario Heene, Dirk Pflüger. 339-348 [doi]
- Adapting a Finite-Element Type Solver for Bioelectromagnetics to the DEEP-ER PlatformRaphaël Léger, Damián A. Mallón, Alejandro Duran, Stéphane Lanteri. 349-359 [doi]
- High Performance Eigenvalue Solver in Exact-diagonalization Method for Hubbard Model on CUDA GPUSusumu Yamada, Toshiyuki Imamura, Masahiko Machida. 361-369 [doi]
- A general tridiagonal solver for coprocessors: Adapting g-Spike for the Intel Xeon PhiIoannis E. Venetis, Alexandros Sobczyk, Alexandros Kouris, Alexandros Nakos, Nikolaos Nikoloutsakos, Efstratios Gallopoulos. 371-380 [doi]
- CAHTR: Communication-Avoiding Householder TRidiagonalizationToshiyuki Imamura, Takeshi Fukaya, Yusuke Hirota, Susumu Yamada, Masahiko Machida. 381-390 [doi]
- Simulation of external aerodynamics of the DrivAer model with the LBM on GPGPUsAndrea Pasquali, Martin Schönherr, Martin Geier, Manfred Krafczyk. 391-400 [doi]
- A Parallel Algorithm for Decomposition of Finite LanguagesTomasz Jastrzab, Zbigniew J. Czech, Wojciech Wieczorek. 401-410 [doi]
- Exploiting the Space Filling Curve Ordering of Particles in the Neighbour Search of Gadget3Antonio Ragagnin, Nikola Tchipev, Michael Bader, Klaus Dolag, Nicolay Hammer. 411-420 [doi]
- On-the-fly memory compression for multibody algorithmsWolfgang Eckhardt, Robert Glas, Denys Korzh, Stefan Wallner, Tobias Weinzierl. 421-430 [doi]
- Flexible and Generic Workflow ManagementSebastian Lührs, Daniel Rohe, Alexander Schnurpfeil, Kay Thust, Wolfgang Frings. 431-438 [doi]
- A Massively Parallel Barnes-Hut Tree Code with Dual Tree TraversalBenedikt Steinbusch, Marvin-Lucas Henkel, Mathias Winkel, Paul Gibbon. 439-448 [doi]
- Performance modeling of a compressible hydrodynamics solver on multicore CPUsRaphaël Poncet, Mathieu Peybernes, Thibault Gasc, Florian De Vuyst. 449-458 [doi]
- Developing a scalable and flexible high-resolution DNS code for two-phase flowsIain Bethune, Antonia B. K. Collis, Lennon Ó. Náraigh, David Scott, Prashant Valluri. 459-468 [doi]
- FPGA Port of a Large Scientific Model from Legacy Code: The Emanuel Convection SchemeKristian Thorin Hentschel, Wim Vanderbauwhede, Syed Waqar Nabi. 469-478 [doi]
- How to Keep a Geographic Map Up-To-DateMarco Grebe, Tilman Lacko, Rita Loogen. 479-488 [doi]
- Static and Dynamic Big Data Partitioning on Apache SparkMassimiliano Bertolucci, Emanuele Carlini, Patrizio Dazzi, Alessandro Lulli, Laura Ricci. 489-498 [doi]
- ParaFPGA15: Exploring threads and trends in programmable hardwareErik H. D'Hollander, Dirk Stroobandt, Abdellah Touhafi. 501-504 [doi]
- FPGAs as Components in Heterogeneous High-Performance Computing Systems: Raising the Abstraction LevelWim Vanderbauwhede, Syed Waqar Nabi. 505-514 [doi]
- FPGA Acceleration of SAT PreprocessorMasayuki Suzuki, Tsutomu Maruyama. 515-524 [doi]
- Leveraging FPGA clusters for SAT computationsMichal Kouril. 525-532 [doi]
- High-Speed Calculation of Convex Hull in 2D Images Using FPGAKenji Kanazawa, Kahori Kemmotsu, Yamato Mori, Noriyuki Aibe, Moritoshi Yasunaga. 533-542 [doi]
- Workload distribution and balancing in FPGAs and CPUs with OpenCL and TBBRafael Asenjo, Angeles G. Navarro, Andrés Rodríguez, José L. Núñez-Yáñez. 543-551 [doi]
- A Run-Time System for Partially Reconfigurable FPGAs: The case of STMicroelectronics SPEAr boardGeorge Charitopoulos, Dionisios N. Pnevmatikatos, Marco D. Santambrogio, Kyprianos Papadimitriou, Danilo Pau. 553-562 [doi]
- Exploring Automatically Generated Platforms in High Performance FPGAsPanagiotis Skrimponis, Georgios Zindros, Ioannis Parnassos, Muhsen Owaida, Nikolaos Bellas, Paolo Ienne. 563-570 [doi]
- Symposium on Experiences of Porting and Optimising Code for Xeon Phi ProcessorsAdrian Jackson, Michèle Weiland, Mark Parsons, Simon McIntosh-Smith. 573-573 [doi]
- Experiences Porting Production Codes to Xeon Phi ProcessorsEmmanouil Farsarakis, Adrian Jackson, Fiona Reid, David Scott, Michèle Weiland. 575-583 [doi]
- Preparing a Seismic Imaging Code for the Intel Knights Landing Xeon Phi processorGilles Civario, Seán Delaney, Michael Lysaght. 585-590 [doi]
- LU Factorisation on Xeon and Xeon Phi ProcessorsAdrian Jackson, Mateusz Iwo Dubaniowski. 591-599 [doi]
- Mini-Symposium on Coordination Programming - PrefaceClemens Grelck, Alex Shafarenko. 603-604 [doi]
- Claud: Coordination, Locality And Universal DistributionJossekin Beilharz, Frank Feinbube, Felix Eberhardt, Max Plauth, Andreas Polze. 605-614 [doi]
- Coordination with Structured Composition for Cyber-physical SystemsSimon Maurer, Raimund Kirner. 615-624 [doi]
- On Efficient Time Stepping using the Discontinuous Galerkin Method for Numerical Weather PredictionAndreas Dedner, Robert Klöfkorn. 627-636 [doi]
- Porting the COSMO dynamical core to heterogeneous platforms using STELLA LibraryCarlos Osuna, Oliver Fuhrer, Tobias Gysi, Thomas C. Schulthess. 637-646 [doi]
- Towards Compiler-Agnostic Performance in Finite-Difference CodesA. R. Porter, R. W. Ford, Mike Ashworth, G. D. Riley, M. Modani. 647-658 [doi]
- Is the Programming Environment ready for hybrid supercomputers?Alistair Hart, Harvey Richardson. 661-662 [doi]
- Utilizing Hybrid Programming Environments: CSCS Case StudiesWilliam Sawyer, Anton Kozhevnikov, Raffaele Solcà. 663-672 [doi]
- SYCL: Single-source C++ accelerator programmingRuyman Reyes, Victor Lomüller. 673-682 [doi]
- Using Task-Based Parallelism Directly on the GPU for Automated Asynchronous Data TransferAidan B. G. Chalk, Pedro Gonnet, Matthieu Schaller. 683-696 [doi]
- A Strategy for Developing a Performance Portable Highly Scalable ApplicationMichael Neff, Stefan Andersson, Aaron Vose, John M. Levesque. 697-706 [doi]
- Mini-Symposium on Energy and Resilience in Parallel ProgrammingDimitrios S. Nikolopoulos, Christos D. Antonopoulos. 709-709 [doi]
- Performance and Fault Tolerance of Preconditioned Iterative Solvers on Low-Power ARM ArchitecturesJosé Ignacio Aliaga, Sandra Catalán, Charalampos Chalios, Dimitrios S. Nikolopoulos, Enrique S. Quintana-Ortí. 711-720 [doi]
- Compiling for Resilience: the Performance GapNorman A. Rink, Dmitrii Kuvaiskii, Jerónimo Castrillon, Christof Fetzer. 721-730 [doi]
- Automation of Significance Analyses with Interval SplittingJens Deussen, Jan Riehme, Uwe Naumann. 731-740 [doi]
- Energy Minimization on Heterogeneous Systems through Approximate ComputingMichalis Spyrou, Christos Kalogirou, Christos Konstantas, Panos K. Koutsovasilis, Manolis Maroudas, Christos D. Antonopoulos, Nikolaos Bellas. 741-752 [doi]
- Landing Containment Domains on SWARM: Toward a Robust Resiliency Solution on a Dynamic Adaptive Runtime MachineSam Kaplan, Sergio Pino, Aaron Landwehr, Guang R. Gao. 753-761 [doi]
- MAXI - Multi-System Application Extreme-Scaling ImperativeDirk Brömmel, Wolfgang Frings, Brian J. N. Wylie. 765-766 [doi]
- High throughput simulations of two-phase flows on Blue Gene/QPanagiotis E. Hadjidoukas, Diego Rossinelli, Fabian Wermelinger, Jonas Sukys, Ursula Rasthofer, Christian Conti, Babak Hejazialhosseini, Petros Koumoutsakos. 767-776 [doi]
- Direct Numerical Simulation of Fluid Turbulence at Extreme Scale with psOpenJens Henrik Göbbert, Michael Gauding, Cedrick Ansorge, Bernd Hentschel, Torsten Kuhlen, Heinz Pitsch. 777-785 [doi]
- Simulating Morphologically Detailed Neuronal Networks at Extreme ScaleAleksandr Ovcharenko, Pramod S. Kumbhar, Michael L. Hines, Francesco Cremonesi, Timothée Ewart, Stuart Yates, Felix Schürmann, Fabien Delalondre. 787-796 [doi]
- 2TI: Computational Scale Bridging for Dual-Phase SteelsAxel Klawonn, Martin Lanser, Oliver Rheinbach. 797-806 [doi]
- Performance Evaluation of the LBM Solver Musubi on Various HPC ArchitecturesJiaxing Qi, Kartik Jain, Harald Klimach, Sabine Roller. 807-816 [doi]
- Extreme-scaling Applications 24/7 on JUQUEEN Blue Gene/QDirk Brömmel, Wolfgang Frings, Brian J. N. Wylie. 817-826 [doi]
- Extreme Scale-out SuperMUC Phase 2 - lessons learnedNicolay Hammer, Ferdinand Jamitzky, Helmut Satzger, Momme Allalen, Alexander Block, Anupam Karmakar, Matthias Brehm, Reinhold Bader, Luigi Iapichino, Antonio Ragagnin, Vasilios Karakasis, Dieter Kranzlmüller, Arndt Bode, Herbert Huber, Martin Kühn, Rui Machado, Daniel Grünewald, Philipp V. F. Edelmann, Friedrich K. Röpke, Markus Wittmann, Thomas Zeiser, Gerhard Wellein, Gerald Mathias, Magnus Schwörer, Konstantin Lorenzen, Christoph Federrath, Ralf Klessen, Karl-Ulrich Bamberg, Hartmut Ruhl, Florian Schornbaum, Martin Bauer, Anand Nikhil, Jiaxing Qi, Harald Klimach, Hinnerk Stüben, Abhishek Deshmukh, Tobias Falkenstein, Klaus Dolag, Margarita Petkova. 827-836 [doi]
- "K-scale" applications on the K computer and co-design effort for the development of "post-K"Miwako Tsuji. 837-846 [doi]