Abstract is missing.
- Concurrent Systems: Hybrid Object Implementations and Abortable ObjectsMichel Raynal. 3-15 [doi]
- Runtime-Aware ArchitecturesMarc Casas, Miquel Moretó, Lluc Alvarez, Emilio Castillo, Dimitrios Chasapis, Timothy Hayes 0001, Luc Jaulmes, Oscar Palomar, Osman S. Unsal, Adrián Cristal, Eduard Ayguadé, Jesús Labarta, Mateo Valero. 16-27 [doi]
- MPI Thread-Level Checking for MPI+OpenMP ApplicationsEmmanuelle Saillard, Patrick Carribault, Denis Barthou. 31-42 [doi]
- Event-Action Mappings for Parallel Tools InfrastructuresTobias Hilbrich, Martin Schulz, Holger Brunst, Joachim Protze, Bronis R. de Supinski, Matthias S. Müller. 43-54 [doi]
- Low-Overhead Detection of Memory Access Patterns and Their Time EvolutionHarald Servat, Germán Llort, Juan Gonzalez, Judit Gimenez, Jesús Labarta. 57-69 [doi]
- Automatic On-Line Detection of MPI Application Structure with Event Flow GraphsXavier Aguilar, Karl Fürlinger, Erwin Laure. 70-81 [doi]
- Online Automated Reliability Classification of Queueing Models for Streaming Processing Using Support Vector MachinesJonathan C. Beard, Cooper Epstein, Roger D. Chamberlain. 82-93 [doi]
- A Duplicate-Free State-Space Model for Optimal Task SchedulingMichael Orr, Oliver Sinnen. 97-108 [doi]
- On the Heterogeneity Bias of Cost Matrices When Assessing Scheduling AlgorithmsLouis-Claude Canon, Laurent Philippe. 109-121 [doi]
- Hardware Round-Robin Scheduler for Single-ISA Asymmetric Multi-coreNikola Markovic, Daniel Nemirovsky, Veljko Milutinovic, Osman S. Unsal, Mateo Valero, Adrián Cristal. 122-134 [doi]
- Moody Scheduling for Speculative ParallelizationAlvaro Estebanez, Diego R. Llanos, David Orden, Belén Palop. 135-146 [doi]
- Allocating Jobs with Periodic Demand VariationsOlivier Beaumont, Ikbel Belaid, Lionel Eyraud-Dubois, Juan Angel Lorenzo del Castillo. 147-158 [doi]
- A Multi-level Hypergraph Partitioning Algorithm Using Rough Set ClusteringFoad Lotfifar, Matthew Johnson. 159-170 [doi]
- Non-preemptive Throughput Maximization for Speed-Scaling with Power-DownEric Angel, Evripidis Bampis, Vincent Chau, Nguyen Kim Thang. 171-182 [doi]
- Scheduling Tasks from Selfish Multi-tasks AgentsJohanne Cohen, Fanny Pascual. 183-195 [doi]
- Locality and Balance for Communication-Aware Thread Mapping in Multicore SystemsMatthias Diener, Eduardo Henrique Molina da Cruz, Marco Antonio Zanata Alves, Mohammad S. Alhakeem, Philippe Olivier Alexandre Navaux, Hans-Ulrich Heiß. 196-208 [doi]
- Priority Queues Are Not Good Concurrent Priority SchedulersAndrew Lenharth, Donald Nguyen, Keshav Pingali. 209-221 [doi]
- Load Balancing Prioritized Tasks via Work-StealingShams Imam, Vivek Sarkar. 222-234 [doi]
- Optimizing Task Parallelism with Library-Semantics-Aware CompilationPeter Thoman, Stefan Moosbrugger, Thomas Fahringer. 237-249 [doi]
- Data Layout Optimization for Portable PerformanceKamal Sharma, Ian Karlin, Jeff Keasler, James R. McGraw, Vivek Sarkar. 250-262 [doi]
- Automatic Data Layout Optimizations for GPUsKlaus Kofler, Biagio Cosenza, Thomas Fahringer. 263-274 [doi]
- Performance Impacts with Reliable Parallel File Systems at Exascale LevelRamon Nou, Alberto Miranda, Toni Cortes. 277-288 [doi]
- Rapid Tomographic Image Reconstruction via Large-Scale ParallelizationTekin Bicer, Doga Gürsoy, Rajkumar Kettimuthu, Francesco De Carlo, Gagan Agrawal, Ian T. Foster. 289-302 [doi]
- Software Consolidation as an Efficient Energy and Cost Saving Solution for a SaaS/PaaS Cloud ModelAlain Tchana, Noel De Palma, Ibrahim Safieddine, Daniel Hagimont, Bruno Diot, Nicolas Vuillerme. 305-316 [doi]
- VMPlaceS: A Generic Tool to Investigate and Compare VM Placement AlgorithmsAdrien Lèbre, Jonathan Pastor, Mario Südholt. 317-329 [doi]
- A Connectivity Model for Agreement in Dynamic SystemsCarlos Gómez-Calzado, Arnaud Casteigts, Alberto Lafuente, Mikel Larrea. 333-345 [doi]
- DFEP: Distributed Funding-Based Edge PartitioningAlessio Guerrieri, Alberto Montresor. 346-358 [doi]
- PR-STM: Priority Rule Based Software Transactions for the GPUQi Shen, Craig Sharp, William Blewitt, Gary Ushaw, Graham Morgan. 361-372 [doi]
- Leveraging MPI-3 Shared-Memory Extensions for Efficient PGAS Runtime SystemsHuan Zhou, Kamran Idrees, José Gracia. 373-384 [doi]
- A Practical Transactional Memory InterfaceShahar Timnat, Maurice Herlihy, Erez Petrank. 387-401 [doi]
- A Multicore Parallelization of Continuous Skyline Queries on Data StreamsTiziano De Matteis, Salvatore Di Girolamo, Gabriele Mencagli. 402-413 [doi]
- A Fast and Scalable Graph Coloring Algorithm for Multi-core and Many-core ArchitecturesGeorgios Rokos, Gerard Gorman, Paul H. J. Kelly. 414-425 [doi]
- A Composable Deadlock-Free Approach to Object-Based IsolationShams Imam, Jisheng Zhao, Vivek Sarkar. 426-437 [doi]
- Scalable Data-Driven PageRank: Algorithms, System Issues, and Lessons LearnedJoyce Jiyoung Whang, Andrew Lenharth, Inderjit S. Dhillon, Keshav Pingali. 438-450 [doi]
- How Many Threads will be too Many? On the Scalability of OpenMP ImplementationsChristian Iwainsky, Sergei Shudler, Alexandru Calotoiu, Alexandre Strube, Michael Knobloch, Christian H. Bischof, Felix Wolf. 451-463 [doi]
- Efficient Nested Dissection for Multicore ArchitecturesDominique Lasalle, George Karypis. 467-478 [doi]
- Scheduling Trees of Malleable Tasks for Sparse Linear AlgebraAbdou Guermouche, Loris Marchal, Bertrand Simon, Frédéric Vivien. 479-490 [doi]
- Elastic Tasks: Unifying Task Parallelism and SPMD Parallelism with an Adaptive RuntimeAlina Simion Sbîrlea, Kunal Agrawal, Vivek Sarkar. 491-503 [doi]
- Semi-discrete Matrix-Free Formulation of 3D Elastic Full Waveform Inversion ModelingStephen Moore, Devi Sudheer Chunduri, Sergiy Zhuk, Tigran T. Tchrakian, Ewout van den Berg, Albert Akhriev, Alberto Costa Nogueira Jr., Andrew Rawlinson, Lior Horesh. 507-518 [doi]
- 10, 000 Performance Models per Minute - Scalability of the UG4 Simulation FrameworkAndreas Vogel, Alexandru Calotoiu, Alexandre Strube, Sebastian Reiter, Arne Nägel, Felix Wolf, Gabriel Wittum. 519-531 [doi]
- Exploiting Task-Based Parallelism in Bayesian Uncertainty QuantificationPanagiotis E. Hadjidoukas, Panagiotis Angelikopoulos, Lina Kulakova, Costas Papadimitriou, Petros Koumoutsakos. 532-544 [doi]
- Parallelization of an Advection-Diffusion Problem Arising in Edge Plasma Physics Using Hybrid MPI/OpenMP ProgrammingMatthieu Kuhn, Guillaume Latu, Nicolas Crouseilles, Stéphane Genaud. 545-557 [doi]
- Behavioral Non-portability in Scientific Numeric ComputingYijia Gu, Thomas Wahl, Mahsa Bayati, Miriam Leeser. 558-569 [doi]
- Fast Parallel Suffix Array on the GPULeyuan Wang, Sean Baxter, John D. Owens. 573-587 [doi]
- Effective Barrier Synchronization on Intel Xeon Phi CoprocessorAndrey Rodchenko, Andy Nisbet, Antoniu Pop, Mikel Luján. 588-600 [doi]
- High Performance Multi-GPU SpMV for Multi-component PDE-Based ApplicationsAhmad Abdelfattah, Hatem Ltaief, David E. Keyes. 601-612 [doi]
- Accelerating Lattice Boltzmann Applications with OpenACCEnrico Calore, Jiri Kraus, Sebastiano Fabio Schifano, Raffaele Tripiccione. 613-624 [doi]
- High-Performance and Scalable Design of MPI-3 RMA on Xeon Phi ClustersMingzhe Li, Khaled Hamidouche, Xiaoyi Lu, Jian Lin, Dhabaleswar K. Panda. 625-637 [doi]
- Improving Performance of Convolutional Neural Networks by Separable Filters on GPUHao-Ping Kang, Che-Rung Lee. 638-649 [doi]
- Iterative Sparse Triangular Solves for PreconditioningHartwig Anzt, Edmond Chow, Jack Dongarra. 650-661 [doi]
- Targeting the ParallellaSpiros N. Agathos, Alexandros Papadogiannakis, Vassilios V. Dimakopoulos. 662-674 [doi]
- Systematic Fusion of CUDA Kernels for Iterative Sparse Linear System SolversJosé Ignacio Aliaga, Joaquín Pérez, Enrique S. Quintana-Ortí. 675-686 [doi]
- Efficient Execution of Multiple CUDA Applications Using Transparent Suspend, Resume and MigrationTaichiro Suzuki, Akira Nukada, Satoshi Matsuoka. 687-699 [doi]