Abstract is missing.
- MPI Trace Compression Using Event Flow GraphsXavier Aguilar, Karl Fürlinger, Erwin Laure. 1-12 [doi]
- ScalaJack: Customized Scalable Tracing with In-situ Data AnalysisSrinath Krishna Ananthakrishnan, Frank Mueller. 13-25 [doi]
- Performance Measurement and Analysis of Transactional Memory and Speculative Execution on IBM Blue Gene/QJie Jiang, Peter Philippen, Michael Knobloch, Bernd Mohr. 26-37 [doi]
- c-Eclipse: An Open-Source Management Framework for Cloud ApplicationsChrystalla Sofokleous, Nicholas Loulloudes, Demetris Trihinas, George Pallis, Marios D. Dikaiakos. 38-49 [doi]
- Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-core ArchitecturesLuka Stanisic, Samuel Thibault, Arnaud Legrand, Brice Videau, Jean-François Méhaut. 50-62 [doi]
- Modeling the Impact of Reduced Memory Bandwidth on HPC ApplicationsAnanta Tiwari, Anthony Gamst, Michael A. Laurenzano, Martin Schulz, Laura Carrington. 63-74 [doi]
- ParaShares: Finding the Important Basic Blocks in Multithreaded ProgramsMelanie Kambadur, Kui Tang, Martha A. Kim. 75-86 [doi]
- Multi-Objective Auto-Tuning with Insieme: Optimization and Trade-Off Analysis for Time, Energy and Resource UsagePhilipp Gschwandtner, Juan José Durillo, Thomas Fahringer. 87-98 [doi]
- Performance Prediction and Evaluation of Parallel Applications in KVM, Xen, and VMwareCheol-Ho Hong, Beom-Joon Kim, Young-Pil Kim, Hyunchan Park, Chuck Yoo. 99-110 [doi]
- DReAM: Per-Task DRAM Energy Metering in Multicore SystemsQixiao Liu, Miquel Moretó, Jaume Abella, Francisco J. Cazorla, Mateo Valero. 111-123 [doi]
- Characterizing the Performance-Energy Tradeoff of Small ARM Cores in HPC ComputationMichael A. Laurenzano, Ananta Tiwari, Adam Jundt, Joshua Peraza, William A. Ward Jr., Roy L. Campbell, Laura Carrington. 124-137 [doi]
- On Interactions among Scheduling Policies: Finding Efficient Queue Setup Using High-Resolution SimulationsDalibor Klusácek, Simon Tóth. 138-149 [doi]
- ProPS: A Progressively Pessimistic Scheduler for Software Transactional MemoryHugo Rito, João P. Cachopo. 150-161 [doi]
- A Queueing Theory Approach to Pareto Optimal Bags-of-Tasks Scheduling on CloudsCosmin Dumitru, Ana-Maria Oprescu, Miroslav Zivkovic, Rob van der Mei, Paola Grosso, Cees de Laat. 162-173 [doi]
- SPAGHETtI: Scheduling/Placement Approach for Task-Graphs on HETerogeneous archItectureDenis Barthou, Emmanuel Jeannot. 174-185 [doi]
- Energy-Aware Multi-Organization Scheduling ProblemJohanne Cohen, Daniel Cordeiro, Pedro Luis F. Raphael. 186-197 [doi]
- Energy Efficient Scheduling of MapReduce JobsEvripidis Bampis, Vincent Chau, Dimitrios Letsios, Giorgio Lucarelli, Ioannis Milis, Georgios Zois. 198-209 [doi]
- Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUsDafei Huang, Mei Wen, Changqing Xun, Dong Chen, Xing Cai, Yuran Qiao, Nan Wu 0003, Chunyuan Zhang. 210-221 [doi]
- Switchable Scheduling for Runtime Adaptation of OptimizationLénaïc Bagnères, Cédric Bastoul. 222-233 [doi]
- A New GCC Plugin-Based Compiler Pass to Add Support for Thread-Level Speculation into OpenMPSergio Aldea, Alvaro Estebanez, Diego R. Llanos, Arturo González-Escribano. 234-245 [doi]
- Improving Read Performance with Online Access Pattern Analysis and PrefetchingHoujun Tang, Xiaocheng Zou, John Jenkins, David A. Boyuka II, Stephen Ranshous, Dries Kimpe, Scott Klasky, Nagiza F. Samatova. 246-257 [doi]
- Robust and Efficient Large-Large Table Outer Joins on Distributed InfrastructuresLong Cheng, Spyros Kotoulas, Tomas E. Ward, Georgios Theodoropoulos. 258-269 [doi]
- Top-k Item Identification on Dynamic and Distributed DatasetsAlessio Guerrieri, Alberto Montresor, Yannis Velegrakis. 270-281 [doi]
- Applying Selectively Parallel I/O Compression to Parallel Storage SystemsRosa Filgueira, Malcolm P. Atkinson, Yusuke Tanimura, Isao Kojima. 282-293 [doi]
- Ultra-Fast Load Balancing of Distributed Key-Value Stores through Network-Assisted LookupsDavide De Cesaris, Kostas Katrinis, Spyros Kotoulas, Antonio Corradi. 294-305 [doi]
- Virtual Machine Consolidation in Cloud Data Centers Using ACO MetaheuristicMd Hasanul Ferdaus, Manzur Murshed, Rodrigo N. Calheiros, Rajkumar Buyya. 306-317 [doi]
- Workflow Scheduling on Federated CloudsJuan José Durillo, Radu Prodan. 318-329 [doi]
- Locality-Aware Cooperation for VM Scheduling in Distributed CloudsJonathan Pastor, Marin Bertier, Frédéric Desprez, Adrien Lebre, Flavien Quesnel, Cédric Tedeschi. 330-341 [doi]
- Can Inter-VM Shmem Benefit MPI Applications on SR-IOV Based Virtualized Infiniband Clusters?Jie Zhang, Xiaoyi Lu, Jithin Jose, Rong Shi, Dhabaleswar K. Panda. 342-353 [doi]
- Power-Aware L1 and L2 Caches for GPGPUsEhsan Atoofian, Ali Manzak. 354-365 [doi]
- Power Consumption Due to Data Movement in Distributed Programming ModelsSiddhartha Jana, Oscar Hernandez, Stephen Poole, Barbara M. Chapman. 366-378 [doi]
- Spanning Tree or Gossip for Aggregation: A Comparative StudyLehel Nyers, Márk Jelasity. 379-390 [doi]
- Shades: Expediting Kademlia's Lookup ProcessGil Einziger, Roy Friedman, Yoav Kantor. 391-402 [doi]
- Analysis and Comparison of Truly Distributed Solvers for Linear Least Squares Problems on Wireless Sensor NetworksKarl E. Prikopa, Hana Straková, Wilfried N. Gansterer. 403-414 [doi]
- High-Performance Computer Algebra: A Hecke Algebra Case StudyPatrick Maier, Daria Livesey, Hans-Wolfgang Loidl, Phil Trinder. 415-426 [doi]
- Generic Deterministic Random Number Generation in Dynamic-Multithreaded PlatformsStefano Mor, Jean-Louis Roch, Nicolas Maillard. 427-438 [doi]
- Implementation and Performance Analysis of SkelGIS for Network Mesh-Based SimulationsHélène Coullon, Sébastien Limet. 439-450 [doi]
- GoFFish: A Sub-graph Centric Framework for Large-Scale Graph AnalyticsYogesh Simmhan, Alok Gautam Kumbhare, Charith Wickramaarachchi, Soonil Nagarkar, Santosh Ravi, Cauligi S. Raghavendra, Viktor K. Prasanna. 451-462 [doi]
- Resolving Semantic Conflicts in Word Based Software Transactional MemoryCraig Sharp, William Blewitt, Graham Morgan. 463-474 [doi]
- Automatic Tuning of the Parallelism Degree in Hardware Transactional MemoryDiego Rughetti, Paolo Romano, Francesco Quaglia, Bruno Ciciani. 475-486 [doi]
- A Distributed CPU-GPU Sparse Direct SolverPiyush Sao, Richard W. Vuduc, Xiaoye Sherry Li. 487-498 [doi]
- Parallel Computation of Echelon FormsJean-Guillaume Dumas, Thierry Gautier, Clément Pernet, Ziad Sultan. 499-510 [doi]
- Time-Domain BEM for the Wave Equation: Optimization and Hybrid ParallelizationBérenger Bramas, Olivier Coulaud, Guillaume Sylvand. 511-523 [doi]
- Structured Orthogonal Inversion of Block p-Cyclic Matrices on Multicores with GPU AcceleratorsSergiy Gogolenko, Zhaojun Bai, Richard Scalettar. 524-535 [doi]
- High-Throughput Maps on Message-Passing Manycore Architectures: Partitioning versus ReplicationOmid Shahmirzadi, Thomas Ropars, André Schiper. 536-547 [doi]
- A Fast Sparse Block Circulant Matrix Vector ProductEloy Romero, Andrés Tomás, Antonio Soriano, Ignacio Blanquer. 548-559 [doi]
- Scheduling Data Flow Program in XKaapi: A New Affinity Based Algorithm for Heterogeneous ArchitecturesRaphaël Bleuse, Thierry Gautier, João V. F. Lima, Grégory Mounié, Denis Trystram. 560-571 [doi]
- Delegation Locking Libraries for Improved Performance of Multithreaded ProgramsDavid Klaftenegger, Konstantinos F. Sagonas, Kjell Winblad. 572-583 [doi]
- A Generic Strategy for Multi-stage StencilsMauro Bianco, Benjamin Cumming. 584-595 [doi]
- Evaluation of OpenMP Task Scheduling Algorithms for Large NUMA ArchitecturesJérôme Clet-Ortega, Patrick Carribault, Marc Pérache. 596-607 [doi]
- Power-Aware Replica Placement in Tree Networks with Multiple Servers per ClientGuillaume Aupy, Anne Benoit, Matthieu Journault, Yves Robert. 608-619 [doi]
- On Constructing DAG-Schedules with Large AREAsScott T. Roche, Arnold L. Rosenberg, Rajmohan Rajaraman. 620-631 [doi]
- Software Defined Multicasting for MPI Collective Operation Offloading with the NetFPGAOmer Arap, Geoffrey Brown, Bryce Himebaugh, Martin Swany. 632-643 [doi]
- MapReduce over Lustre: Can RDMA-Based Approach Benefit?Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Sharmin Islam, Raghunath Rajachandrasekar, Dhabaleswar K. Panda. 644-655 [doi]
- Random Fields Generation on the GPU with the Spectral Turning Bands MethodLars Hunger, Biagio Cosenza, Stefan Kimeswenger, Thomas Fahringer. 656-667 [doi]
- Fast Set Intersection through Run-Time Bitmap Construction over PForDelta-Compressed IndexesXiaocheng Zou, Sriram Lakshminarasimhan, David A. Boyuka II, Stephen Ranshous, Houjun Tang, Scott Klasky, Nagiza F. Samatova. 668-679 [doi]
- Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWASJorge González-Domínguez, Bertil Schmidt, Jan Christian Kässens, Lars Wienbrandt. 680-691 [doi]
- IFM: A Scalable High Resolution Flood Modeling FrameworkSwati Singhal, Sandhya Aneja, Frank Liu, Lucas Villa Real, Thomas George. 692-703 [doi]
- High Performance Pseudo-analytical Simulation of Multi-Object Adaptive Optics over Multi-GPU SystemsAhmad Abdelfattah, Eric Gendron, Damien Gratadour, David E. Keyes, Hatem Ltaief, Arnaud Sevin, Fabrice Vidal. 704-715 [doi]
- Parallel Dual Tree Traversal on Multi-core and Many-core Architectures for Astrophysical N-body SimulationsBenoit Lange, Pierre Fortin. 716-727 [doi]
- Customizing Driving Directions with GPUsDaniel Delling, Moritz Kobitzsch, Renato F. Werneck. 728-739 [doi]
- GPU Accelerated Range Trees with ApplicationsManoj Kumar Maramreddy, Kishore Kothapalli. 740-751 [doi]
- Scalable On-Board Multi-GPU Simulation of Long-Range Molecular DynamicsMarcos Novalbos, Jaime Gonzalez, Miguel A. Otaduy, Roberto Martinez-Benito, Alberto Sanchez. 752-763 [doi]
- Resolution of Linear Algebra for the Discrete Logarithm Problem Using GPU and Multi-core ArchitecturesHamza Jeljeli. 764-775 [doi]
- Toward OpenCL Automatic Multi-Device SupportSylvain Henry, Alexandre Denis, Denis Barthou, Marie Christine Counilh, Raymond Namyst. 776-787 [doi]
- Concurrent Kernel Execution on Xeon Phi within Parallel Heterogeneous WorkloadsFlorian Wende, Thomas Steinke, Frank Cordes. 788-799 [doi]
- Writing Self-adaptive Codes for Heterogeneous SystemsJorge F. Fabeiro, Diego Andrade, Basilio B. Fraguela, Ramon Doallo. 800-811 [doi]
- A Pattern-Based Comparison of OpenACC and OpenMP for Accelerator ComputingSandra Wienke, Christian Terboven, James C. Beyer, Matthias S. Müller. 812-823 [doi]