- Automatic Detection of Synchronization Errors in Codes that Target the Open Community Runtime. Jirí Dokulil, Jana Katreniaková. 3-15
- A Methodology for Performance Analysis of Applications Using Multi-layer I/O. Ronny Tschüter, Christian Herold, Bert Wesarg, Matthias Weber. 16-30
- Runtime Determinacy Race Detection for OpenMP Tasks. Hassan Salehe Matar, Didem Unat. 31-45
- Estimating the Impact of External Interference on Application Performance. Aamer Shah, Matthias Müller, Felix Wolf. 46-58
- GT-Race: Graph Traversal Based Data Race Detection for Asynchronous Many-Task Parallelism. Lechen Yu, Vivek Sarkar. 59-73
- Reducing GPU Register File Energy. Vishwesh Jatala, Jayvant Anantpur, Amey Karkare. 77-91
- Taxonomist: Application Detection Through Rich Monitoring Data. Emre Ates, Ozan Tuncer, Ata Turk, Vitus J. Leung, Jim M. Brandt, Manuel Egele, Ayse Kivilcim Coskun. 92-105
- Diagnosing Highly-Parallel OpenMP Programs with Aggregated Grain Graphs. Nico Reissmann, Ananya Muddukrishna. 106-119
- Characterization of Smartphone Governor Strategies. Sarbartha Banerjee, Lizy Kurian John. 120-134
- HPC Benchmarking: Scaling Right and Looking Beyond the Average. Milan Radulovic, Kazi Asifuzzaman, Paul M. Carpenter, Petar Radojkovic, Eduard Ayguadé. 135-146
- Combined Vertical and Horizontal Autoscaling Through Model Predictive Control. Emilio Incerto, Mirco Tribastone, Catia Trubiani. 147-159
- Early Termination of Failed HPC Jobs Through Machine and Deep Learning. Michal Zasadzinski, Victor Muntés-Mulero, Marc Solé, David Carrera, Thomas Ludwig. 163-177
- Peacock: Probe-Based Scheduling of Jobs by Rotating Between Elastic Queues. Mansour Khelghatdoust, Vincent Gramoli. 178-191
- Online Scheduling of Task Graphs on Hybrid Platforms. Louis-Claude Canon, Loris Marchal, Bertrand Simon, Frédéric Vivien. 192-204
- Interference-Aware Scheduling Using Geometric Constraints. Raphaël Bleuse, Konstantinos Dogeas, Giorgio Lucarelli, Grégory Mounié, Denis Trystram. 205-217
- Resource-Efficient Execution of Conditional Parallel Real-Time Tasks. Sanjoy Baruah. 218-231
- Improving GPU Cache Hierarchy Performance with a Fetch and Replacement Cache. Francisco Candel, Salvador Petit, Alejandro Valero, Julio Sahuquillo. 235-248
- Abelian: A Compiler for Graph Analytics on Distributed, Heterogeneous Platforms. Gurbinder Gill, Roshan Dathathri, Loc Hoang, Andrew Lenharth, Keshav Pingali. 249-264
- Using Dynamic Compilation to Achieve Ninja Performance for CNN Training on Many-Core Processors. Ankush Mandal, Rajkishore Barik, Vivek Sarkar. 265-278
- Privacy-Preserving Top-k Query Processing in Distributed Systems. Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez. 281-292
- Minimizing Network Traffic for Distributed Joins Using Lightweight Locality-Aware Scheduling. Long Cheng, John Murphy, Qingzhi Liu, Chunliang Hao, Georgios Theodoropoulos. 293-305
- VIoLET: A Large-Scale Virtual Environment for Internet of Things. Shreyas Badiger, Shrey Baheti, Yogesh Simmhan. 309-324
- Adaptive Bandwidth-Efficient Recovery Techniques in Erasure-Coded Cloud Storage. Rekha Nachiappan, Bahman Javadi, Rodrigo N. Calheiros, Kenan M. Matawie. 325-338
- IT Optimization for Datacenters Under Renewable Power Constraint. Stéphane Caux, Paul Renaud-Goud, Gustavo Rostirolla, Patricia Stolf. 339-351
- GPU Provisioning: The 80-20 Rule. Eleni Kanellou, Nikolaos Chrysos, Stelios Mavridis, Yannis Sfakianakis, Angelos Bilas. 352-364
- ECSched: Efficient Container Scheduling on Heterogeneous Clusters. Yang Hu, Huan Zhou, Cees de Laat, Zhiming Zhao. 365-377
- Combinatorial Auction Algorithm Selection for Cloud Resource Allocation Using Machine Learning. Diana Gudu, Marcus Hardt, Achim Streit. 378-391
- Cloud Federation Formation in Oligopolistic Markets. Yash Khandelwal, Karthik Ganti, Suresh Purini, Puduru V. Reddy. 392-403
- Improving Cloud Simulation Using the Monte-Carlo Method. Luke Bertot, Stéphane Genaud, Julien Gossa. 404-416
- Nobody Cares if You Liked Star Wars: KNN Graph Construction on the Cheap. Anne-Marie Kermarrec, Olivier Ruas, François Taïani. 419-431
- One-Sided Communications for More Efficient Parallel State Space Exploration over RDMA Clusters. Camille Coti, Sami Evangelista, Laure Petrucci. 432-446
- Robust Decentralized Mean Estimation with Limited Communication. Gábor Danner, Márk Jelasity. 447-461
- Snapshot-Based Synchronization: A Fast Replacement for Hand-over-Hand Locking. Eran Gilad, Trevor Brown, Mark Oskin, Yoav Etsion. 465-479
- Measuring Multithreaded Message Matching Misery. Whit Schonbein, Matthew G. F. Dosanjh, Ryan E. Grant, Patrick G. Bridges. 480-491
- Global-Local View: Scalable Consistency for Concurrent Data Types. Deepthi Devaki Akkoorath, José Brandão, Annette Bieniusa, Carlos Baquero. 492-504
- OpenABL: A Domain-Specific Language for Parallel and Distributed Agent-Based Simulations. Biagio Cosenza, Nikita Popov, Ben H. H. Juurlink, Paul Richmond, Mozhgan Kabiri Chimeh, Carmine Spagnuolo, Gennaro Cordasco, Vittorio Scarano. 505-518
- Bulk: A Modern C++ Interface for Bulk-Synchronous Parallel Programs. Jan-Willem Buurlage, Tom Bannink, Rob H. Bisseling. 519-532
- SharP Unified Memory Allocator: An Intent-Based Memory Allocator for Extreme-Scale Systems. Ferrol Aderholdt, Manjunath Gorentla Venkata, Zachary W. Parchman. 533-545
- Multi-granularity Locking in Hierarchies with Synergistic Hierarchical and Fine-Grained Locks. K. Ganesh, Saurabh Kalikar, Rupesh Nasre. 546-559
- Efficient Communication/Computation Overlap with MPI+OpenMP Runtimes Collaboration. Marc Sergent, Mario Dagrada, Patrick Carribault, Julien Jaeger, Marc Pérache, Guillaume Papauré. 560-572
- Efficient Lock-Free Removing and Compaction for the Cache-Trie Data Structure. Aleksandar Prokopec. 575-589
- NUMA Optimizations for Algorithmic Skeletons. Paul Metzger, Murray Cole, Christian Fensch. 590-602
- Improving System Turnaround Time with Intel CAT by Identifying LLC Critical Applications. Lucia Pons, Vicent Selfa, Julio Sahuquillo, Salvador Petit, Julio Pons. 603-615
- Dynamic Placement of Progress Thread for Overlapping MPI Non-blocking Collectives on Manycore Processor. Alexandre Denis, Julien Jaeger, Emmanuel Jeannot, Marc Pérache, Hugo Taboada. 616-627
- Efficient Load Balancing Techniques for Graph Traversal Applications on GPUs. Federico Busato, Nicola Bombieri. 628-641
- Energy Efficient Stencil Computations on the Low-Power Manycore MPPA-256 Processor. Emmanuel Podestá Jr., Bruno Marques do Nascimento, Márcio Castro. 642-655
- High-Quality Shared-Memory Graph Partitioning. Yaroslav Akhremtsev, Peter Sanders, Christian Schulz. 659-671
- Design Principles for Sparse Matrix Multiplication on the GPU. Carl Yang, Aydin Buluç, John D. Owens. 672-687
- Distributed Graph Clustering Using Modularity and Map Equation. Michael Hamann, Ben Strasser, Dorothea Wagner, Tim Zeitz. 688-702
- Improved Distributed Algorithm for Graph Truss Decomposition. Venkatesan T. Chakaravarthy, Aashish Goyal, Prakash Murali, Shivmaran S. Pandian, Yogish Sabharwal. 703-717
- Exploiting Data Sparsity for Large-Scale Matrix Computations. Kadir Akbudak, Hatem Ltaief, Aleksandr Mikhalev, Ali Charara, Aniello Esposito, David E. Keyes. 721-734
- Hybrid Parallelization and Performance Optimization of the FLEUR Code: New Possibilities for All-Electron Density Functional Theory. Uliana Alekseeva, Gregor Michalicek, Daniel Wortmann, Stefan Blügel. 735-748
- Efficient Strict-Binning Particle-in-Cell Algorithm for Multi-core SIMD Processors. Yann Barsamian, Arthur Charguéraud, Sever A. Hirstoaga, Michel Mehrenberger. 749-763
- Task-Based Programming on Emerging Parallel Architectures for Finite-Differences Seismic Numerical Kernel. Salli Moustafa, Wilfried Kirschenmann, Fabrice Dupros, Hideo Aochi. 764-777
- CEML: a Coordinated Runtime System for Efficient Machine Learning on Heterogeneous Computing Systems. Jihoon Hyun, Jinsu Park, Kyu Yeun Kim, Seongdae Yu, Woongki Baek. 781-795
- Stream Processing on Hybrid CPU/Intel® Xeon Phi™ Systems. Paulo Ferrão, Hélder Marques, Hervé Paulino. 796-810
- Tile Low-Rank GEMM Using Batched Operations on GPUs. Ali Charara, David E. Keyes, Hatem Ltaief. 811-825