Abstract is missing.
- End of Moore's Law: Or, a Computer Architect's Mid-life Crisis?Parthasarathy Ranganathan. 1 [doi]
- Exact and Parallel Triangle Counting in Dynamic GraphsDevavret Makkar, David A. Bader, Oded Green. 2-12 [doi]
- Shared-Memory Graph Truss DecompositionHumayun Kabir, Kamesh Madduri. 13-22 [doi]
- Approximate Computing Techniques for Iterative Graph AlgorithmsAjay Panyala, Omer Subasi, Mahantesh Halappanavar, Ananth Kalyanaraman, Daniel G. Chavarría-Miranda, Sriram Krishnamoorthy. 23-32 [doi]
- Scalable Exact Parent Sets Identification in Bayesian Networks Learning with Apache SparkSubhadeep Karan, Jaroslaw Zola. 33-41 [doi]
- Parallel Exact Dynamic Bayesian Network Structure Learning with Application to Gene NetworksMd. Vasimuddin, Srinivas Aluru. 42-51 [doi]
- Parallel Asynchronous Distributed-Memory Maximal Independent Set Algorithm with Work OrderingThejaka Amila Kanewala, Marcin Zalewski, Andrew Lumsdaine. 52-61 [doi]
- Designing Registration Caching Free High-Performance MPI Library with Implicit On-Demand Paging (ODP) of InfiniBandMingzhe Li, Xiaoyi Lu, Hari Subramoni, Dhabaleswar K. Panda. 62-71 [doi]
- Last Level Collective Hardware Prefetching For Data-Parallel ApplicationsGeorge Michelogiannakis, John Shalf. 72-83 [doi]
- Kernel-Assisted Communication Engine for MPI on Emerging Manycore ProcessorsJahanzeb Maqbool Hashmi, Khaled Hamidouche, Hari Subramoni, Dhabaleswar K. Panda. 84-93 [doi]
- Support for Power Efficient Proactive Cooling MechanismsBilge Acun, Eun-Kyung Lee, Yoonho Park, Laxmikant V. Kalé. 94-103 [doi]
- Redundant Arithmetic Based High Speed Carry Free Hybrid Adders with Built-In Scan Chain on FPGAsAyan Palchaudhuri, Anindya Sundar Dhar. 104-113 [doi]
- ConvLight: A Convolutional Accelerator with Memristor Integrated Photonic ComputingDharanidhar Dang, Jyotikrishna Dass, Rabi N. Mahapatra. 114-123 [doi]
- Provably Efficient Scheduling of Dynamically Allocating Programs on Parallel Cache HierarchiesGuy E. Blelloch, Phillip B. Gibbons, Harsha Vardhan Simhadri. 124-133 [doi]
- Further Explorations in State-Space Search for Optimal Task SchedulingMichael Orr, Oliver Sinnen. 134-141 [doi]
- A Novel Approach for Job Scheduling Optimizations Under Power Cap for ARM and Intel HPC SystemsDineshkumar Rajagopal, Daniele Tafani, Yiannis Georgiou, David Glesser, Michael Ott. 142-151 [doi]
- A Memory Congestion-Aware MPI Process Placement for Modern NUMA SystemsMulya Agung, Muhammad Alfian Amrizal, Kazuhiko Komatsu, Ryusuke Egawa, Hiroyuki Takizawa. 152-161 [doi]
- Expander: Lock-Free Cache for a Concurrent Data StructurePooja Aggarwal, Smruti R. Sarangi. 162-171 [doi]
- Adaptive Code Refinement: A Compiler Technique and Extensions to Generate Self-Tuning ApplicationsMaxime Schmitt, Philippe Helluy, Cédric Bastoul. 172-181 [doi]
- Machine Learning @ AmazonRajeev Rastogi. 182 [doi]
- Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and CommunicationSunWoo Lee, Dipendra Jha, Ankit Agrawal, Alok N. Choudhary, Wei-keng Liao. 183-192 [doi]
- Parallel Dynamic Data Driven Approaches for Synthetic Aperture RadarAdeesha Wijayasiri, Tania Banerjee, Sanjay Ranka, Sartaj Sahni, Mark S. Schmalz. 193-202 [doi]
- ARM Wrestling with Big Data: A Study of Commodity ARM64 Server for Big Data WorkloadsJayanth Kalyanasundaram, Yogesh Simmhan. 203-212 [doi]
- MPI-LiFE: Designing High-Performance Linear Fascicle Evaluation of Brain Connectome with MPIShashank Gugnani, Xiaoyi Lu, Franco Pestilli, Cesar F. Caiafa, Dhabaleswar K. Panda. 213-222 [doi]
- Reducing Network Congestion and Synchronization Overhead During Aggregation of Hierarchical DataSidharth Kumar, Duong Hoang, Steve Petruzza, John Edwards, Valerio Pascucci. 223-232 [doi]
- Fast Parallel Randomized QR with Column Pivoting Algorithms for Reliable Low-Rank Matrix ApproximationsJianwei Xiao, Ming Gu, Julien Langou. 233-242 [doi]
- An X10-Based Distributed Streaming Graph Database EngineMiyuru Dayarathna, Sathya Bandara, Nandula Jayamaha, Mahen Herath, Achala Madhushan, Sanath Jayasena, Toyotaro Suzumura. 243-252 [doi]
- GPU-Centric Communication on NVIDIA GPU Clusters with InfiniBand: A Case Study with OpenSHMEMSreeram Potluri, Anshuman Goswami, Davide Rossetti, C. J. Newburn, Manjunath Gorentla Venkata, Neena Imam. 253-262 [doi]
- Distributed Algorithm for High-Utility Subgraph Pattern Mining Over Big Data PlatformsAlind Khare, Vikram Goyal, Srikanth Baride, Sushil K. Prasad, Michael McDermott, Dhara Shah. 263-272 [doi]
- ReCALL: Reordered Cache Aware Locality Based Graph ProcessingKartik Lakhotia, Shreyas G. Singapura, Rajgopal Kannan, Viktor K. Prasanna. 273-282 [doi]
- Characterization of Data Movement Requirements for Sparse Matrix Computations on GPUsSireyya Emre Kurt, Vineeth Thumma, Changwan Hong, Aravind Sukumaran-Rajam, P. Sadayappan. 283-293 [doi]
- Applying Graph Analytics to Understand Compute Core Usage and Publication Trends in a Petascale Supercomputing FacilitySangkuen Lee, Sudharshan S. Vazhkudai, Raghul Gunasekaran. 294-305 [doi]
- Computing Just What You Need: Online Data Analysis and Reduction at Extreme ScalesIan T. Foster. 306 [doi]
- Integrating External Resources with a Task-Based Programming ModelZhihao Jia, Sean Treichler, Galen M. Shipman, Michael Bauer, Noah Watkins, Carlos Maltzahn, Patrick S. McCormick, Alex Aiken. 307-316 [doi]
- Enabling Dependability-Driven Resource Use and Message Log-Analysis for Cluster System DiagnosisEdward Chuah, Arshad Jhumka, Samantha Alt, Theo Damoulas, Nentawe Gurumdimma, Marie-Christine Sawley, William L. Barth, Tommy Minyard, James C. Browne. 317-327 [doi]
- Context-Aware Memory Profiling for Speculative ParallelismChangsu Kim, Juhyun Kim, Juwon Kang, Jae W. Lee, Hanjun Kim. 328-337 [doi]
- Lifting Barriers Using Parallel Polyhedral RegionsHarenome Razanajato, Cédric Bastoul, Vincent Loechner. 338-347 [doi]
- Exploiting Common Neighborhoods to Optimize MPI Neighborhood CollectivesSeyed Hessam Mirsadeghi, Jesper Larsson Träff, Pavan Balaji, Ahmad Afsahi. 348-357 [doi]
- Efficient Fork-Join on GPUs Through Warp SpecializationArpith Chacko Jacob, Alexandre E. Eichenberger, Hyojin Sung, Samuel F. Antão, Gheorghe-Teodor Bercea, Carlo Bertolli, Alexey Bataev, Tian Jin, Tong Chen, Zehra Sura, Georgios Rokos, Kevin O'Brien. 358-367 [doi]
- Thrust++: Extending Thrust Framework for Better Abstraction and PerformanceAjai V. George, Sankar Manoj, Sanket R. Gupte, Sayantan Mitra, Santonu Sarkar. 368-377 [doi]
- A Novel Implementation of 2D3V Particle-in-Cell (PIC) Algorithm for Kepler GPU ArchitectureHarshil Shah, Siddharth Kamaria, Riddhesh Markandeya, Miral Shah, Bhaskar Chaudhury. 378-387 [doi]
- Parallelizing Hines Matrix Solver in Neuron Simulations on GPUDharma Teja Vooturi, Kishore Kothapalli, Upinder Singh Bhalla. 388-397 [doi]
- Building Halo Merger Trees from the Q Continuum SimulationEsteban Rangel, Nicholas Frontiere, Salman Habib, Katrin Heitmann, Wei-keng Liao, Ankit Agrawal, Alok N. Choudhary. 398-407 [doi]
- A Memory-Efficient GPU Method for Hamming and Levenshtein Distance SimilarityAndrew Todd, Marziyeh Nourian, Michela Becchi. 408-418 [doi]