- Disruptive Research and Innovation. Kai Li. 1
- Subgraph Counting: Color Coding Beyond Trees. Venkatesan T. Chakaravarthy, Michael Kapralov, Prakash Murali, Fabrizio Petrini, Xinyu Que, Yogish Sabharwal, Baruch Schieber. 2-11
- A Practical Parallel Algorithm for Diameter Approximation of Massive Weighted Graphs. Matteo Ceccarello, Andrea Pietracaprina, Geppino Pucci, Eli Upfal. 12-21
- Rabbit Order: Just-in-Time Parallel Reordering for Fast Graph Analysis. Junya Arai, Hiroaki Shiokawa, Takeshi Yamamuro, Makoto Onizuka, Sotetsu Iwamura. 22-31
- Distributed-Memory Algorithms for Maximum Cardinality Matching in Bipartite Graphs. Ariful Azad, Aydin Buluç. 32-42
- Automatic Parallel Pattern Detection in the Algorithm Structure Design Space. Zia Ul Huda, Rohit Atre, Ali Jannesari, Felix Wolf. 43-52
- ARCHER: Effectively Spotting Data Races in Large OpenMP Applications. Simone Atzeni, Ganesh Gopalakrishnan, Zvonimir Rakamaric, Dong H. Ahn, Ignacio Laguna, Martin Schulz, Gregory L. Lee, Joachim Protze, Matthias S. Müller. 53-62
- Algorithm and Architecture Independent Benchmarking with SEAK. Nathan R. Tallent, Joseph B. Manzano, Nitin A. Gawande, Seunghwa Kang, Darren J. Kerbyson, Adolfy Hoisie, Joseph K. Cross. 63-72
- Design and Implementation of a Parallel Research Kernel for Assessing Dynamic Load-Balancing Capabilities. Evangelos Georganas, Rob F. Van der Wijngaart, Timothy G. Mattson. 73-82
- VNRE: Flexible and Efficient Acceleration for Network Redundancy Elimination. Xiongzi Ge, Yi Liu, Chengtao Lu, Jim Diehl, David H. C. Du, Liang Zhang, Jian Chen. 83-92
- Analyzing Network Health and Congestion in Dragonfly-Based Supercomputers. Abhinav Bhatele, Nikhil Jain, Yarden Livnat, Valerio Pascucci, Peer-Timo Bremer. 93-102
- Random Regular Graph and Generalized De Bruijn Graph with k-Shortest Path Routing. Peyman Faizian, Md Atiqul Mollah, Xin Yuan, Scott Pakin, Michael Lang 0003. 103-112
- Deflection Containment for Bufferless Network-on-Chips. Xi-Yue Xiang, Nian-Feng Tzeng. 113-122
- RUPS: Fixing Relative Distances among Urban Vehicles with Context-Aware Trajectories. Hongzi Zhu, Shan Chang, Li Lu, Wei Zhang. 123-131
- Hybrid Dynamic Trees for Extreme-Resolution 3D Sparse Data Modeling. Mohammad M. Hossain, Thomas M. Tucker, Thomas R. Kurfess, Richard W. Vuduc. 132-141
- Optimization of an Electromagnetics Code with Multicore Wavefront Diamond Blocking and Multi-dimensional Intra-Tile Parallelization. Tareq M. Malas, Julian Hornich, Georg Hager, Hatem Ltaief, Christoph Pflaum, David E. Keyes. 142-151
- Order-Invariant Real Number Summation: Circumventing Accuracy Loss for Multimillion Summands on Multiple Parallel Architectures. Patrick E. Small, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta. 152-160
- INV-ASKIT: A Parallel Fast Direct Solver for Kernel Matrices. Chenhan D. Yu, William B. March, Bo Xiao, George Biros. 161-171
- A Fast Tridiagonal Solver for Intel MIC Architecture. Xinliang Wang, Wei Xue, Jidong Zhai, Yangtong Xu, Weimin Zheng, Hai Xiang Lin. 172-181
- A Relaxed Synchronization Approach for Solving Parallel Quadratic Programming Problems with Guaranteed Convergence. Kooktae Lee, Raktim Bhattacharya, Jyotikrishna Dass, V. N. S. Prithvi Sakuru, Rabi N. Mahapatra. 182-191
- Enhancing Scalability and Load Balancing of Parallel Selected Inversion via Tree-Based Asynchronous Communication. Mathias Jacquelin, Lin Lin, Nathan Wichmann, Chao Yang. 192-201
- Optimal Resilience Patterns to Cope with Fail-Stop and Silent Errors. Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun. 202-211
- Reducing Waste in Extreme Scale Systems through Introspective Analysis. Leonardo Arturo Bautista Gomez, Ana Gainaru, Swann Perarnau, Devesh Tiwari, Saurabh Gupta, Christian Engelmann, Franck Cappello, Marc Snir. 212-221
- Fault Modeling of Extreme Scale Applications Using Machine Learning. Abhinav Vishnu, Hubertus van Dam, Nathan R. Tallent, Darren J. Kerbyson, Adolfy Hoisie. 222-231
- Efficient Checkpointing of Multi-threaded Applications as a Tool for Debugging, Performance Tuning, and Resiliency. Max Grossman, Vivek Sarkar. 232-241
- X: A Comprehensive Analytic Model for Parallel Machines. Ang Li, Shuaiwen Leon Song, Eric Brugel, Akash Kumar 0001, Daniel G. Chavarría-Miranda, Henk Corporaal. 242-252
- NiMC: Characterizing and Eliminating Network-Induced Memory Contention. Taylor L. Groves, Ryan E. Grant, Dorian C. Arnold. 253-262
- An Early Performance Study of Large-Scale POWER8 SMP Systems. Xing Liu, Daniele Buono, Fabio Checconi, Jee Choi, Xinyu Que, Fabrizio Petrini, John A. Gunnels, Jeff Stuecheli. 263-272
- A Methodology for Modeling Dynamic and Static Power Consumption for Multicore Processors. Bhavishya Goel, Sally A. McKee. 273-282
- Algorithmic Techniques for Solving Graph Problems on the Automata Processor. Indranil Roy, Nagakishore Jammula, Srinivas Aluru. 283-292
- A Case Study of Complex Graph Analysis in Distributed Memory: Implementation and Optimization. George M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri. 293-302
- FastBFS: Fast Breadth-First Graph Search on a Single Server. Shu-han Cheng, Guangyan Zhang, Jiwu Shu, Qingda Hu, Weimin Zheng. 303-312
- GraphPad: Optimized Graph Primitives for Parallel and Distributed Platforms. Michael J. Anderson, Narayanan Sundaram, Nadathur Satish, Md. Mostofa Ali Patwary, Theodore L. Willke, Pradeep Dubey. 313-322
- On First Fit Bin Packing for Online Cloud Server Allocation. Xueyan Tang, Yusen Li, Runtian Ren, Wentong Cai. 323-332
- Smoothed Online Resource Allocation in Multi-tier Distributed Cloud Networks. Lei Jiao, Antonia Maria Tulino, Jaime Llorca, Yue Jin, Alessandra Sala. 333-342
- Dynamic Acceleration of Parallel Applications in Cloud Platforms by Adaptive Time-Slice Control. Song Wu, Zhenjiang Xie, Haibao Chen, Sheng Di, Xinyu Zhao, Hai Jin. 343-352
- Mystic: Predictive Scheduling for GPU Based Cloud Servers Using Machine Learning. Yash Ukidave, Xiangyu Li, David R. Kaeli. 353-362
- TintMalloc: Reducing Memory Access Divergence via Controller-Aware Coloring. Xing Pan, Yasaswini Jyothi Gownivaripalli, Frank Mueller. 363-372
- Markov Chain-Based Adaptive Scheduling in Software Transactional Memory. Pierangelo di Sanzo, Marco Sannicandro, Bruno Ciciani, Francesco Quaglia. 373-382
- MEMTUNE: Dynamic Memory Management for In-Memory Data Analytic Platforms. Luna Xu, Min Li, Li Zhang, Ali Raza Butt, Yandong Wang, Zane Zhenhua Hu. 383-392
- High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits. Dipti Shankar, Xiaoyi Lu, Nusrat S. Islam, Md. Wasi-ur-Rahman, Dhabaleswar K. Panda. 393-402
- GreenMatch: Renewable-Aware Workload Scheduling for Massive Storage Systems. Xiaoyang Qu, Jiguang Wan, Jun Wang 0001, Liqiong Liu, Dan Luo, Changsheng Xie. 403-412
- CATA: Criticality Aware Task Acceleration for Multicore Processors. Emilio Castillo, Miquel Moretó, Marc Casas, Lluc Alvarez, Enrique Vallejo 0001, Kallia Chronaki, Rosa M. Badia, José Luis Bosque, Ramón Beivide, Eduard Ayguadé, Jesús Labarta, Mateo Valero. 413-422
- TECfan: Coordinating Thermoelectric Cooler, Fan, and DVFS for CMP Energy Optimization. Wenli Zheng, Kai Ma, Xiaorui Wang. 423-432
- Utility Maximizing Thread Assignment and Resource Allocation. Pan Lai, Rui Fan, Wei Zhang, Fang Liu. 433-442
- A Hybrid Decomposition Parallel Algorithm for Multi-scale Simulation of Viscoelastic Fluids. Xiaowei Guo, Xinhai Xu, Qian Wang, Hao Li, Xiaoguang Ren, Liyang Xu, Xuejun Yang. 443-452
- A Hartree-Fock Application Using UPC++ and the New DArray Library. David Ozog, Amir Kamil, Yili Zheng, Paul Hargrove, Jeff R. Hammond, Allen D. Malony, Wibe De Jong, Kathy Yelick. 453-462
- A Fast Selected Inversion Algorithm for Green's Function Calculation in Many-Body Quantum Monte Carlo Simulations. Chengming Jiang, Zhaojun Bai, Richard Scalettar. 463-472
- Memory, Storage and Processing in Future Parallel and Distributed Processing Systems. J. Thomas Pawlowski. 473
- A New Approximation Algorithm for Matrix Partitioning in Presence of Strongly Heterogeneous Processors. Olivier Beaumont, Lionel Eyraud-Dubois, Thomas Lambert. 474-483
- Structural Clustering: A New Approach to Support Performance Analysis at Scale. Matthias Weber, Ronny Brendel, Tobias Hilbrich, Kathryn Mohror, Martin Schulz, Holger Brunst. 484-493
- PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures. Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter J. Sadowski, Evan Racah, Surendra Byna, Craig Tull, Wahid Bhimji, Prabhat, Pradeep Dubey. 494-503
- DataNet: A Data Distribution-Aware Method for Sub-Dataset Analysis on Distributed File Systems. Jun Wang, Jiangling Yin, Jian Zhou, Xuhong Zhang, RuiJun Wang. 504-513
- Synchronization Trade-Offs in GPU Implementations of Graph Algorithms. Rashid Kaleem, Anand Venkat, Sreepathi Pai, Mary W. Hall, Keshav Pingali. 514-523
- Eliminating Intra-Warp Load Imbalance in Irregular Nested Patterns via Collaborative Task Engagement. Farzad Khorasani, Bryan Rowe, Rajiv Gupta, Laxmi N. Bhuyan. 524-533
- Compiler-Assisted Workload Consolidation for Efficient Dynamic Parallelism on GPU. Hancheng Wu, Da Li, Michela Becchi. 534-543
- OpenACC to FPGA: A Framework for Directive-Based High-Performance Reconfigurable Computing. Seyong Lee, Jungwon Kim, Jeffrey S. Vetter. 544-554
- Architecting and Programming a Hardware-Incoherent Multiprocessor Cache Hierarchy. Wooil Kim, Sanket Tavarageri, P. Sadayappan, Josep Torrellas. 555-565
- Refree: A Refresh-Free Hybrid DRAM/PCM Main Memory System. Bahareh Pourshirazi, Zhichun Zhu. 566-575
- Re-NUCA: A Practical NUCA Architecture for ReRAM Based Last-Level Caches. Jagadish Kotra, Mohammad Arjomand, Diana Guttman, Mahmut T. Kandemir, Chita R. Das. 576-585
- Evaluating and Improving Thread-Level Speculation in Hardware Transactional Memories. Juan Salamanca, José Nelson Amaral, Guido Araujo. 586-595
- System Noise Revisited: Enabling Application Scalability and Reproducibility with SMT. Edgar A. León, Ian Karlin, Adam Moody. 596-607
- Key/Value-Enabled Flash Memory for Complex Scientific Workflows with On-Line Analysis and Visualization. Stefan Eilemann, Fabien Delalondre, Jon Bernard, Judit Planas, Felix Schürmann, John Biddiscombe, Costas Bekas, Alessandro Curioni, Bernard Metzler, Peter Kaltstein, Peter Morjan, Joachim Fenkes, Ralph Bellofatto, Lars Schneidenbach, T. J. Christopher Ward, Blake G. Fitch. 608-617
- Fast Classification of MPI Applications Using Lamport's Logical Clocks. Zhou Tong, Scott Pakin, Michael Lang 0003, Xin Yuan. 618-627
- Online-Autotuning of Parallel SAH kD-Trees. Martin Tillmann, Philip Pfaffe, Christopher Kaag, Walter F. Tichy. 628-637
- Polynomial-Time Construction of Optimal MPI Derived Datatype Trees. Robert Ganian, Martin Kalany, Stefan Szeider, Jesper Larsson Träff. 638-647
- Write-Avoiding Algorithms. Erin Carson, James Demmel, Laura Grigori, Nicholas Knight, Penporn Koanantakool, Oded Schwartz, Harsha Vardhan Simhadri. 648-658
- Communication Efficient Algorithms for Top-k Selection Problems. Lorenz Hübschle-Schneider, Peter Sanders. 659-668
- Minimal Aggregated Shared Memory Messaging on Distributed Memory Supercomputers. Benjamin Jamroz, John M. Dennis. 669-678
- Never Say Never - Probabilistic and Temporal Failure Detectors. Dacfey Dzung, Rachid Guerraoui, David Kozhaya, Yvonne Anne Pignolet. 679-688
- Gathering a Closed Chain of Robots on a Grid. Sebastian Abshoff, Andreas Cord-Landwehr, Matthias Fischer 0001, Daniel Jung, Friedhelm Meyer auf der Heide. 689-699
- On Competitive Algorithms for Approximations of Top-k-Position Monitoring of Distributed Streams. Alexander Mäcker, Manuel Malatyali, Friedhelm Meyer auf der Heide. 700-709
- Towards a Restrained Use of Non-Equivocation for Achieving Iterative Approximate Byzantine Consensus. Chuanyou Li, Michel Hurfin, Yun Wang, Lei Yu. 710-719
- Storage-Optimized Data-Atomic Algorithms for Handling Erasures and Errors in Distributed Storage Systems. Kishori M. Konwar, N. Prakash, Erez Kantor, Nancy A. Lynch, Muriel Médard, Alexander A. Schwarzmann. 720-729
- Fast Error-Bounded Lossy HPC Data Compression with SZ. Sheng Di, Franck Cappello. 730-739
- I/O Aware Power Shifting. Lee Savoie, David K. Lowenthal, Bronis R. de Supinski, Tanzima Islam, Kathryn Mohror, Barry Rountree, Martin Schulz. 740-749
- On the Root Causes of Cross-Application I/O Interference in HPC Storage Systems. Orcun Yildiz, Matthieu Dorier, Shadi Ibrahim, Robert B. Ross, Gabriel Antoniu. 750-759
- Exploiting Variant-Based Parallelism for Data Mining of Space Weather Phenomena. Michael G. Gowanlock, David M. Blair, Victor Pankratius. 760-769
- Solving Open MIP Instances with ParaSCIP on Supercomputers Using up to 80,000 Cores. Yuji Shinano, Tobias Achterberg, Timo Berthold, Stefan Heinz, Thorsten Koch, Michael Winkler. 770-779
- AAlign: A SIMD Framework for Pairwise Sequence Alignment on x86-Based Multi- and Many-Core Processors. Kaixi Hou, Hao Wang 0002, Wu-chun Feng. 780-789
- Mendel: A Distributed Storage Framework for Similarity Searching over Sequencing Data. Cameron Tolooee, Sangmi Lee Pallickara, Asa Ben-Hur. 790-799
- Unlocking the Mysteries of the Universe with Supercomputers. Katrin Heitmann. 800
- ZNN - A Fast and Scalable Algorithm for Training 3D Convolutional Networks on Multi-core and Many-Core Shared Memory Machines. Aleksandar Zlateski, Kisuk Lee, H. Sebastian Seung. 801-811
- Stochastic Matrix-Function Estimators: Scalable Big-Data Kernels with High Performance. Peter W. J. Staar, Panagiotis Kl. Barkoutsos, Roxana Istrate, A. Cristiano I. Malossi, Ivano Tavernelli, Nikolaj Moll, Heiner Giefers, Christoph Hagleitner, Costas Bekas, Alessandro Curioni. 812-821
- Discrete Cache Insertion Policies for Shared Last Level Cache Management on Large Multicores. Aswinkumar Sridharan, André Seznec. 822-831
- Massively Parallel First-Principles Simulation of Electron Dynamics in Materials. Erik W. Draeger, Xavier Andrade, John A. Gunnels, Abhinav Bhatele, Andre Schleife, Alfredo A. Correa. 832-841
- Communication-Avoiding Parallel Sparse-Dense Matrix-Matrix Multiplication. Penporn Koanantakool, Ariful Azad, Aydin Buluç, Dmitriy Morozov, Sang-Yun Oh, Leonid Oliker, Katherine A. Yelick. 842-853
- Petascale Local Time Stepping for the ADER-DG Finite Element Method. Alexander Breuer, Alexander Heinecke, Michael Bader. 854-863
- Asymptotic Optimality of Parallel Short Division. Niall Emmart, Charles C. Weems. 864-872
- High Performance Parallel Stochastic Gradient Descent in Shared Memory. Scott Sallinen, Nadathur Satish, Mikhail Smelyanskiy, Samantika S. Sury, Christopher Ré. 873-882
- Optimal Algorithms for Graphs and Images on a Shared Memory Mesh. Yujie An, Quentin F. Stout. 883-891
- Parallel Graph Coloring for Manycore Architectures. Mehmet Deveci, Erik G. Boman, Karen D. Devine, Sivasankaran Rajamanickam. 892-901
- A Medium-Grained Algorithm for Sparse Tensor Factorization. Shaden Smith, George Karypis. 902-911
- Parallel Tensor Compression for Large-Scale Scientific Data. Woody Austin, Grey Ballard, Tamara G. Kolda. 912-922
- GinFlow: A Decentralised Adaptive Workflow Execution Manager. Javier Rojas Balderrama, Matthieu Simonin, Cédric Tedeschi. 923-932
- Hierarchical Parallel Dynamic Dependence Analysis for Recursively Task-Parallel Programs. Nikolaos Papakonstantinou, Foivos S. Zakkak, Polyvios Pratikakis. 933-942
- MPMD Framework for Offloading Load Balance Computation. Olga Pearce, Todd Gamblin, Bronis R. de Supinski, Martin Schulz, Nancy M. Amato. 943-952
- Integrating Abstractions to Enhance the Execution of Distributed Applications. Matteo Turilli, Feng Liu, Zhao Zhang, André Merzky, Michael Wilde, Jon B. Weissman, Daniel S. Katz, Shantenu Jha. 953-962
- cusFFT: A High-Performance Sparse Fast Fourier Transform Algorithm on GPUs. Cheng Wang, Sunita Chandrasekaran, Barbara M. Chapman. 963-972
- Balancing Scalar and Vector Execution on GPU Architectures. Zhongliang Chen, David R. Kaeli. 973-982
- Exploiting Maximal Overlap for Non-Contiguous Data Movement Processing on Modern GPU-Enabled Systems. Ching-Hsiang Chu, Khaled Hamidouche, Akshay Venkatesh, Dip Sankar Banerjee, Hari Subramoni, Dhabaleswar K. Panda. 983-992
- Online Algorithm-Based Fault Tolerance for Cholesky Decomposition on Heterogeneous Systems with GPUs. Jieyang Chen, Xin Liang, Zizhong Chen. 993-1002
- Reusable Resource Scheduling via Colored Interval Covering. Venkatesan T. Chakaravarthy, Sreyash Kenkre, Sakib A. Mondal, Vinayaka Pandit, Yogish Sabharwal. 1003-1012
- Partitioned Feasibility Tests for Sporadic Tasks on Heterogeneous Machines. Shaurya Ahuja, Kefu Lu, Benjamin Moseley. 1013-1020
- Are Static Schedules so Bad? A Case Study on Cholesky Factorization. Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, Suraj Kumar. 1021-1030
- Optimization and Analysis of MPI Collective Communication on Fat-Tree Networks. Sameer Kumar, Sameh Sharkawi, K. A. Nysal Jan. 1031-1040
- On the Scalability, Performance Isolation and Device Driver Transparency of the IHK/McKernel Hybrid Lightweight Kernel. Balazs Gerofi, Masamichi Takagi, Atsushi Hori, Gou Nakamura, Tomoki Shirasawa, Yutaka Ishikawa. 1041-1050
- ZCCloud: Exploring Wasted Green Power for High-Performance Computing. Fan Yang, Andrew A. Chien. 1051-1060
- Agile Live Migration of Virtual Machines. Umesh Deshpande, Danny Chan, Ten-Young Guh, James Edouard, Kartik Gopalan, Nilton Bila. 1061-1070
- Lazy Repair for Addition of Fault-Tolerance to Distributed Programs. Mohammad Roohitavaf, Yiyan Lin, Sandeep S. Kulkarni. 1071-1080
- Security RBSG: Protecting Phase Change Memory with Security-Level Adjustable Dynamic Mapping. Fangting Huang, Dan Feng, Wen Xia, Wen Zhou, Yucheng Zhang, Min Fu, Chuntao Jiang, Yukun Zhou. 1081-1090
- Mitigation of Denial of Service Attack with Hardware Trojans in NoC Architectures. Travis Boraten, Avinash Karanth Kodi. 1091-1100
- CRC-Based Memory Reliability for Task-Parallel HPC Applications. Omer Subasi, Osman S. Ünsal, Jesús Labarta, Gulay Yalcin, Adrián Cristal. 1101-1112
- Differentiated Scheduling of Response-Critical and Best-Effort Wide-Area Data Transfers. Rajkumar Kettimuthu, Gagan Agrawal, P. Sadayappan, Ian T. Foster. 1113-1122
- High Performance Pattern Matching Using the Automata Processor. Indranil Roy, Ankit Srivastava, Marziyeh Nourian, Michela Becchi, Srinivas Aluru. 1123-1132
- GPU-Accelerated Outlier Detection for Continuous Data Streams. Chandima Hewa Nadungodage, Yuni Xia, John Jaehwan Lee. 1133-1142
- NEPTUNE: Real Time Stream Processing for Internet of Things and Sensing Environments. Thilina Buddhika, Shrideep Pallickara. 1143-1152