Abstract is missing.
- Big data: Scale down, scale up, scale outPhillip B. Gibbons. 3 [doi]
- Balanced Coloring for Parallel Computing ApplicationsHao Lu, Mahantesh Halappanavar, Daniel G. Chavarría-Miranda, Assefaw Hadish Gebremedhin, Ananth Kalyanaraman. 7-16 [doi]
- High-Performance Graph Analytics on Manycore ProcessorsGeorge M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri. 17-27 [doi]
- Scalable Community Detection with the Louvain AlgorithmXinyu Que, Fabio Checconi, Fabrizio Petrini, John A. Gunnels. 28-37 [doi]
- Cooperative Computing for Autonomous Data CentersJonathan W. Berry, Michael J. Collins 0003, Aaron Kearns, Cynthia A. Phillips, Jared Saia, Randy Smith. 38-47 [doi]
- Divide and Conquer Symmetric Tridiagonal Eigensolver for Multicore ArchitecturesGregoire Pichon, Azzam Haidar, Mathieu Faverge, Jakub Kurzak. 51-60 [doi]
- SPLATT: Efficient and Parallel Sparse Tensor-Matrix MultiplicationShaden Smith, Niranjay Ravindran, Nicholas D. Sidiropoulos, George Karypis. 61-70 [doi]
- A Sparse Direct Solver for Distributed Memory Xeon Phi-Accelerated SystemsPiyush Sao, Xing Liu, Richard W. Vuduc, Xiaoye S. Li. 71-81 [doi]
- Locality Aware DAG-Scheduling for LU-DecompositionTobias Maier, Peter Sanders, Jochen Speck. 82-92 [doi]
- GASOLIN: Global Arbitration for Streams of Data in Optical LinksJiwei Liu, Jun Yang, Rami G. Melhem. 93-102 [doi]
- Contention-Based Nonminimal Adaptive Routing in High-Radix NetworksPablo Fuentes, Enrique Vallejo 0001, Marina García, Ramón Beivide, Germán Rodríguez, Cyriel Minkenberg, Mateo Valero. 103-112 [doi]
- Identifying the Culprits Behind Network CongestionAbhinav Bhatele, Andrew R. Titus, Jayaraman J. Thiagarajan, Nikhil Jain, Todd Gamblin, Peer-Timo Bremer, Martin Schulz, Laxmikant V. Kalé. 113-122 [doi]
- Embedding Nonblocking Multicast Virtual Networks in Fat-Tree Data CentersJun Duan, Zhiyang Guo, Yuanyuan Yang. 123-132 [doi]
- Cashmere: Heterogeneous Many-Core ComputingPieter Hijma, Ceriel J. H. Jacobs, Rob van Nieuwpoort, Henri E. Bal. 135-145 [doi]
- A Scheduling and Runtime Framework for a Cluster of Heterogeneous Machines with Multiple AcceleratorsTarun Beri, Sorav Bansal, Subodh Kumar. 146-155 [doi]
- Hierarchical DAG Scheduling for Hybrid Distributed SystemsWei Wu, Aurelien Bouteiller, George Bosilca, Mathieu Faverge, Jack Dongarra. 156-165 [doi]
- Pushing the Performance Envelope of Modular Exponentiation Across Multiple Generations of GPUsNiall Emmart, Charles C. Weems. 166-176 [doi]
- Federated Scheduling of Sporadic DAG Task SystemsSanjoy Baruah. 179-186 [doi]
- Addressing Fairness in SMT Multicores with a Progress-Aware SchedulerJosué Feliu, Julio Sahuquillo, Salvador Petit, José Duato. 187-196 [doi]
- Fast and High Quality Topology-Aware Task MappingMehmet Deveci, Kamer Kaya, Bora Uçar, Ümit V. Çatalyürek. 197-206 [doi]
- Workload-Driven VM Consolidation in Cloud Data CentersHao Lin, Xin Qi, Shuo Yang, Samuel P. Midkiff. 207-216 [doi]
- Update Consistency for Wait-Free Concurrent ObjectsMatthieu Perrin, Achour Mostéfaoui, Claude Jard. 219-228 [doi]
- Modeling Energy Consumption of Lock-Free Queue ImplementationsAras Atalar, Anders Gidenstam, Paul Renaud-Goud, Philippas Tsigas. 229-238 [doi]
- A Consistency Framework for Iteration Operations in Concurrent Data StructuresYiannis Nikolakopoulos, Anders Gidenstam, Marina Papatriantafilou, Philippas Tsigas. 239-248 [doi]
- An Automated Framework for Decomposing Memory Transactions to Exploit Partial RollbackAditya Dhoke, Roberto Palmieri, Binoy Ravindran. 249-258 [doi]
- Cracking Down MapReduce Failure Amplification through Analytics Logging and MigrationYandong Wang, Huansong Fu, Weikuan Yu. 261-270 [doi]
- Grouping Blocks for MapReduce Co-LocalityXiao Yu, Bo Hong. 271-280 [doi]
- SMapReduce: Optimising Resource Allocation by Managing Working Slots at RuntimeFeng Liang, Francis C. M. Lau. 281-290 [doi]
- High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMAMd. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Sharmin Islam, Raghunath Rajachandrasekar, Dhabaleswar K. Panda. 291-300 [doi]
- High-Performance Energy-Efficient Recursive Dynamic Programming with Matrix-Multiplication-Like Flexible KernelsJesmin Jahan Tithi, Pramod Ganapathi, Aakrati Talati, Sonal Aggarwal, Rezaul Alam Chowdhury. 303-312 [doi]
- Compiler-Directed Transformation for Higher-Order StencilsProtonu Basu, Mary W. Hall, Samuel Williams, Brian van Straalen, Leonid Oliker, Phillip Colella. 313-323 [doi]
- LUC: Limiting the Unintended Consequences of Power Scaling on Parallel Transaction-Oriented WorkloadsHung-Ching Chang, Bo Li, Godmar Back, Ali Raza Butt, Kirk W. Cameron. 324-333 [doi]
- PowerFCT: Power Optimization of Data Center Network with Flow Completion Time ConstraintsKuangyu Zheng, Xiaodong Wang, Xiaorui Wang. 334-343 [doi]
- Leader Election in Sparse Dynamic Networks with ChurnJohn Augustine, Tejas Kulkarni, Sumathi Sivasubramaniam. 347-356 [doi]
- Online Top-k-Position Monitoring of Distributed Data StreamsAlexander Mäcker, Manuel Malatyali, Friedhelm Meyer auf der Heide. 357-364 [doi]
- DSLR: A Distributed Schedule Length Reduction Algorithm for WSNsAshutosh Bhatia, R. C. Hansdah. 365-374 [doi]
- Logarithmic-Time Complete Visibility for Robots with LightsRamachandran Vaidyanathan, Costas Busch, Jerry L. Trahan, Gokarna Sharma, Suresh Rai. 375-384 [doi]
- Indexing of Spatiotemporal Trajectories for Efficient Distance Threshold Similarity Searches on the GPUMichael G. Gowanlock, Henri Casanova. 387-396 [doi]
- Efficient Selection Algorithm for Fast k-NN Search on GPUsXiaoxin Tang, Zhiyi Huang 0001, David M. Eyers, Steven Mills, Minyi Guo. 397-406 [doi]
- Optimizing Sparse Matrix Operations on GPUs Using Merge PathSteven Dalton, Sean Baxter, Duane Merrill, Luke N. Olson, Michael Garland. 407-416 [doi]
- Performance Engineering of the Kernel Polynomal Method on Large-Scale CPU-GPU SystemsMoritz Kreutzer, Andreas Pieper, Georg Hager, Gerhard Wellein, Andreas Alvermann, Holger Fehske. 417-426 [doi]
- A Batch System with Efficient Adaptive Scheduling for Malleable and Evolving ApplicationsSuraj Prabhakaran, Marcel Neumann, Sebastian Rinke, Felix Wolf, Abhishek Gupta, Laxmikant V. Kalé. 429-438 [doi]
- Improving Batch Scheduling on Blue Gene/Q by Relaxing 5D Torus Network Allocation ConstraintsZhou Zhou, Xu Yang, Zhiling Lan, Paul Rich, Wei Tang, Vitali Morozov, Narayan Desai. 439-448 [doi]
- Quiet Neighborhoods: Key to Protect Job Performance PredictabilityAna Jokanovic, José Carlos Sancho, Germán Rodríguez, Alejandro Lucero, Cyriel Minkenberg, Jesús Labarta. 449-459 [doi]
- Stratified Sampling for Even Workload Partitioning Applied to IDA* and Delaunay AlgorithmsJeeva Paudel, Levi H. S. Lelis, José Nelson Amaral. 460-469 [doi]
- A Scalable Prescriptive Parallel Debugging ModelNicklas Bo Jensen, Niklas Quarfot Nielsen, Gregory L. Lee, Sven Karlsson, Matthew P. LeGendre, Martin Schulz, Dong H. Ahn. 473-483 [doi]
- An Efficient Data-Dependence Profiler for Sequential and Parallel ProgramsZhen Li, Ali Jannesari, Felix Wolf. 484-493 [doi]
- Decentralized Runtime Verification of LTL Specifications in Distributed SystemsMenna Mostafa, Borzoo Bonakdarpour. 494-503 [doi]
- Fast Proof Generation for Verifying Cloud SearchJingyu Zhou, Jiannong Cao, Bin Yao 0002, Minyi Guo. 504-513 [doi]
- Julia: A fresh approach to parallel programmingAlan Edelman. 517 [doi]
- On the Influence of Graph Density on Randomized GossipingRobert Elsässer, Dominik Kaaser. 521-531 [doi]
- Distinct Random Sampling from a Distributed StreamSrikanta Tirthapura. 532-541 [doi]
- Randomized Renaming in Shared Memory SystemsPetra Berenbrink, André Brinkmann, Robert Elsässer, Tom Friedetzky, Lars Nagel. 542-549 [doi]
- Threshold Load Balancing with Weighted TasksPetra Berenbrink, Tom Friedetzky, Frederik Mallmann-Trenn, Sepehr Meshkinfamfard, Chris Wastell. 550-558 [doi]
- merAligner: A Fully Parallel Sequence AlignerEvangelos Georganas, Aydin Buluç, Jarrod Chapman, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick. 561-570 [doi]
- An Algebraic Parallel Treecode in Arbitrary DimensionsWilliam B. March, Bo Xiao, Chenhan D. Yu, George Biros. 571-580 [doi]
- 3D Cartesian Transport Sweep for Massively Parallel Architectures with PaRSECSalli Moustafa, Mathieu Faverge, Laurent Plagne, Pierre Ramet. 581-590 [doi]
- A Pattern Specification and Optimizations Framework for Accelerating Scientific Computations on Heterogeneous ClustersLinchuan Chen, Xin Huo, Gagan Agrawal. 591-600 [doi]
- D-Code: An Efficient RAID-6 Code to Optimize I/O Loads and Read PerformanceYingxun Fu, Jiwu Shu. 603-612 [doi]
- HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid ServersShuibing He, Xian-He Sun, Adnan Haider. 613-622 [doi]
- Opass: Analysis and Optimization of Parallel Data Access on Distributed File SystemsJiangling Yin, Jun Wang, Jian Zhou, Tyler Lukasiewicz, Dan Huang, Junyao Zhang. 623-632 [doi]
- Improving Storage Availability in Cloud-of-Clouds with Hybrid Redundant Data DistributionBo Mao, Suzhen Wu, Hong Jiang. 633-642 [doi]
- Efficient Process Replication for MPI Applications: Sharing Work between ReplicasThomas Ropars, Arnaud Lefray, Dohyun Kim, André Schiper. 645-654 [doi]
- Charm++ and MPI: Combining the Best of Both WorldsNikhil Jain, Abhinav Bhatele, Jae-Seung Yeom, Mark F. Adams, Francesco Miniati, Chao Mei, Laxmikant V. Kalé. 655-664 [doi]
- Casper: An Asynchronous Progress Model for MPI RMA on Many-Core ArchitecturesMin-Si, Antonio J. Peña, Jeff R. Hammond, Pavan Balaji, Masamichi Takagi, Yutaka Ishikawa. 665-676 [doi]
- Scalable Asynchronous Contact Mechanics Using Charm++Xiang Ni, Laxmikant V. Kalé, Rasmus Tamstorf. 677-686 [doi]
- Association Rule Mining with the Micron Automata ProcessorKe Wang, Yanjun Qi, Jeffrey J. Fox, Mircea R. Stan, Kevin Skadron. 689-699 [doi]
- Cichlid: Efficient Large Scale RDFS/OWL Reasoning with SparkRong Gu, Shanyong Wang, Fangfang Wang, Chunfeng Yuan, Yihua Huang. 700-709 [doi]
- Parallel Strategies for Solving Large Unit Commitment Problems in the California ISO Planning ModelGuojing Cong, Carol Meyers, Deepak Rajan, Tiziano Parriani. 710-719 [doi]
- Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel SystemsDheevatsa Mudigere, Srinivas Sridharan, Anand M. Deshpande, JongSoo Park, Alexander Heinecke, Mikhail Smelyanskiy, Bharat Kaul, Pradeep Dubey, Dinesh K. Kaushik, David E. Keyes. 723-732 [doi]
- A Performance Analysis of SIMD Algorithms for Monte Carlo Simulations of Nuclear Reactor CoresDavid Ozog, Allen D. Malony, Andrew R. Siegel. 733-742 [doi]
- Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRALDoru-Thom Popovici, Francis P. Russell, Karl A. Wilkinson, Chris-Kriton Skylaris, Paul H. J. Kelly, Franz Franchetti. 743-752 [doi]
- Parallel Hessian Assembly for Seismic Waveform Inversion Using Global UpdatesScott French, Yili Zheng, Barbara Romanowicz, Katherine A. Yelick. 753-762 [doi]
- Design for a Soft Error Resilient Dynamic Task-Based RuntimeChongxiao Cao, Thomas Hérault, George Bosilca, Jack J. Dongarra. 765-774 [doi]
- Recovering from Overload in Multicore Mixed-Criticality SystemsJeremy P. Erickson, Namhoon Kim, James H. Anderson. 775-785 [doi]
- Investigating the Interplay between Energy Efficiency and Resilience in High Performance ComputingLi Tan, Shuaiwen Leon Song, Panruo Wu, Zizhong Chen, Rong Ge, Darren J. Kerbyson. 786-796 [doi]
- A Hybrid Approach to Processing Big Data Graphs on Memory-Restricted SystemsHarshvardhan, Brandon West, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger. 799-808 [doi]
- Distributed Programming over Time-Series GraphsYogesh L. Simmhan, Neel Choudhury, Charith Wickramaarachchi, Alok Gautam Kumbhare, Marc Frîncu, Cauligi S. Raghavendra, Viktor K. Prasanna. 809-818 [doi]
- Efficient and Simplified Parallel Graph Processing over CPU and MICLinchuan Chen, Xin Huo, Bin Ren, Surabhi Jain, Gagan Agrawal. 819-828 [doi]
- Assisting H1N1 and Ebola Outbreak Response through High Performance Networked EpidemiologyMadhav Marathe. 831 [doi]
- Two-Level Main Memory Co-Design: Multi-threaded Algorithmic Primitives, Analysis, and SimulationMichael A. Bender, Jonathan W. Berry, Simon D. Hammond, K. Scott Hemmert, Samuel McCauley, Branden Moore, Benjamin Moseley, Cynthia A. Phillips, David S. Resnick, Arun Rodrigues. 835-846 [doi]
- CA-SVM: Communication-Avoiding Support Vector Machines on Distributed SystemsYang You, James Demmel, Kenneth Czechowski, Le Song, Richard W. Vuduc. 847-859 [doi]
- Filtering, Reductions and Synchronization in the Anton 2 NetworkJ. P. Grossman, Brian Towles, Brian Greskamp, David E. Shaw. 860-870 [doi]
- Notified Access: Extending Remote Memory Access Programming Models for Producer-Consumer SynchronizationRoberto Belli, Torsten Hoefler. 871-881 [doi]
- 2W-FD: A Failure Detector Algorithm with QoSAlejandro Tomsic, Pierre Sens, Joao Garcia, Luciana Arantes, Julien Sopena. 885-893 [doi]
- Stabilizing Byzantine-Fault Tolerant StorageSilvia Bonomi, Maria Potop-Butucaru, Sébastien Tixeuil. 894-903 [doi]
- Making BFT Protocols Really AdaptiveJean Paul Bahsoun, Rachid Guerraoui, Ali Shoker. 904-913 [doi]
- Exploration of Lossy Compression for Application-Level Checkpoint/RestartNaoto Sasaki, Kento Sato, Toshio Endo, Satoshi Matsuoka. 914-922 [doi]
- Load-Balanced Local Time Stepping for Large-Scale Wave PropagationMax Rietmann, Daniel Peter, Olaf Schenk, Bora Uçar, Marcus J. Grote. 925-935 [doi]
- Towards Balance-Affinity Tradeoff in Concurrent Subgraph TraversalsYinglong Xia, Lifeng Nai, Jui-Hsin Lai. 936-945 [doi]
- Controlled Contention: Balancing Contention and Reservation in Multicore Application SchedulingJingjing Wang, Nael B. Abu-Ghazaleh, Dmitry V. Ponomarev. 946-955 [doi]
- Resource and Deadline-Aware Job Scheduling in Dynamic Hadoop ClustersDazhao Cheng, Jia Rao, Changjun Jiang, Xiaobo Zhou. 956-965 [doi]
- Mitigating the Susceptibility of GPGPUs Register File to Process VariationsJingweijia Tan, Xin Fu. 969-978 [doi]
- PRO: Progress Aware GPU Warp Scheduling AlgorithmJayvant Anantpur, R. Govindarajan. 979-988 [doi]
- Performance Impact of Batching Web-Application Requests Using Hot-Spot Processing on GPUsTobias Fjalling, Per Stenström. 989-999 [doi]
- An Approach for Energy Efficient Execution of Hybrid Parallel ProgramsLavanya Ramapantulu, Dumitrel Loghin, Yong Meng Teo. 1000-1009 [doi]
- Scheduling the I/O of HPC Applications Under CongestionAna Gainaru, Guillaume Aupy, Anne Benoit, Franck Cappello, Yves Robert, Marc Snir. 1013-1022 [doi]
- Leveraging Naturally Distributed Data Redundancy to Reduce Collective I/O Replication OverheadBogdan Nicolae. 1023-1032 [doi]
- Exploring Data Staging Across Deep Memory Hierarchies for Coupled Data Intensive Simulation WorkflowsTong Jin, Fan Zhang, Qian Sun, Hoang Bui, Melissa Romanus, Norbert Podhorszki, Scott Klasky, Hemanth Kolla, Jacqueline Chen, Robert Hager, Choong-Seock Chang, Manish Parashar. 1033-1042 [doi]
- Reducing Vector I/O for Faster GPU Sparse Matrix-Vector MultiplicationPham Nguyen Quang Anh, Rui Fan, Yonggang Wen. 1043-1052 [doi]
- Parallel Graph Partitioning for Complex NetworksHenning Meyerhenke, Peter Sanders, Christian Schulz 0003. 1055-1064 [doi]
- A Self-Stabilizing Memory Efficient Algorithm for the Minimum Diameter Spanning Tree under an Omnipotent DaemonLélia Blin, Fadwa Boubekeur, Swan Dubois. 1065-1074 [doi]
- A Parallel Tree Grafting Algorithm for Maximum Cardinality Matching in Bipartite GraphsAriful Azad, Aydin Buluç, Alex Pothen. 1075-1084 [doi]
- Fair Resource Allocation for Heterogeneous TasksKoyel Mukherjee, Partha Dutta, Gurulingesh Raravi, Thangaraj Rajasubramaniam, Koustuv Dasgupta, Atul Singh. 1087-1096 [doi]
- Resources-Conscious Asynchronous High-Speed Data Transfer in Multicore Systems: Design, Optimizations, and EvaluationTan Li, Yufei Ren, Dantong Yu, Shudong Jin. 1097-1106 [doi]
- RISC: Robust Infrastructure over Shared Computing Resources through Dynamic Pricing and IncentivizationTridib Mukherjee, Partha Dutta, Vinay Gangadhar Hegde, Sujit Gujar. 1107-1116 [doi]
- A Dual-Consistency Cache Coherence ProtocolAlberto Ros, Alexandra Jimborean. 1119-1128 [doi]
- Nexus#: A Distributed Hardware Task Manager for Task-Based Programming ModelsTamer Dallou, Nina Engelhardt, Ahmed Elhossini, Ben H. H. Juurlink. 1129-1138 [doi]
- Minimizing Thermal Variation Across System ComponentsKaicheng Zhang, Seda Ogrenci Memik, Gokhan Memik, Kazutomo Yoshii, Rajesh Sankaran, Peter H. Beckman. 1139-1148 [doi]
- PCERE: Fine-Grained Parallel Benchmark Decomposition for Scalability PredictionMihail Popov, Chadi Akel, Florent Conti, William Jalby, Pablo de Oliveira Castro. 1151-1160 [doi]
- Matching Application Signatures for Performance Predictions Using a Single ExecutionAnirudh Jayakumar, Prakash Murali, Sathish Vadhiyar. 1161-1170 [doi]
- Monitoring Large-Scale Location-Based Information SystemsHammad Khan, Julien Gascon-Samson, Jörg Kienzle, Bettina Kemme. 1171-1181 [doi]