Abstract is missing.
- Algorithm Engineering for Scalable Parallel External SortingPeter Sanders. 1 [doi]
- Power-Aware Replica Placement and Update Strategies in Tree NetworksAnne Benoit, Paul Renaud-Goud, Yves Robert. 2-13 [doi]
- Minimum Cost Resource Allocation for Meeting Job RequirementsVenkatesan T. Chakaravarthy, Gyana R. Parija, Sambuddha Roy, Yogish Sabharwal, Amit Kumar. 14-23 [doi]
- Power and Performance Management in Priority-Type Cluster Computing SystemsKaiqi Xiong. 24-35 [doi]
- Willow: A Control System for Energy and Thermal Adaptive ComputingKrishna Kant, Muthukumar Murugan, David H. C. Du. 36-47 [doi]
- Communication-Avoiding QR Decomposition for GPUsMichael J. Anderson, Grey Ballard, James Demmel, Kurt Keutzer. 48-58 [doi]
- Overlapping Computation and Communication for Advection on Hybrid Parallel ComputersJ. B. White III, Jack J. Dongarra. 59-67 [doi]
- VisIO: Enabling Interactive Visualization of Ultra-Scale, Time Series Data via High-Bandwidth Distributed I/O SystemsChristopher Mitchell, James P. Ahrens, Jun Wang 0001. 68-79 [doi]
- Architectural Constraints to Attain 1 Exaflop/s for Three Scientific Application ClassesAbhinav Bhatele, Pritish Jetley, Hormozd Gahvari, Lukasz Wesolowski, William D. Gropp, Laxmikant V. Kalé. 80-91 [doi]
- A Novel Power Management for CMP Systems in Data-Intensive EnvironmentPengju Shang, Jun Wang 0001. 92-103 [doi]
- Characterization of System Services and Their Performance Impact in Multi-core NodesSeetharami Seelam, Liana Fong, John Lewars, John Divirgilio, Brian F. Veale, Kevin Gildea. 104-117 [doi]
- Automatic Recognition of Performance Idioms in Scientific ApplicationsJiahua He, Allan Snavely, Rob F. Van der Wijngaart, Michael A. Frumkin. 118-127 [doi]
- Iso-Energy-Efficiency: An Approach to Power-Constrained Parallel ComputationShuaiwen Song, Chun-Yi Su, Rong Ge, Abhinav Vishnu, Kirk W. Cameron. 128-139 [doi]
- A Study of Speculative Distributed Scheduling on the Cell/B.EPieter Bellens, Josep M. Pérez, Rosa M. Badia, Jesús Labarta. 140-151 [doi]
- Exploiting Data Similarity to Reduce Memory FootprintsSusmit Biswas, Bronis R. de Supinski, Martin Schulz, Diana Franklin, Timothy Sherwood, Frederic T. Chong. 152-163 [doi]
- The Evaluation of an Effective Out-of-Core Run-Time System in the Context of Parallel Mesh GenerationAndriy Kot, Andrey N. Chernikov, Nikos Chrisochoides. 164-175 [doi]
- Enriching 3-D Video Games on MulticoresRomain Cledat, Tushar Kumar, Jaswanth Sreeram, Santosh Pande. 176-187 [doi]
- On Nonblocking Folded-Clos Networks in Computer Communication EnvironmentsXin Yuan. 188-196 [doi]
- vFtree - A Fat-Tree Routing Algorithm Using Virtual Lanes to Alleviate CongestionWei Lin Guay, Bartosz Bogdanski, Sven-Arne Reinemo, Olav Lysne, Tor Skeie. 197-208 [doi]
- Measuring Temporal Lags in Delay-Tolerant NetworksArnaud Casteigts, Paola Flocchini, Bernard Mans, Nicola Santoro. 209-218 [doi]
- A Lightweight Method for Automated Design of ConvergenceAli Ebnenasir, Aly Farahat. 219-230 [doi]
- Snap-Stabilizing Committee CoordinationBorzoo Bonakdarpour, Stéphane Devismes, Franck Petit. 231-242 [doi]
- SC-OA: A Secure and Efficient Scheme for Origin Authentication of Interdomain Routing in Cloud Computing NetworksZhongjian Le, Naixue Xiong, Bo Yang, Yuezhi Zhou. 243-254 [doi]
- Automatic Library Generation for BLAS3 on GPUsHuimin Cui, Lei Wang 0004, Jingling Xue, Yang Yang, Xiaobing Feng 0002. 255-265 [doi]
- Redesign of Higher-Level Matrix Algorithms for Multicore and Distributed Architectures and Applications in Quantum Monte Carlo SimulationChe-Rung Lee, Zhaojun Bai. 266-274 [doi]
- Challenges of Scaling Algebraic Multigrid Across Modern Multicore ArchitecturesAllison H. Baker, Todd Gamblin, Martin Schulz, Ulrike Meier Yang. 275-286 [doi]
- Hauberk: Lightweight Silent Data Corruption Error Detector for GPGPUKeun Soo Yim, Cuong Pham, Mushfiq Saleheen, Zbigniew Kalbarczyk, Ravishankar K. Iyer. 287-300 [doi]
- A Performance and Area Efficient Architecture for Intrusion Detection SystemsGovind Sreekar Shenoy, Jordi Tubella, Antonio González. 301-310 [doi]
- Time-Ordered Event Traces: A New Debugging Primitive for Concurrency BugsMartin Dimitrov, Huiyang Zhou. 311-321 [doi]
- Singlehop Collaborative Feedback Primitives for Threshold Querying in Wireless Sensor NetworksMurat Demirbas, Serafettin Tasci, Hanifi Gunes, Atri Rudra. 322-333 [doi]
- Completely Distributed Particle Filters for Target Tracking in Sensor NetworksBo Jiang, Binoy Ravindran. 334-344 [doi]
- Connectivity Trade-offs in 3D Wireless Sensor Networks Using Directional AntennaeEvangelos Kranakis, Danny Krizanc, Ashish Modi, Oscar Morales Ponce. 345-351 [doi]
- Distributed Fine-Grained Access Control in Wireless Sensor NetworksSushmita Ruj, Amiya Nayak, Ivan Stojmenovic. 352-362 [doi]
- Design of MILC Lattice QCD Application for GPU ClustersGuochun Shi, Steven A. Gottlieb, Aaron Torok, Volodymyr V. Kindratenko. 363-371 [doi]
- Multifrontal Factorization of Sparse SPD Matrices on GPUsThomas George, Vaibhav Saxena, Anshul Gupta, Amik Singh, Anamitra R. Choudhury. 372-383 [doi]
- Large-Scale Semantic Concept Detection on Manycore Platforms for Multimedia MiningMamadou Diao, Chrysostomos Nicopoulos, Jongman Kim. 384-394 [doi]
- Efficient GPU Implementation for Particle in Cell AlgorithmRejith George Joseph, Girish Ravunnikutty, Sanjay Ranka, Eduardo F. D'Azevedo, Scott Klasky. 395-406 [doi]
- Hardware-Based Job Queue Management for Manycore Architectures and OpenMP EnvironmentsJungHee Lee, Chrysostomos Nicopoulos, Yongjae Lee, Hyung Gyu Lee, Jongman Kim. 407-418 [doi]
- HK-NUCA: Boosting Data Searches in Dynamic Non-Uniform Cache Architectures for Chip MultiprocessorsJavier Lira, Carlos Molina, Antonio González. 419-430 [doi]
- Power Token Balancing: Adapting CMPs to Power Constraints for Parallel Multithreaded WorkloadsJuan M. Cebrian, Juan L. Aragón, Stefanos Kaxiras. 431-442 [doi]
- A Very Fast Simulator for Exploring the Many-Core FutureOlivier Certner, Zheng Li, Arun Raman, Olivier Temam. 443-454 [doi]
- Variable Granularity Access Tracking Scheme for Improving the Performance of Software Transactional MemorySandya S. Mannarswamy, Ramaswamy Govindarajan. 455-466 [doi]
- Automated Architecture-Aware Mapping of Streaming Applications Onto GPUsAndrei Hagiescu, Huynh Phung Huynh, Weng-Fai Wong, Rick Siow Mong Goh. 467-478 [doi]
- Automatic Loop Tiling for Direct Memory AccessHaibo Lin, Tao Liu, Lakshminarayanan Renganarayanan, Huoding Li, Tong Chen, Kevin O'Brien, Ling Shao. 479-489 [doi]
- Tolerant Value Speculation in Coarse-Grain Streaming ComputationsNathaniel Azuelos, Idit Keidar, Ayal Zaks. 490-501 [doi]
- Panel StatementYves Robert, William J. Dally, Jack Dongarra, Satoshi Matsuoka, Robert Schreiber, Horst D. Simon, Uzi Vishkin. 505 [doi]
- Tutorial StatementBruce Palmer, Manojkumar Krishnan, Abhinav Vishnu. 506 [doi]
- Architecture-aware Algorithms and Software for Peta and Exascale ComputingJack Dongarra. 507 [doi]
- Adding a Referee to an Interconnection Network: What Can(not) Be Computed in One RoundFlorent Becker, Martín Matamala, Nicolas Nisse, Ivan Rapaport, Karol Suchan, Ioan Todinca. 508-514 [doi]
- Improved Algorithms for the Distributed Trigger Counting ProblemVenkatesan T. Chakaravarthy, Anamitra R. Choudhury, Yogish Sabharwal. 515-523 [doi]
- The Weighted Byzantine Agreement ProblemVijay K. Garg, John Bridgman. 524-531 [doi]
- Leveraging Social Networks to Combat Collusion in Reputation Systems for Peer-to-Peer NetworksZe Li, Haiying Shen, Karan Sapra. 532-543 [doi]
- Computing Strongly Connected Components in Parallel on CUDAJiri Barnat, Petr Bauch, Lubos Brim, Milan Ceska. 544-555 [doi]
- On Optimal Tree Traversals for Sparse Matrix FactorizationMathias Jacquelin, Loris Marchal, Yves Robert, Bora Uçar. 556-567 [doi]
- Fast Community Detection Algorithm with GPUs and Multicore ArchitecturesJyothish Soman, Ankur Narang. 568-579 [doi]
- A Study of Parallel Particle Tracing for Steady-State and Time-Varying Flow FieldsTom Peterka, Robert B. Ross, Boonthanome Nouanesengsy, Teng-Yok Lee, Han-Wei Shen, Wesley Kendall, Jian Huang. 580-591 [doi]
- Critical Bubble Scheme: An Efficient Implementation of Globally Aware Network Flow ControlLizhong Chen, Ruisheng Wang, Timothy Mark Pinkston. 592-603 [doi]
- A Scalable Reverse Lookup Scheme Using Group-Based Shifted Declustering LayoutJunyao Zhang, Pengju Shang, Jun Wang 0001. 604-615 [doi]
- Deadlock-Free Oblivious Routing for Arbitrary TopologiesJens Domke, Torsten Hoefler, Wolfgang E. Nagel. 616-627 [doi]
- RDMA Capable iWARP over DatagramsRyan E. Grant, Mohammad J. Rashti, Ahmad Afsahi, Pavan Balaji. 628-639 [doi]
- Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI ProgramsZoltán Szebenyi, Todd Gamblin, Martin Schulz, Bronis R. de Supinski, Felix Wolf, Brian J. N. Wylie. 640-651 [doi]
- A Practical Approach for Performance Analysis of Shared-Memory ProgramsBogdan Marius Tudor, Yong Meng Teo. 652-663 [doi]
- Single Node On-Line Simulation of MPI Applications with SMPIPierre-Nicolas Clauss, Mark Stillwell, Stéphane Genaud, Frédéric Suter, Henri Casanova, Martin Quinson. 664-675 [doi]
- PATUS: A Code Generation and Autotuning Framework for Parallel Iterative Stencil Computations on Modern MicroarchitecturesMatthias Christen, Olaf Schenk, Helmar Burkhart. 676-687 [doi]
- Optimizing Large-Scale Graph Analysis on a Multi-threaded, Multi-core PlatformGuojing Cong, Konstantin Makarychev. 688-697 [doi]
- A New Data Layout for Set Intersection on GPUsRasmus Resen Amossen, Rasmus Pagh. 698-708 [doi]
- Partitioning Spatially Located Computations Using RectanglesErik Saule, Erdeniz Ö. Bas, Ümit V. Çatalyürek. 709-720 [doi]
- Reduced-Bandwidth Multithreaded Algorithms for Sparse Matrix-Vector MultiplicationAydin Buluç, Samuel Williams, Leonid Oliker, James Demmel. 721-733 [doi]
- GRAL: A Grouping Algorithm to Optimize Application Placement in Wireless Embedded SystemsNikos Tziritas, Thanasis Loukopoulos, Spyros Lalis, Petros Lampsas. 734-745 [doi]
- Vitis: A Gossip-based Hybrid Overlay for Internet-scale Publish/Subscribe Enabling Rendezvous Routing in Unstructured Overlay NetworksFatemeh Rahimian, Sarunas Girdzijauskas, Amir H. Payberah, Seif Haridi. 746-757 [doi]
- Moving the Code to the Data - Dynamic Code Deployment Using ActiveSpacesCiprian Docan, Manish Parashar, Julian Cummings, Scott Klasky. 758-769 [doi]
- High Performance Scalable and Expressive Modeling Environment to Study Mobile Malware in Large Dynamic NetworksKarthik Channakeshava, Keith R. Bisset, V. S. Anil Kumar, Madhav V. Marathe, Shrirang M. Yardi. 770-781 [doi]
- H-Code: A Hybrid MDS Array Code to Optimize Partial Stripe Writes in RAID-6Chentao Wu, Shenggang Wan, Xubin He, Qiang Cao, Changsheng Xie. 782-793 [doi]
- LACIO: A New Collective I/O Strategy for Parallel I/O SystemsYong Chen, Xian-He Sun, Rajeev Thakur, Philip C. Roth, William D. Gropp. 794-804 [doi]
- Using Shared Memory to Accelerate MapReduce on Graphics Processing UnitsFeng Ji, Xiaosong Ma. 805-816 [doi]
- Unified Signatures for Improving Performance in Transactional MemoryWoojin Choi, Jeff Draper. 817-827 [doi]
- Reducing Fragmentation on Torus-Connected SupercomputersWei Tang, Zhiling Lan, Narayan Desai, Daniel Buettner, Yongen Yu. 828-839 [doi]
- Co-analysis of RAS Log and Job Log on Blue Gene/PZiming Zheng, Li Yu, Wei Tang, Zhiling Lan, Rinku Gupta, Narayan Desai, Susan Coghlan, Daniel Buettner. 840-851 [doi]
- A Quantitative Analysis of OS NoiseAlessandro Morari, Roberto Gioiosa, Robert W. Wisniewski, Francisco J. Cazorla, Mateo Valero. 852-863 [doi]
- CheCL: Transparent Checkpointing and Process Migration of OpenCL ApplicationsHiroyuki Takizawa, Kentaro Koyama, Katsuto Sato, Kazuhiko Komatsu, Hiroaki Kobayashi. 864-876 [doi]
- Panel StatementPer Stenström, Doug Burger, Wen-mei W. Hwu, Vipin Kumar, Kunle Olukotun, David A. Padua, Burton Smith. 877 [doi]
- Power, Programmability, and Granularity: The Challenges of ExaScale ComputingBill Dally. 878 [doi]
- Online Adaptive Code Generation and TuningAnanta Tiwari, Jeffrey K. Hollingsworth. 879-892 [doi]
- GLocks: Efficient Support for Highly-Contended Locks in Many-Core CMPsJosé L. Abellán, Juan Fernández, Manuel E. Acacio. 893-905 [doi]
- Profiling Heterogeneous Multi-GPU Systems to Accelerate Cortically Inspired Learning AlgorithmsAndrew Nere, Atif Hashmi, Mikko H. Lipasti. 906-920 [doi]
- PHAST: Hardware-Accelerated Shortest Path TreesDaniel Delling, Andrew V. Goldberg, Andreas Nowatzyk, Renato Fonseca F. Werneck. 921-931 [doi]
- QR Factorization on a Multicore Node Enhanced with Multiple GPU AcceleratorsEmmanuel Agullo, Cédric Augonnet, Jack Dongarra, Mathieu Faverge, Hatem Ltaief, Samuel Thibault, Stanimire Tomov. 932-943 [doi]
- Two-Stage Tridiagonal Reduction for Dense Symmetric Matrices Using Tile Algorithms on Multicore ArchitecturesPiotr Luszczek, Hatem Ltaief, Jack Dongarra. 944-955 [doi]
- An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPUAndrew A. Davidson, Yao Zhang, John D. Owens. 956-965 [doi]
- A Communication-Avoiding, Hybrid-Parallel, Rank-Revealing Orthogonalization MethodMark Hoemmen. 966-977 [doi]
- Flease - Lease Coordination Without a Lock ServerBjörn Kolbeck, Mikael Högqvist, Jan Stender, Felix Hupfeld. 978-988 [doi]
- Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic MPI ApplicationsAmina Guermouche, Thomas Ropars, Elisabeth Brunet, Marc Snir, Franck Cappello. 989-1000 [doi]
- Minimal Obstructions for the Coordinated Attack Problem and BeyondTristan Fevat, Emmanuel Godard. 1001-1011 [doi]
- Scheduling Parallel Iterative Applications on Volatile ResourcesHenri Casanova, Fanny Dufossé, Yves Robert, Frédéric Vivien. 1012-1023 [doi]
- Shared Resource Monitoring and Throughput Optimization in Cloud-Computing DatacentersJaideep Moses, Ravi Iyer, Ramesh Illikkal, Sadagopan Srinivasan, Konstantinos Aisopos. 1024-1033 [doi]
- The Impact of Soft Resource Allocation on n-Tier Application ScalabilityQingyang Wang, Simon Malkowski, Deepal Jayasinghe, PengCheng Xiong, Calton Pu, Yasuhiko Kanemasa, Motoyuki Kawaba, Lilian Harada. 1034-1045 [doi]
- Profiling Directed NUMA Optimization on Linux Systems: A Case Study of the Gaussian Computational Chemistry CodeRui Yang, Joseph Antony, Alistair P. Rendell, Danny Robson, Peter E. Strazdins. 1046-1057 [doi]
- Model-Driven SIMD Code Generation for a Multi-resolution Tensor KernelKevin Stock, Thomas Henretty, Iyyappa Murugandi, P. Sadayappan, Robert J. Harrison. 1058-1067 [doi]
- Multi-GPU MapReduce on GPU ClustersJeff A. Stuart, John D. Owens. 1068-1079 [doi]
- X10 as a Parallel Language for Scientific Computation: Practice and ExperienceJosh Milthorpe, V. Ganesh, Alistair P. Rendell, David Grove. 1080-1088 [doi]
- Implementation and Performance Evaluation of the HPC Challenge Benchmarks in Coarray Fortran 2.0Guohua Jin, John M. Mellor-Crummey, Laksono Adhianto, William N. Scherer III, Chaoran Yang. 1089-1100 [doi]
- Communication Optimizations for Distributed-Memory X10 ProgramsRajkishore Barik, Jisheng Zhao, David Grove, Igor Peshansky, Zoran Budimlic, Vivek Sarkar. 1101-1113 [doi]
- I/O-Optimal Distribution Sweeping on Private-Cache Chip MultiprocessorsDeepak Ajwani, Nodari Sitchinava, Norbert Zeh. 1114-1123 [doi]
- A Fast Algorithm for Constructing Inverted Files on Heterogeneous PlatformsZheng Wei, Joseph JáJá. 1124-1134 [doi]
- Graph Partitioning with Natural CutsDaniel Delling, Andrew V. Goldberg, Ilya Razenshteyn, Renato Fonseca F. Werneck. 1135-1146 [doi]
- Reader Activation Scheduling in Multi-reader RFID Systems: A Study of General CaseShaoJie Tang, Cheng Wang, Xiang-Yang Li, Changjun Jiang. 1147-1155 [doi]
- Efficient Parallel Scheduling of Malleable TasksPeter Sanders, Jochen Speck. 1156-1166 [doi]
- Offline Scheduling of Multi-threaded Request Streams on a Caching ServerVeronika Rehn-Sonigo, Denis Trystram, Frédéric Wagner, Haifeng Xu, Guochuan Zhang. 1167-1176 [doi]
- Tight Analysis of Relaxed Multi-organization Scheduling AlgorithmsDaniel Cordeiro, Pierre-François Dutot, Grégory Mounié, Denis Trystram. 1177-1186 [doi]
- Scheduling Functionally Heterogeneous Systems with Utilization BalancingYuxiong He, Jie Liu, Hongyang Sun. 1187-1198 [doi]
- Smith-Waterman Alignment of Huge Sequences with GPU in Linear SpaceEdans Flavius de Oliveira Sandes, Alba Cristina Magalhaes Alves de Melo. 1199-1211 [doi]
- Accelerating Protein Sequence Search in a Heterogeneous Computing SystemShucai Xiao, Heshan Lin, Wu-chun Feng. 1212-1222 [doi]
- Parallel Metagenomic Sequence Clustering Via Sketching and Maximal Quasi-clique Enumeration on Map-Reduce CloudsXiao Yang, Jaroslaw Zola, Srinivas Aluru. 1223-1233 [doi]
- Large-Scale Lattice Gas Monte Carlo Simulations for the Generalized Ising ModelTobias C. Kerscher, Stefan Müller, Quinn O. Snell, Gus L. W. Hart. 1234-1241 [doi]
- CATCH: A Cloud-Based Adaptive Data Transfer Service for HPCHenry M. Monti, Ali Raza Butt, Sudharshan S. Vazhkudai. 1242-1253 [doi]
- A Scalable and Elastic Publish/Subscribe ServiceMing Li, Fan Ye, Minkyong Kim, Han Chen, Hui Lei. 1254-1265 [doi]
- CABdedupe: A Causality-Based Deduplication Performance Booster for Cloud Backup ServicesYujuan Tan, Hong Jiang, Dan Feng, Lei Tian, Zhichao Yan. 1266-1277 [doi]
- DryadOpt: Branch-and-Bound on Distributed Data-Parallel Execution EnginesMihai Budiu, Daniel Delling, Renato Fonseca F. Werneck. 1278-1289 [doi]