- Multiphase LBM Distributed over Multiple GPUs. Carlos Rosales. 1-7
- Performance Emulation of Cell-Based AMR Cosmology Simulations. Jingjin Wu, Roberto E. González, Zhiling Lan, Nickolay Y. Gnedin, Andrey V. Kravtsov, Douglas H. Rudd, Yongen Yu. 8-16
- BMF: Bitmapped Mass Fingerprinting for Fast Protein Identification. Weikuan Yu, K. John Wu, Wei-Shinn Ku, Cong Xu, Juan Gao. 17-25
- Optimizing Network I/O Virtualization with Efficient Interrupt Coalescing and Virtual Receive Side Scaling. Yaozu Dong, Dongxiao Xu, Yang Zhang, Guangdeng Liao. 26-34
- RDMA Based Replication of Multiprocessor Virtual Machines over High-Performance Interconnects. Balazs Gerofi, Yutaka Ishikawa. 35-44
- ResourceExchange: Latency-Aware Scheduling in Virtualized Environments with High Performance Fabrics. Adit Ranadive, Ada Gavrilovska, Karsten Schwan. 45-53
- Large-Scale Simulator for Global Data Infrastructure Optimization. Sergio Herrero-Lopez, John R. Williams, Abel Sanchez. 54-64
- Achieving Scalable Parallelization for the Hessenberg Factorization. Anthony M. Castaldo, R. Clint Whaley. 65-73
- Design and Implementation of Broadcast Algorithms for Extreme-Scale Systems. Pavel Shamis, Richard L. Graham, Manjunath Gorentla Venkata, Joshua Ladd. 74-83
- Model-Driven Simulation to Evaluate Performance Impact of Workload Features on Parallel Systems. Tran Ngoc Minh, Lex Wolters. 84-92
- EDO: Improving Read Performance for Scientific Applications through Elastic Data Organization. Yuan Tian, Scott Klasky, Hasan Abbasi, Jay F. Lofstead, Ray W. Grout, Norbert Podhorszki, Qing Liu, Yandong Wang, Weikuan Yu. 93-102
- PIDX: Efficient Parallel I/O for Multi-resolution Multi-dimensional Scientific Datasets. Sidharth Kumar, Venkatram Vishwanath, Philip H. Carns, Brian Summa, Giorgio Scorzelli, Valerio Pascucci, Robert B. Ross, Jacqueline Chen, Hemanth Kolla, Ray W. Grout. 103-111
- AA-Dedupe: An Application-Aware Source Deduplication Approach for Cloud Backup Services in the Personal Computing Environment. Yinjin Fu, Hong Jiang, Nong Xiao, Lei Tian, Fang Liu. 112-120
- Incorporating Network RAM and Flash into Fast Backing Store for Clusters. Tia Newhall, Douglas Woos. 121-129
- Design of HPC Node with Heterogeneous Processors. Zheng Cao, Hongwei Tang, Qiang Li, Bo Li, Fei Chen, Kai Wang, Xuejun An, Ninghui Sun. 130-138
- Performance Analysis and Benchmarking of the Intel SCC. Philipp Gschwandtner, Thomas Fahringer, Radu Prodan. 139-149
- Supporting Computing Element Heterogeneity in P2P Grids. Jaehwan Lee, Peter J. Keleher, Alan Sussman. 150-158
- DARE: Adaptive Data Replication for Efficient Cluster Scheduling. Cristina L. Abad, Yi Lu, Roy H. Campbell. 159-168
- A Framework for Data-Intensive Computing with Cloud Bursting. Tekin Bicer, David Chiu, Gagan Agrawal. 169-177
- Automatic Hybrid OpenMP + MPI Program Generation for Dynamic Programming Problems. Denny R. Vandenberg, Quentin F. Stout. 178-186
- On Scalability for MPI Runtime Systems. George Bosilca, Thomas Hérault, Ala Rezmerita, Jack Dongarra. 187-195
- Process Distance-Aware Adaptive MPI Collective Communications. Teng Ma, Thomas Hérault, George Bosilca, Jack J. Dongarra. 196-204
- Experience on Comparison of Operating Systems Scalability on the Multi-core Architecture. Yan Cui, Yingxin Wang, Yu Chen, Yuanchun Shi. 205-215
- Automatic Computer System Characterization for a Parallelizing Compiler. Alan Sussman, Norman Lo, Timothy Anderson. 216-224
- Energy Templates: Exploiting Application Information to Save Energy. Darren J. Kerbyson, Abhinav Vishnu, Kevin J. Barker. 225-233
- Performance Characterization and Optimization of Atomic Operations on AMD GPUs. Marwa Elteir, Heshan Lin, Wu-chun Feng. 234-243
- Analyzing the Performance Bottlenecks of the POWER7-IH Network. Darren J. Kerbyson, Kevin J. Barker. 244-252
- Play It Again, SimMR! Abhishek Verma, Ludmila Cherkasova, Roy H. Campbell. 253-261
- An ISO-Energy-Efficient Approach to Scalable System Power-Performance Optimization. Shuaiwen Song, Matthew Grove, Kirk W. Cameron. 262-271
- High Performance Dense Linear System Solver with Soft Error Resilience. Peng Du, Piotr Luszczek, Jack Dongarra. 272-280
- Dynamic Load Balance for Optimized Message Logging in Fault Tolerant HPC Applications. Esteban Meneses, Laxmikant V. Kalé, Greg Bronevetsky. 281-289
- Accelerating Galois Field Arithmetic for Reed-Solomon Erasure Codes in Storage Applications. Sebastian Kalcher, Volker Lindenstruth. 290-298
- A Sampling-Based Approach for Communication Libraries Auto-Tuning. Elisabeth Brunet, François Trahay, Alexandre Denis, Raymond Namyst. 299-307
- Optimized Non-contiguous MPI Datatype Communication for GPU Clusters: Design, Implementation and Evaluation with MVAPICH2. Hao Wang, Sreeram Potluri, Miao Luo, Ashish Kumar Singh, Xiangyong Ouyang, Sayantan Sur, Dhabaleswar K. Panda. 308-316
- Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Clusters. Hari Subramoni, Krishna Chaitanya Kandalla, Jérôme Vienne, Sayantan Sur, B. Barth, Karen A. Tomko, R. Mclay, Karl W. Schulz, Dhabaleswar K. Panda. 317-325
- An RMS for Non-predictably Evolving Applications. Cristian Klein, Christian Pérez. 326-334
- Automatic Task Re-organization in MapReduce. Zhenhua Guo, Marlon E. Pierce, Geoffrey Fox, Mo Zhou. 335-343
- Evolutionary Scheduling of Parallel Tasks Graphs onto Homogeneous Clusters. Sascha Hunold, Joachim Lepping. 344-352
- Symphony: A Scheduler for Client-Server Applications on Coprocessor-Based Heterogeneous Clusters. M. Mustafa Rafique, Srihari Cadambi, Kunal Rao, Ali Raza Butt, Srimat T. Chakradhar. 353-362
- Multicore/GPGPU Portable Computational Kernels via Multidimensional Arrays. H. Carter Edwards, Daniel Sunderland, Chris Amsler, Sam Mish. 363-370
- Implementation of Multigrid on QPACE. Matthias Bolten, Daniel Brinkers, Ulrich Rüde, Markus Stürmer. 371-377
- Heterogeneous Cloud Computing. Stephen P. Crago, Kyle Dunn, Patrick Eads, Lorin Hochstein, Dong-In Kang, Mikyung Kang, Devendra Modium, Karandeep Singh, Jinwoo Suh, John Paul Walters. 378-385
- Exploring Fine-Grained Task-Based Execution on Multi-GPU Systems. Long Chen, Oreste Villa, Guang R. Gao. 386-394
- Performance Portability of a GPU Enabled Factorization with the DAGuE Framework. George Bosilca, Aurelien Bouteiller, Thomas Hérault, Pierre Lemarinier, Narapat Ohm Saengpatsa, Stanimire Tomov, Jack J. Dongarra. 395-402
- CULZSS: LZSS Lossless Data Compression on CUDA. Adnan Ozsoy, D. Martin Swany. 403-411
- Quartile and Outlier Detection on Heterogeneous Clusters Using Distributed Radix Sort. Kyle Spafford, Jeremy S. Meredith, Jeffrey S. Vetter. 412-419
- MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefit. Ashish Kumar Singh, Sreeram Potluri, Hao Wang, Krishna Chaitanya Kandalla, Sayantan Sur, Dhabaleswar K. Panda. 420-427
- Automatically Selecting the Number of Aggregators for Collective I/O Operations. Mohamad Chaarawi, Edgar Gabriel. 428-437
- Improving I/O Forwarding Throughput with Data Compression. Benjamin Welton, Dries Kimpe, Jason Cope, Christina M. Patrick, Kamil Iskra, Robert B. Ross. 438-445
- Application I/O and Data Management. William W. Dai. 446-454
- FastQuery: A Parallel Indexing System for Scientific Data. Jerry Chou, Kesheng Wu, Prabhat. 455-464
- Parallel I/O Performance for Application-Level Checkpointing on the Blue Gene/P System. Jing Fu, Misun Min, Robert Latham, Christopher D. Carothers. 465-473
- Methodology for Performance Evaluation of the Input/Output System on Computer Clusters. Sandra Méndez, Dolores Rexachs, Emilio Luque. 474-483
- Can a Decentralized Metadata Service Layer Benefit Parallel Filesystems? Vilobh Meshram, Xavier Besseron, Xiangyong Ouyang, Raghunath Rajachandrasekar, Ravi Prakash, Dhabaleswar K. Panda. 484-493
- Asynchronous Collective Output with Non-dedicated Cores. Phil Miller, Shen Li, Chao Mei. 494-502
- Improving PCM Endurance with Randomized Address Remapping in Hybrid Memory System. Gang Wu, Jian Gao, Huxing Zhang, Yaozu Dong. 503-507
- HEaRS: A Hierarchical Energy-Aware Resource Scheduler for Virtualized Data Centers. Hui Chen, Meina Song, Junde Song, Ada Gavrilovska, Karsten Schwan. 508-512
- Parallel Greedy Genetic Algorithm for Job Scheduling in Cluster Environments. Gholamali Rahnavard, Jharrod Lafon, Hadi Sharifi. 513-516
- Scheduling Workflows in Opportunistic Environments. Maria del Mar Lopez, Elisa Heymann, Miquel A. Senar. 517-521
- TDP-Shell: A Generic Framework to Improve Interoperability between Batch Queue Systems and Monitoring Tools. Vicente Ivars, Miquel A. Senar, Elisa Heymann. 522-526
- Locality-Aware Parallel Process Mapping for Multi-core HPC Systems. Joshua Hursey, Jeffrey M. Squyres, Terry Dontje. 527-531
- Evaluating Performance Impacts of Delayed Failure Repairing on Large-Scale Systems. Zhou Zhou, Wei Tang, Ziming Zheng, Zhiling Lan, Narayan Desai. 532-536
- Reservation-Based Overbooking for HPC Clusters. Georg Birkenheuer, André Brinkmann. 537-541
- Investigating Scenario-Conscious Asynchronous Rendezvous over RDMA. Judicael A. Zounmevo, Ahmad Afsahi. 542-546
- Implementing High Performance Remote Method Invocation in CCA. Jian Yin, Khushbu Agarwal, Manoj Krishnan, Daniel G. Chavarría-Miranda, Ian Gorton, Tom Epperly. 547-551
- Predictive and Distributed Routing Balancing for High Speed Interconnection Networks. Carlos Nunez Castillo, Diego Lugones, Daniel Franco, Emilio Luque. 552-556
- Improving MapReduce Performance via Heterogeneity-Load-Aware Partition Function. Huifeng Sun, Junliang Chen, Chuanchang Liu, Zibin Zheng, Nan Yu, Zhi Yang. 557-560
- Scalability of Semi-implicit Time Integrators for Nonhydrostatic Galerkin-Based Atmospheric Models on Large Scale Clusters. James F. Kelly, Frank X. Giraldo, Gabriele Jost. 561-565
- Performance Behavior Prediction Scheme for Shared-Memory Parallel Applications. John Corredor, Juan Carlos Moure, Dolores Rexachs, Daniel Franco, Emilio Luque. 566-569
- Performance Optimization of Data Structures Using Memory Access Characterization. Ashay Rane, James Browne. 570-574
- Experimental and Numerical Study of the Effect of Geometric Parameters on Liquid Single-Phase Pressure Drop in Micro-Scale Pin-Fin Arrays. Valerie Pezzullo, Steven Voinier. 575-579
- Data Partitioning on Heterogeneous Multicore Platforms. Ziming Zhong, Vladimir Rychkov, Alexey L. Lastovetsky. 580-584
- Frequent Itemset Mining on Large-Scale Shared Memory Machines. Yan Zhang, Fan Zhang, Jason D. Bakos. 585-589
- GPApriori: GPU-Accelerated Frequent Itemset Mining. Fan Zhang, Yan Zhang, Jason D. Bakos. 590-594
- An Energy-Efficient Scheme for Cloud Resource Provisioning Based on CloudSim. Yuxiang Shi, Xiaohong Jiang, Kejiang Ye. 595-599
- Performance of a Virtual Cluster in a General-Purpose Teaching Laboratory. Eric Johnson, Patrick Garrity, Timothy Yates, Richard A. Brown. 600-604
- Datamation: A Quarter of a Century and Four Orders of Magnitude Later. Paolo Bertasi, Michele Bonazza, Marco Bressan, Enoch Peserico. 605-609