- PreDatA - preparatory data analytics on peta-scale machines. Fang Zheng, Hasan Abbasi, Ciprian Docan, Jay F. Lofstead, Qing Liu, Scott Klasky, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Matthew Wolf. 1-12
- A scheduling framework for large-scale, parallel, and topology-aware applications. Valentin Kravtsov, Pavel Bar, David Carmeli, Assaf Schuster, Martin T. Swain. 1-12
- Fisheye lens distortion correction on multicore and hardware accelerator platforms. Konstantis Daloukas, Christos D. Antonopoulos, Nikolaos Bellas, Sek M. Chai. 1-10
- A local, distributed constant-factor approximation algorithm for the dynamic facility location problem. Bastian Degener, Barbara Kempkes, Peter Pietrzyk. 1-10
- Chip multiprocessor architecture: A programmability-driven approach. Kunle Olukotun. 1
- A hybrid Interest Management mechanism for peer-to-peer Networked Virtual Environments. Ke Pan, Wentong Cai, Xueyan Tang, Suiping Zhou, Stephen John Turner. 1-12
- Operating system resource management. Burton Smith. 1
- Improving the performance of hypervisor-based fault tolerance. Jun Zhu, Wei Dong, Zhefu Jiang, Xiaogang Shi, Zhen Xiao, Xiaoming Li. 1-10
- Runtime checking of serializability in software transactional memory. Arnab Sinha, Sharad Malik. 1-12
- Scalable failure recovery for high-performance data aggregation. Dorian C. Arnold, Barton P. Miller. 1-11
- Hypergraph-based task-bundle scheduling towards efficiency and fairness in heterogeneous distributed systems. Han Zhao, Xinxin Liu, Xiaolin Li. 1-12
- Varying bandwidth resource allocation problem with bag constraints. Venkatesan T. Chakaravarthy, Vinayaka Pandit, Yogish Sabharwal, Deva P. Seetharam. 1-10
- Analyzing the soft error resilience of linear solvers on multicore multiprocessors. Konrad Malkowski, Padma Raghavan, Mahmut T. Kandemir. 1-12
- A dynamic approach for characterizing collusion in desktop grids. Louis-Claude Canon, Emmanuel Jeannot, Jon B. Weissman. 1-12
- Efficient parallel algorithms for maximum-density segment problem. Xue Wang, Fasheng Qiu, Sushil K. Prasad, Guantao Chen. 1-9
- Linpack evaluation on a supercomputer with heterogeneous accelerators. Toshio Endo, Akira Nukada, Satoshi Matsuoka, Naoya Maruyama. 1-8
- Large-scale multi-dimensional document clustering on GPU clusters. Yongpeng Zhang, Frank Mueller, Xiaohui Cui, Thomas E. Potok. 1-10
- A general algorithm for detecting faults under the comparison diagnosis model. Iain A. Stewart. 1-9
- Optimization of applications with non-blocking neighborhood collectives via multisends on the Blue Gene/P supercomputer. Sameer Kumar, Philip Heidelberger, Dong Chen, Michael Hines. 1-11
- Broadcasting on large scale heterogeneous platforms under the bounded multi-port model. Olivier Beaumont, Lionel Eyraud-Dubois, Shailesh Kumar Agrawal. 1-11
- Sparse power-efficient topologies for wireless ad hoc sensor networks. Amitabha Bagchi. 1-10
- Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime. King Tin Lam, Yang Luo, Cho-Li Wang. 1-11
- Direct self-consistent field computations on GPU clusters. Guochun Shi, Volodymyr V. Kindratenko, Ivan S. Ufimtsev, Todd J. Martinez. 1-8
- QR factorization of tall and skinny matrices in a grid computing environment. Emmanuel Agullo, Camille Coti, Jack Dongarra, Thomas Hérault, Julien Langou. 1-11
- Offline library adaptation using automatically generated heuristics. Frédéric de Mesmay, Yevgen Voronenko, Markus Püschel. 1-10
- HPDA: A hybrid parity-based disk array for enhanced performance and reliability. Bo Mao, Hong Jiang, Dan Feng, Suzhen Wu, Jianxi Chen, Lingfang Zeng, Lei Tian. 1-12
- Exploiting the forgiving nature of applications for scalable parallel execution. Jiayuan Meng, Anand Raghunathan, Srimat T. Chakradhar, Surendra Byna. 1-12
- ADEPT scalability predictor in support of adaptive resource allocation. Arash Deshmeh, Jacob Machina, Angela C. Sodan. 1-12
- Optimization of linked list prefix computations on multithreaded GPUs using CUDA. Zheng Wei, Joseph JáJá. 1-8
- Algorithmic mechanisms for internet-based master-worker computing with untrusted and selfish workers. Antonio Fernández Anta, Chryssis Georgiou, Miguel A. Mosteiro. 1-11
- MapReduce programming with Apache Hadoop. Milind A. Bhandarkar. 1
- A multi-source label-correcting algorithm for the all-pairs shortest paths problem. Hiroki Yanagisawa. 1-10
- Stabilizing pipelines for streaming applications. Andrew Berns, Anurag Dasgupta, Sukumar Ghosh. 1-9
- Structuring the execution of OpenMP applications for multicore architectures. François Broquedis, Olivier Aumage, Brice Goglin, Samuel Thibault, Pierre-André Wacrenier, Raymond Namyst. 1-10
- Exploiting set-level non-uniformity of capacity demand to enhance CMP cooperative caching. Dongyuan Zhan, Hong Jiang, Sharad C. Seth. 1-10
- Hybrid MPI/OpenMP power-aware computing. Dong Li, Bronis R. de Supinski, Martin Schulz, Kirk W. Cameron, Dimitrios S. Nikolopoulos. 1-12
- Dynamic fractional resource scheduling for HPC workloads. Mark Stillwell, Frédéric Vivien, Henri Casanova. 1-12
- MMT: Exploiting fine-grained parallelism in dynamic memory management. Devesh Tiwari, Sanghoon Lee, James Tuck, Yan Solihin. 1-12
- Robust control-theoretic thermal balancing for server clusters. Yong Fu, Chenyang Lu, Hongan Wang. 1-11
- Parallel computation of best connections in public transportation networks. Daniel Delling, Bastian Katz, Thomas Pajor. 1-12
- Oblivious algorithms for multicores and network of processors. Rezaul Alam Chowdhury, Francesco Silvestri, Brandon Blakeley, Vijaya Ramachandran. 1-12
- Speculative execution on multi-GPU systems. Gregory F. Diamos, Sudhakar Yalamanchili. 1-12
- A parallel architecture for meaning comparison. Suneil Mohan, Amitava Biswas, Aalap Tripathy, Jagannath Panigrahy, Rabi N. Mahapatra. 1-10
- Dynamically tuned push-relabel algorithm for the maximum flow problem on CPU-GPU-Hybrid platforms. Zhengyu He, Bo Hong. 1-10
- Performance and energy optimization of concurrent pipelined applications. Anne Benoit, Paul Renaud-Goud, Yves Robert. 1-12
- An auto-tuning framework for parallel multicore stencil computations. Shoaib Kamil, Cy Chan, Leonid Oliker, John Shalf, Samuel Williams. 1-12
- Identifying ad-hoc synchronization for enhanced race detection. Ali Jannesari, Walter F. Tichy. 1-10
- Parallel I/O performance: From events to ensembles. Andrew Uselton, Mark Howison, Nicholas J. Wright, David Skinner, Noel Keen, John Shalf, Karen L. Karavanic, Leonid Oliker. 1-11
- Locality-aware adaptive grain signatures for Transactional Memories. Woojin Choi, Jeff Draper. 1-10
- DynTile: Parametric tiled loop generation for parallel execution on multicore processors. Albert Hartono, Muthu Manikandan Baskaran, J. Ramanujam, Ponnuswamy Sadayappan. 1-12
- Parallelization of DQMC simulation for strongly correlated electron systems. Che-Rung Lee, I-Hsin Chung, Zhaojun Bai. 1-9
- Distributive waveband assignment in multi-granular optical networks. Yang Wang, Xiaojun Cao. 1-9
- Highly scalable parallel sorting. Edgar Solomonik, Laxmikant V. Kalé. 1-12
- Exploiting inter-thread temporal locality for chip multithreading. Jiayuan Meng, Jeremy W. Sheaffer, Kevin Skadron. 1-12
- A simple thermal model for multi-core processors and its application to slack allocation. Zhe Wang, Sanjay Ranka. 1-11
- GenerOS: An asymmetric operating system kernel for multi-core systems. Qingbo Yuan, Jianbo Zhao, Mingyu Chen, Ninghui Sun. 1-10
- Implementing the Himeno benchmark with CUDA on GPU clusters. Everett H. Phillips, Massimiliano Fatica. 1-10
- QoS aware BiNoC architecture. Shih-Hsin Lo, Ying-Cherng Lan, Hsin-Hsien Yeh, Wen-Chung Tsai, Yu Hen Hu, Sao-Jie Chen. 1-10
- SLAW: A scalable locality-aware adaptive work-stealing scheduler. Yi Guo, Jisheng Zhao, Vincent Cavé, Vivek Sarkar. 1-12
- Adapting cache partitioning algorithms to pseudo-LRU replacement policies. Kamil Kedzierski, Miquel Moretó, Francisco J. Cazorla, Mateo Valero. 1-12
- A novel application of parallel betweenness centrality to power grid contingency analysis. Shuangshuang Jin, Zhenyu Huang, Yousu Chen, Daniel G. Chavarría-Miranda, John Feo, Pak Chung Wong. 1-7
- Hierarchical phasers for scalable synchronization and reductions in dynamic parallelism. Jun Shirako, Vivek Sarkar. 1-12
- A lock-free, cache-efficient multi-core synchronization mechanism for line-rate network traffic monitoring. Patrick P. C. Lee, Tian Bu, Girish P. Chandranmenon. 1-12
- Decentralized resource management for multi-core desktop grids. Jaehwan Lee, Peter J. Keleher, Alan Sussman. 1-11
- Achieve constant performance guarantees using asynchronous crossbar scheduling without speedup. Deng Pan, Kia Makki, Niki Pissinou. 1-12
- Servet: A benchmark suite for autotuning on multicore clusters. Jorge González-Domínguez, Guillermo L. Taboada, Basilio B. Fraguela, María J. Martín, Juan Touriño. 1-9
- Midpoint routing algorithms for Delaunay triangulations. Weisheng Si, Albert Y. Zomaya. 1-7
- QoS assessment of WS-BPEL processes through non-Markovian stochastic Petri nets. Dario Bruneo, Salvatore Distefano, Francesco Longo, Marco Scarpa. 1-12
- Improving the performance of program monitors with compiler support in multi-core environment. Guojin He, Antonia Zhai. 1-12
- Dynamic load balancing on single- and multi-GPU systems. Long Chen, Oreste Villa, Sriram Krishnamoorthy, Guang R. Gao. 1-12
- High performance comparison-based sorting algorithm on many-core GPUs. Xiaochun Ye, Dongrui Fan, Wei Lin, Nan Yuan, Paolo Ienne. 1-10
- A high-performance fault-tolerant software framework for memory on commodity GPUs. Naoya Maruyama, Akira Nukada, Satoshi Matsuoka. 1-12
- Extreme scale computing: Modeling the impact of system noise in multicore clustered systems. Seetharami Seelam, Liana L. Fong, Asser N. Tantawi, John Lewars, John Divirgilio, Kevin Gildea. 1-12
- Attack-resistant frequency counting. Bo Wu, Jared Saia, Valerie King. 1-10
- Message from general chair. David A. Bader. 1-2
- DEBAR: A scalable high-performance de-duplication storage system for backup and archiving. Tianming Yang, Hong Jiang, Dan Feng, Zhongying Niu, Ke Zhou, Yaping Wan. 1-12
- Scalable multi-pipeline architecture for high performance multi-pattern string matching. Weirong Jiang, Yi-Hua Edward Yang, Viktor K. Prasanna. 1-12
- Tile QR factorization with parallel panel processing for multicore architectures. Bilel Hadri, Hatem Ltaief, Emmanuel Agullo, Jack Dongarra. 1-10
- An introductory exascale feasibility study for FFTs and multigrid. Hormozd Gahvari, William Gropp. 1-9
- eScience in the cloud: A MODIS satellite data reprojection and reduction pipeline in the Windows Azure platform. Jie Li, Marty Humphrey, Deborah A. Agarwal, Keith R. Jackson, Catharine van Ingen, Youngryel Ryu. 1-10
- Executing task graphs using work-stealing. Kunal Agrawal, Charles E. Leiserson, Jim Sukha. 1-12
- Service and resource discovery in cycle-sharing environments with a utility algebra. João Nuno Silva, Paulo Ferreira, Luís Veiga. 1-11
- Load regulating algorithm for static-priority task scheduling on multiprocessors. Risat Mahmud Pathan, Jan Jonsson. 1-12
- Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures. Aparna Chandramowlishwaran, Samuel Williams, Leonid Oliker, Ilya Lashuk, George Biros, Richard W. Vuduc. 1-12
- Algorithmic Cholesky factorization fault recovery. Douglas Hakkarinen, Zizhong Chen. 1-10
- Supporting fault tolerance in a data-intensive computing middleware. Tekin Bicer, Wei Jiang, Gagan Agrawal. 1-12
- BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications. Bogdan Nicolae, Diana Moise, Gabriel Antoniu, Luc Bougé, Matthieu Dorier. 1-11
- Using the middle tier to understand cross-tier delay in a multi-tier application. Haichuan Wang, Qiming Teng, Xiao Zhong, Peter F. Sweeney. 1-9
- Profitability-based power allocation for speculative multithreaded systems. Polychronis Xekalakis, Nikolas Ioannou, Salman Khan, Marcelo Cintra. 1-11
- Fine-grained QoS scheduling for PCM-based main memory systems. Ping Zhou, Yu Du, Youtao Zhang, Jun Yang. 1-12
- Parallel computing with CUDA. Michael Garland. 1
- First experiences with congestion control in InfiniBand hardware. Ernst Gunnar Gran, Magne Eimot, Sven-Arne Reinemo, Tor Skeie, Olav Lysne, Lars Paul Huse, Gilad Shainer. 1-12
- Head-body partitioned string matching for Deep Packet Inspection with scalable and attack-resilient performance. Yi-Hua E. Yang, Viktor K. Prasanna, Chenqian Jiang. 1-11
- On the importance of bandwidth control mechanisms for scheduling on large scale heterogeneous platforms. Olivier Beaumont, Hejer Rejeb. 1-12
- Using focused regression for accurate time-constrained scaling of scientific applications. Bradley J. Barnes, Jeonifer Garren, David K. Lowenthal, Jaxk Reeves, Bronis R. de Supinski, Martin Schulz, Barry Rountree. 1-12
- Scheduling algorithms for linear workflow optimization. Kunal Agrawal, Anne Benoit, Loic Magnan, Yves Robert. 1-12
- GPU sample sort. Nikolaj Leischner, Vitaly Osipov, Peter Sanders. 1-10
- Analyzing and adjusting user runtime estimates to improve job scheduling on the Blue Gene/P. Wei Tang, Narayan Desai, Daniel Buettner, Zhiling Lan. 1-11
- Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs. Michela Taufer, Omar Padron, Philip Saponaro, Sandeep Patel. 1-9
- A scalable algorithm for maintaining perpetual system connectivity in dynamic distributed systems. Tarun Bansal, Neeraj Mittal. 1-12
- Adapting communication-avoiding LU and QR factorizations to multicore architectures. Simplice Donfack, Laura Grigori, Alok Kumar Gupta. 1-10
- Consistency in hindsight: A fully decentralized STM algorithm. Annette Bieniusa, Thomas Fuhrmann. 1-12
- Clustering JVMs with software transactional memory support. Christos Kotselidis, Mikel Luján, Mohammad Ansari, Konstantinos Malakasis, Behram Khan, Chris C. Kirkham, Ian Watson. 1-12
- On-line detection of large-scale parallel application's structure. Germán Llort, Juan Gonzalez, Harald Servat, Judit Gimenez, Jesús Labarta. 1-10
- Evaluating standard-based self-virtualizing devices: A performance study on 10 GbE NICs with SR-IOV support. Jiuxing Liu. 1-12
- Overlays with preferences: Approximation algorithms for matching with preference lists. Giorgos Georgiadis, Marina Papatriantafilou. 1-10
- Masking I/O latency using application level I/O caching and prefetching on Blue Gene systems. Seetharami Seelam, I-Hsin Chung, John Bauer, Hui-Fang Wen. 1-12
- Parallel de novo assembly of large genomes from high-throughput short reads. Benjamin G. Jackson, Matthew Regennitter, Xiao Yang, Patrick S. Schnable, Srinivas Aluru. 1-10
- Inter-block GPU communication via fast barrier synchronization. Shucai Xiao, Wu-chun Feng. 1-12
- Parallelization of tau-leap coarse-grained Monte Carlo simulations on GPUs. Lifan Xu, Michela Taufer, Stuart Collins, Dionisios G. Vlachos. 1-9
- Contention-based georouting with guaranteed delivery, minimal communication overhead, and shorter paths in wireless sensor networks. Stefan Rührup, Ivan Stojmenovic. 1-9
- A low cost split-issue technique to improve performance of SMT clustered VLIW processors. Manoj Gupta, Fermín Sánchez, Josep Llosa. 1-12
- Parallel external memory graph algorithms. Lars Arge, Michael T. Goodrich, Nodari Sitchinava. 1-11
- Performance impact of resource contention in multicore systems. Robert Hood, Haoqiang Jin, Piyush Mehrotra, Johnny Chang, M. Jahed Djomehri, Sharad Gavali, Dennis C. Jespersen, Kenichi Taylor, Rupak Biswas. 1-12
- Balls into non-uniform bins. Petra Berenbrink, André Brinkmann, Tom Friedetzky, Lars Nagel. 1-10
- Toward understanding heterogeneity in computing. Arnold L. Rosenberg, Ron C. Chiang. 1-10
- Object-oriented stream programming using aspects. Mingliang Wang, Manish Parashar. 1-11
- Engineering a scalable high quality graph partitioner. Manuel Holtgrewe, Peter Sanders, Christian Schulz. 1-12
- A cost-effective strategy for intermediate data storage in scientific cloud workflow systems. Dong Yuan, Yun Yang, Xiao Liu, Jinjun Chen. 1-12
- Message from the program chair. Cynthia A. Phillips. 1-2
- Improving the performance of Uintah: A large-scale adaptive meshing computational framework. Justin Luitjens, Martin Berzins. 1-10
- Analysis of durability in replicated distributed storage systems. Sriram Ramabhadran, Joseph Pasquale. 1-12
- Performance evaluation of concurrent collections on high-performance multicore computing systems. Aparna Chandramowlishwaran, Kathleen Knobe, Richard W. Vuduc. 1-12
- Unconventional wisdom in multicore computing. Richard W. Vuduc. 1
- Message from steering co-chairs. Viktor K. Prasanna. 1
- Dynamic analysis of the relay cache-coherence protocol for distributed transactional memory. Bo Zhang, Binoy Ravindran. 1-11
- Distributed advance network reservation with delay guarantees. Niloofar Fazlollahi, David Starobinski. 1-12
- Oversubscription on multicore processors. Costin Iancu, Steven A. Hofmeyr, Filip Blagojevic, Yili Zheng. 1-11
- Reconciling scratch space consumption, exposure, and volatility to achieve timely staging of job input data. Henry M. Monti, Ali Raza Butt, Sudharshan S. Vazhkudai. 1-12
- KRASH: Reproducible CPU load generation on many-core machines. Swann Perarnau, Guillaume Huard. 1-10
- Optimal loop unrolling for GPGPU programs. Giridhar Sreenivasa Murthy, Mahesh Ravishankar, Muthu Manikandan Baskaran, Ponnuswamy Sadayappan. 1-11
- Intra-application cache partitioning. Sai Prashanth Muralidhara, Mahmut T. Kandemir, Padma Raghavan. 1-12
- Palacios and Kitten: New high performance operating systems for scalable virtualized and native supercomputing. John R. Lange, Kevin T. Pedretti, Trammell Hudson, Peter A. Dinda, Zheng Cui, Lei Xia, Patrick G. Bridges, Andy Gocke, Steven Jaconette, Michael Levenhagen, Ron Brightwell. 1-12
- Power-aware resource provisioning in cluster computing. Kaiqi Xiong. 1-11
- Power-aware MPI task aggregation prediction for high-end computing systems. Dong Li, Dimitrios S. Nikolopoulos, Kirk W. Cameron, Bronis R. de Supinski, Martin Schulz. 1-12