- Performance and energy optimization of concurrent pipelined applications. Anne Benoit, Paul Renaud-Goud, Yves Robert. 1-12
- Toward understanding heterogeneity in computing. Arnold L. Rosenberg, Ron C. Chiang. 1-10
- QoS aware BiNoC architecture. Shih-Hsin Lo, Ying-Cherng Lan, Hsin-Hsien Yeh, Wen-Chung Tsai, Yu Hen Hu, Sao-Jie Chen. 1-10
- Consistency in hindsight: A fully decentralized STM algorithm. Annette Bieniusa, Thomas Fuhrmann. 1-12
- Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs. Michela Taufer, Omar Padron, Philip Saponaro, Sandeep Patel. 1-9
- HPDA: A hybrid parity-based disk array for enhanced performance and reliability. Bo Mao, Hong Jiang, Dan Feng, Suzhen Wu, Jianxi Chen, Lingfang Zeng, Lei Tian. 1-12
- Dynamic load balancing on single- and multi-GPU systems. Long Chen, Oreste Villa, Sriram Krishnamoorthy, Guang R. Gao. 1-12
- ADEPT scalability predictor in support of adaptive resource allocation. Arash Deshmeh, Jacob Machina, Angela C. Sodan. 1-12
- KRASH: Reproducible CPU load generation on many-core machines. Swann Perarnau, Guillaume Huard. 1-10
- GenerOS: An asymmetric operating system kernel for multi-core systems. Qingbo Yuan, Jianbo Zhao, Mingyu Chen, Ninghui Sun. 1-10
- Exploiting the forgiving nature of applications for scalable parallel execution. Jiayuan Meng, Anand Raghunathan, Srimat T. Chakradhar, Surendra Byna. 1-12
- Reconciling scratch space consumption, exposure, and volatility to achieve timely staging of job input data. Henry M. Monti, Ali Raza Butt, Sudharshan S. Vazhkudai. 1-12
- Scalable failure recovery for high-performance data aggregation. Dorian C. Arnold, Barton P. Miller. 1-11
- Fisheye lens distortion correction on multicore and hardware accelerator platforms. Konstantis Daloukas, Christos D. Antonopoulos, Nikolaos Bellas, Sek M. Chai. 1-10
- Parallelization of tau-leap coarse-grained Monte Carlo simulations on GPUs. Lifan Xu, Michela Taufer, Stuart Collins, Dionisios G. Vlachos. 1-9
- Sparse power-efficient topologies for wireless ad hoc sensor networks. Amitabha Bagchi. 1-10
- On-line detection of large-scale parallel application's structure. Germán Llort, Juan Gonzalez, Harald Servat, Judit Gimenez, Jesús Labarta. 1-10
- Message from the program chair. Cynthia A. Phillips. 1-2
- Distributed advance network reservation with delay guarantees. Niloofar Fazlollahi, David Starobinski. 1-12
- Analyzing and adjusting user runtime estimates to improve job scheduling on the Blue Gene/P. Wei Tang, Narayan Desai, Daniel Buettner, Zhiling Lan. 1-11
- Oversubscription on multicore processors. Costin Iancu, Steven A. Hofmeyr, Filip Blagojevic, Yili Zheng. 1-11
- Runtime checking of serializability in software transactional memory. Arnab Sinha, Sharad Malik. 1-12
- MMT: Exploiting fine-grained parallelism in dynamic memory management. Devesh Tiwari, Sanghoon Lee, James Tuck, Yan Solihin. 1-12
- QoS assessment of WS-BPEL processes through non-Markovian stochastic Petri nets. Dario Bruneo, Salvatore Distefano, Francesco Longo, Marco Scarpa. 1-12
- Analyzing the soft error resilience of linear solvers on multicore multiprocessors. Konrad Malkowski, Padma Raghavan, Mahmut T. Kandemir. 1-12
- A dynamic approach for characterizing collusion in desktop grids. Louis-Claude Canon, Emmanuel Jeannot, Jon B. Weissman. 1-12
- Unconventional wisdom in multicore computing. Richard W. Vuduc. 1
- A hybrid Interest Management mechanism for peer-to-peer Networked Virtual Environments. Ke Pan, Wentong Cai, Xueyan Tang, Suiping Zhou, Stephen John Turner. 1-12
- On the importance of bandwidth control mechanisms for scheduling on large scale heterogeneous platforms. Olivier Beaumont, Hejer Rejeb. 1-12
- Adapting cache partitioning algorithms to pseudo-LRU replacement policies. Kamil Kedzierski, Miquel Moretó, Francisco J. Cazorla, Mateo Valero. 1-12
- Algorithmic mechanisms for internet-based master-worker computing with untrusted and selfish workers. Antonio Fernández Anta, Chryssis Georgiou, Miguel A. Mosteiro. 1-11
- Locality-aware adaptive grain signatures for Transactional Memories. Woojin Choi, Jeff Draper. 1-10
- Engineering a scalable high quality graph partitioner. Manuel Holtgrewe, Peter Sanders, Christian Schulz. 1-12
- Offline library adaptation using automatically generated heuristics. Frédéric de Mesmay, Yevgen Voronenko, Markus Püschel. 1-10
- A simple thermal model for multi-core processors and its application to slack allocation. Zhe Wang, Sanjay Ranka. 1-11
- Oblivious algorithms for multicores and network of processors. Rezaul Alam Chowdhury, Francesco Silvestri, Brandon Blakeley, Vijaya Ramachandran. 1-12
- A novel application of parallel betweenness centrality to power grid contingency analysis. Shuangshuang Jin, Zhenyu Huang, Yousu Chen, Daniel G. Chavarría-Miranda, John Feo, Pak Chung Wong. 1-7
- An introductory exascale feasibility study for FFTs and multigrid. Hormozd Gahvari, William Gropp. 1-9
- Object-oriented stream programming using aspects. Mingliang Wang, Manish Parashar. 1-11
- Highly scalable parallel sorting. Edgar Solomonik, Laxmikant V. Kalé. 1-12
- Distributive waveband assignment in multi-granular optical networks. Yang Wang, Xiaojun Cao. 1-9
- A multi-source label-correcting algorithm for the all-pairs shortest paths problem. Hiroki Yanagisawa. 1-10
- Message from general chair. David A. Bader. 1-2
- Analysis of durability in replicated distributed storage systems. Sriram Ramabhadran, Joseph Pasquale. 1-12
- DEBAR: A scalable high-performance de-duplication storage system for backup and archiving. Tianming Yang, Hong Jiang, Dan Feng, Zhongying Niu, Ke Zhou, Yaping Wan. 1-12
- Speculative execution on multi-GPU systems. Gregory F. Diamos, Sudhakar Yalamanchili. 1-12
- Dynamic analysis of the relay cache-coherence protocol for distributed transactional memory. Bo Zhang, Binoy Ravindran. 1-11
- Inter-block GPU communication via fast barrier synchronization. Shucai Xiao, Wu-chun Feng. 1-12
- Masking I/O latency using application level I/O caching and prefetching on Blue Gene systems. Seetharami Seelam, I-Hsin Chung, John Bauer, Hui-Fang Wen. 1-12
- MapReduce programming with Apache Hadoop. Milind A. Bhandarkar. 1
- Algorithmic Cholesky factorization fault recovery. Douglas Hakkarinen, Zizhong Chen. 1-10
- Parallel computing with CUDA. Michael Garland. 1
- An auto-tuning framework for parallel multicore stencil computations. Shoaib Kamil, Cy Chan, Leonid Oliker, John Shalf, Samuel Williams. 1-12
- Exploiting set-level non-uniformity of capacity demand to enhance CMP cooperative caching. Dongyuan Zhan, Hong Jiang, Sharad C. Seth. 1-10
- First experiences with congestion control in InfiniBand hardware. Ernst Gunnar Gran, Magne Eimot, Sven-Arne Reinemo, Tor Skeie, Olav Lysne, Lars Paul Huse, Gilad Shainer. 1-12
- BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications. Bogdan Nicolae, Diana Moise, Gabriel Antoniu, Luc Bougé, Matthieu Dorier. 1-11
- Chip multiprocessor architecture: A programmability-driven approach. Kunle Olukotun. 1
- Achieve constant performance guarantees using asynchronous crossbar scheduling without speedup. Deng Pan, Kia Makki, Niki Pissinou. 1-12
- Balls into non-uniform bins. Petra Berenbrink, André Brinkmann, Tom Friedetzky, Lars Nagel. 1-10
- Evaluating standard-based self-virtualizing devices: A performance study on 10 GbE NICs with SR-IOV support. Jiuxing Liu. 1-12
- Improving the performance of program monitors with compiler support in multi-core environment. Guojin He, Antonia Zhai. 1-12
- Parallel I/O performance: From events to ensembles. Andrew Uselton, Mark Howison, Nicholas J. Wright, David Skinner, Noel Keen, John Shalf, Karen L. Karavanic, Leonid Oliker. 1-11
- Decentralized resource management for multi-core desktop grids. Jaehwan Lee, Peter J. Keleher, Alan Sussman. 1-11
- Efficient parallel algorithms for maximum-density segment problem. Xue Wang, Fasheng Qiu, Sushil K. Prasad, Guantao Chen. 1-9
- Direct self-consistent field computations on GPU clusters. Guochun Shi, Volodymyr V. Kindratenko, Ivan S. Ufimtsev, Todd J. Martinez. 1-8
- Servet: A benchmark suite for autotuning on multicore clusters. Jorge González-Domínguez, Guillermo L. Taboada, Basilio B. Fraguela, María J. Martín, Juan Touriño. 1-9
- Optimization of linked list prefix computations on multithreaded GPUs using CUDA. Zheng Wei, Joseph JáJá. 1-8
- A local, distributed constant-factor approximation algorithm for the dynamic facility location problem. Bastian Degener, Barbara Kempkes, Peter Pietrzyk. 1-10
- Attack-resistant frequency counting. Bo Wu, Jared Saia, Valerie King. 1-10
- Large-scale multi-dimensional document clustering on GPU clusters. Yongpeng Zhang, Frank Mueller, Xiaohui Cui, Thomas E. Potok. 1-10
- Dynamic fractional resource scheduling for HPC workloads. Mark Stillwell, Frédéric Vivien, Henri Casanova. 1-12
- Profitability-based power allocation for speculative multithreaded systems. Polychronis Xekalakis, Nikolas Ioannou, Salman Khan, Marcelo Cintra. 1-11
- A parallel architecture for meaning comparison. Suneil Mohan, Amitava Biswas, Aalap Tripathy, Jagannath Panigrahy, Rabi N. Mahapatra. 1-10
- Scalable multi-pipeline architecture for high performance multi-pattern string matching. Weirong Jiang, Yi-Hua Edward Yang, Viktor K. Prasanna. 1-12
- Using focused regression for accurate time-constrained scaling of scientific applications. Bradley J. Barnes, Jeonifer Garren, David K. Lowenthal, Jaxk Reeves, Bronis R. de Supinski, Martin Schulz, Barry Rountree. 1-12
- Power-aware MPI task aggregation prediction for high-end computing systems. Dong Li, Dimitrios S. Nikolopoulos, Kirk W. Cameron, Bronis R. de Supinski, Martin Schulz. 1-12
- Optimal loop unrolling for GPGPU programs. Giridhar Sreenivasa Murthy, Mahesh Ravishankar, Muthu Manikandan Baskaran, Ponnuswamy Sadayappan. 1-11
- Implementing the Himeno benchmark with CUDA on GPU clusters. Everett H. Phillips, Massimiliano Fatica. 1-10
- Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime. King Tin Lam, Yang Luo, Cho-Li Wang. 1-11
- Performance evaluation of concurrent collections on high-performance multicore computing systems. Aparna Chandramowlishwaran, Kathleen Knobe, Richard W. Vuduc. 1-12
- Optimization of applications with non-blocking neighborhood collectives via multisends on the Blue Gene/P supercomputer. Sameer Kumar, Philip Heidelberger, Dong Chen, Michael Hines. 1-11
- Parallel de novo assembly of large genomes from high-throughput short reads. Benjamin G. Jackson, Matthew Regennitter, Xiao Yang, Patrick S. Schnable, Srinivas Aluru. 1-10
- Clustering JVMs with software transactional memory support. Christos Kotselidis, Mikel Luján, Mohammad Ansari, Konstantinos Malakasis, Behram Khan, Chris C. Kirkham, Ian Watson. 1-12
- Identifying ad-hoc synchronization for enhanced race detection. Ali Jannesari, Walter F. Tichy. 1-10
- A general algorithm for detecting faults under the comparison diagnosis model. Iain A. Stewart. 1-9
- Extreme scale computing: Modeling the impact of system noise in multicore clustered systems. Seetharami Seelam, Liana L. Fong, Asser N. Tantawi, John Lewars, John Divirgilio, Kevin Gildea. 1-12
- Supporting fault tolerance in a data-intensive computing middleware. Tekin Bicer, Wei Jiang, Gagan Agrawal. 1-12
- Midpoint routing algorithms for Delaunay triangulations. Weisheng Si, Albert Y. Zomaya. 1-7
- Hybrid MPI/OpenMP power-aware computing. Dong Li, Bronis R. de Supinski, Martin Schulz, Kirk W. Cameron, Dimitrios S. Nikolopoulos. 1-12
- Varying bandwidth resource allocation problem with bag constraints. Venkatesan T. Chakaravarthy, Vinayaka Pandit, Yogish Sabharwal, Deva P. Seetharam. 1-10
- Parallel computation of best connections in public transportation networks. Daniel Delling, Bastian Katz, Thomas Pajor. 1-12
- Hierarchical phasers for scalable synchronization and reductions in dynamic parallelism. Jun Shirako, Vivek Sarkar. 1-12
- Linpack evaluation on a supercomputer with heterogeneous accelerators. Toshio Endo, Akira Nukada, Satoshi Matsuoka, Naoya Maruyama. 1-8
- Head-body partitioned string matching for Deep Packet Inspection with scalable and attack-resilient performance. Yi-Hua E. Yang, Viktor K. Prasanna, Chenqian Jiang. 1-11
- Performance impact of resource contention in multicore systems. Robert Hood, Haoqiang Jin, Piyush Mehrotra, Johnny Chang, M. Jahed Djomehri, Sharad Gavali, Dennis C. Jespersen, Kenichi Taylor, Rupak Biswas. 1-12
- Adapting communication-avoiding LU and QR factorizations to multicore architectures. Simplice Donfack, Laura Grigori, Alok Kumar Gupta. 1-10
- Operating system resource management. Burton Smith. 1
- Intra-application cache partitioning. Sai Prashanth Muralidhara, Mahmut T. Kandemir, Padma Raghavan. 1-12
- Hypergraph-based task-bundle scheduling towards efficiency and fairness in heterogeneous distributed systems. Han Zhao, Xinxin Liu, Xiaolin Li. 1-12
- Improving the performance of Uintah: A large-scale adaptive meshing computational framework. Justin Luitjens, Martin Berzins. 1-10
- A scalable algorithm for maintaining perpetual system connectivity in dynamic distributed systems. Tarun Bansal, Neeraj Mittal. 1-12
- Using the middle tier to understand cross-tier delay in a multi-tier application. Haichuan Wang, Qiming Teng, Xiao Zhong, Peter F. Sweeney. 1-9
- SLAW: A scalable locality-aware adaptive work-stealing scheduler. Yi Guo, Jisheng Zhao, Vincent Cavé, Vivek Sarkar. 1-12
- A low cost split-issue technique to improve performance of SMT clustered VLIW processors. Manoj Gupta, Fermín Sánchez, Josep Llosa. 1-12
- Contention-based georouting with guaranteed delivery, minimal communication overhead, and shorter paths in wireless sensor networks. Stefan Rührup, Ivan Stojmenovic. 1-9
- QR factorization of tall and skinny matrices in a grid computing environment. Emmanuel Agullo, Camille Coti, Jack Dongarra, Thomas Hérault, Julien Langou. 1-11
- High performance comparison-based sorting algorithm on many-core GPUs. Xiaochun Ye, Dongrui Fan, Wei Lin, Nan Yuan, Paolo Ienne. 1-10
- Parallelization of DQMC simulation for strongly correlated electron systems. Che-Rung Lee, I-Hsin Chung, Zhaojun Bai. 1-9
- Power-aware resource provisioning in cluster computing. Kaiqi Xiong. 1-11
- Palacios and Kitten: New high performance operating systems for scalable virtualized and native supercomputing. John R. Lange, Kevin T. Pedretti, Trammell Hudson, Peter A. Dinda, Zheng Cui, Lei Xia, Patrick G. Bridges, Andy Gocke, Steven Jaconette, Michael Levenhagen, Ron Brightwell. 1-12
- A scheduling framework for large-scale, parallel, and topology-aware applications. Valentin Kravtsov, Pavel Bar, David Carmeli, Assaf Schuster, Martin T. Swain. 1-12
- Overlays with preferences: Approximation algorithms for matching with preference lists. Giorgos Georgiadis, Marina Papatriantafilou. 1-10
- Broadcasting on large scale heterogeneous platforms under the bounded multi-port model. Olivier Beaumont, Lionel Eyraud-Dubois, Shailesh Kumar Agrawal. 1-11
- Fine-grained QoS scheduling for PCM-based main memory systems. Ping Zhou, Yu Du, Youtao Zhang, Jun Yang. 1-12
- Scheduling algorithms for linear workflow optimization. Kunal Agrawal, Anne Benoit, Loic Magnan, Yves Robert. 1-12
- Structuring the execution of OpenMP applications for multicore architectures. François Broquedis, Olivier Aumage, Brice Goglin, Samuel Thibault, Pierre-André Wacrenier, Raymond Namyst. 1-10
- Improving the performance of hypervisor-based fault tolerance. Jun Zhu, Wei Dong, Zhefu Jiang, Xiaogang Shi, Zhen Xiao, Xiaoming Li. 1-10
- A high-performance fault-tolerant software framework for memory on commodity GPUs. Naoya Maruyama, Akira Nukada, Satoshi Matsuoka. 1-12
- Exploiting inter-thread temporal locality for chip multithreading. Jiayuan Meng, Jeremy W. Sheaffer, Kevin Skadron. 1-12
- Stabilizing pipelines for streaming applications. Andrew Berns, Anurag Dasgupta, Sukumar Ghosh. 1-9
- Message from steering co-chairs. Viktor K. Prasanna. 1
- Service and resource discovery in cycle-sharing environments with a utility algebra. João Nuno Silva, Paulo Ferreira, Luís Veiga. 1-11
- Parallel external memory graph algorithms. Lars Arge, Michael T. Goodrich, Nodari Sitchinava. 1-11
- A lock-free, cache-efficient multi-core synchronization mechanism for line-rate network traffic monitoring. Patrick P. C. Lee, Tian Bu, Girish P. Chandranmenon. 1-12
- A cost-effective strategy for intermediate data storage in scientific cloud workflow systems. Dong Yuan, Yun Yang, Xiao Liu, Jinjun Chen. 1-12
- Executing task graphs using work-stealing. Kunal Agrawal, Charles E. Leiserson, Jim Sukha. 1-12
- Load regulating algorithm for static-priority task scheduling on multiprocessors. Risat Mahmud Pathan, Jan Jonsson. 1-12
- Dynamically tuned push-relabel algorithm for the maximum flow problem on CPU-GPU-Hybrid platforms. Zhengyu He, Bo Hong. 1-10
- DynTile: Parametric tiled loop generation for parallel execution on multicore processors. Albert Hartono, Muthu Manikandan Baskaran, J. Ramanujam, Ponnuswamy Sadayappan. 1-12
- eScience in the cloud: A MODIS satellite data reprojection and reduction pipeline in the Windows Azure platform. Jie Li, Marty Humphrey, Deborah A. Agarwal, Keith R. Jackson, Catharine van Ingen, Youngryel Ryu. 1-10
- PreDatA - preparatory data analytics on peta-scale machines. Fang Zheng, Hasan Abbasi, Ciprian Docan, Jay F. Lofstead, Qing Liu, Scott Klasky, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Matthew Wolf. 1-12
- Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures. Aparna Chandramowlishwaran, Samuel Williams, Leonid Oliker, Ilya Lashuk, George Biros, Richard W. Vuduc. 1-12
- GPU sample sort. Nikolaj Leischner, Vitaly Osipov, Peter Sanders. 1-10
- Robust control-theoretic thermal balancing for server clusters. Yong Fu, Chenyang Lu, Hongan Wang. 1-11
- Tile QR factorization with parallel panel processing for multicore architectures. Bilel Hadri, Hatem Ltaief, Emmanuel Agullo, Jack Dongarra. 1-10