- Rethinking shared-memory languages and hardware. Sarita V. Adve. 1
- An execution strategy and optimized runtime support for parallelizing irregular reductions on modern GPUs. Xin Huo, Vignesh T. Ravi, Wenjing Ma, Gagan Agrawal. 2-11
- Automatic generation of executable communication specifications from parallel applications. Xing Wu, Frank Mueller, Scott Pakin. 12-21
- Hystor: making the best use of solid state drives in high performance storage systems. Feng Chen, David A. Koufaty, Xiaodong Zhang. 22-32
- Transactional conflict decoupling and value prediction. Fuad Tabba, Andrew W. Hay, James R. Goodman. 33-42
- Multiset signatures for transactional memory. Ricardo Quislant, Eladio Gutiérrez, Oscar G. Plata, Emilio L. Zapata. 43-52
- ZEBRA: a data-centric, hybrid-policy hardware transactional memory design. J. Rubén Titos Gil, Anurag Negi, Manuel E. Acacio, José M. García, Per Stenström. 53-62
- Scalable fine-grained call path tracing. Nathan R. Tallent, John M. Mellor-Crummey, Michael Franco, Reed Landrum, Laksono Adhianto. 63-74
- Generic topology mapping strategies for large-scale parallel architectures. Torsten Hoefler, Marc Snir. 75-84
- Page placement in hybrid memory systems. Luiz E. Ramos, Eugene Gorbatov, Ricardo Bianchini. 85-95
- Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity. Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, Shu-Ping Zhang. 96-107
- SecureME: a hardware-software approach to full system security. Siddhartha Chhabra, Brian Rogers, Yan Solihin, Milos Prvulovic. 108-119
- Processing data streams with hard real-time constraints on heterogeneous systems. Uri Verner, Assaf Schuster, Mark Silberstein. 120-129
- Coordinating processor and main memory for efficient server power control. Ming Chen, Xiaorui Wang, Xue Li. 130-140
- Optimizing throughput/power trade-offs in hardware transactional memory using DVFS and intelligent scheduling. Clay Hughes, Tao Li. 141-150
- Challenges and opportunities in renewable energy and energy efficiency. Steven W. Hammond. 151
- Characterizing the impact of soft errors on iterative methods in scientific computing. Manu Shantharam, Sowmyalatha Srinivasmurthy, Padma Raghavan. 152-161
- High performance Linpack benchmark: a fault tolerant implementation without checkpointing. Teresa Davies, Christer Karlsson, Hui Liu, Chong Ding, Zizhong Chen. 162-171
- Modeling the performance of an algebraic multigrid cycle on HPC platforms. Hormozd Gahvari, Allison H. Baker, Martin Schulz, Ulrike Meier Yang, Kirk E. Jordan, William Gropp. 172-181
- Optimizing the datacenter for data-centric workloads. Stijn Polfliet, Frederick Ryckbosch, Lieven Eeckhout. 182-191
- Predictive coordination of multiple on-chip resources for chip multiprocessors. Jian Chen, Lizy Kurian John. 192-201
- An idiom-finding tool for increasing productivity of accelerators. Laura Carrington, Mustafa M. Tikir, Catherine Olschanowsky, Michael Laurenzano, Joshua Peraza, Allan Snavely, Stephen Poole. 202-212
- Performance modeling as the key to extreme scale computing. William D. Gropp. 213
- Mint: realizing CUDA performance in 3D stencil methods with annotated C. Didem Unat, Xing Cai, Scott B. Baden. 214-224
- MDR: performance model driven runtime for heterogeneous parallel platforms. Jacques A. Pienaar, Anand Raghunathan, Srimat T. Chakradhar. 225-234
- Active pebbles: parallel programming for data-driven applications. Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine. 235-244
- Automating GPU computing in MATLAB. Chun-Yu Shei, Pushkar Ratnalikar, Arun Chauhan. 245-254
- Using GPUs to compute large out-of-card FFTs. Liang Gu, Jakob Siegel, Xiaoming Li. 255-264
- Automatic SIMD vectorization of fast Fourier transforms for the Larrabee and AVX instruction sets. Daniel McFarlin, Volodymyr Arbatov, Franz Franchetti, Markus Püschel. 265-274
- Cost-effectively offering private buffers in SoCs and CMPs. Zhen Fang, Li Zhao, Ravishankar R. Iyer, Carlos Flores Fajardo, German Fabila Garcia, Seung Eun Lee, Bin Li, Steve R. King, Xiaowei Jiang, Srihari Makineni. 275-284
- A composite and scalable cache coherence protocol for large scale CMPs. Yi Xu, Yu Du, Youtao Zhang, Jun Yang. 285-294
- Controlling cache utilization of HPC applications. Swann Perarnau, Marc Tchiboukdjian, Guillaume Huard. 295-304
- Cosmic microwave background map-making at the petascale and beyond. Rajesh Sudarsan, Julian Borrill, Christopher Cantalupo, Theodore Kisner, Kamesh Madduri, Leonid Oliker, Yili Zheng, Horst Simon. 305-316
- A QHD-capable parallel H.264 decoder. Chi Ching Chi, Ben H. H. Juurlink. 317-326
- MP-PIPE: a massively parallel protein-protein interaction prediction engine. Andrew Schoenrock, Frank K. H. A. Dehne, James R. Green, Ashkan Golshani, Sylvain Pitre. 327-337
- The elephant and the mice: the role of non-strict fine-grain synchronization for modern many-core architectures. Juergen Ributzka, Yuhei Hayashi, Joseph B. Manzano, Guang R. Gao. 338-347
- F²BFLY: an on-chip free-space optical network with wavelength-switching. Jin Ouyang, Chuan Yang, Dimin Niu, Yuan Xie, Zhiwen Liu. 348-358
- Karma: scalable deterministic record-replay. Arkaprava Basu, Jayaram Bobba, Mark D. Hill. 359-368
- SRC: information retrieval as a persistent parallel service on supercomputer infrastructure. Tobias Berka, Marián Vajtersic. 369
- SRC: Damaris - using dedicated I/O cores for scalable post-petascale HPC simulations. Matthieu Dorier. 370
- SRC: FenixOS - a research operating system focused on high scalability and reliability. Stavros Passas, Sven Karlsson. 371
- SRC: soft error detection and recovery for high performance Linpack. Teresa Davies, Zizhong Chen. 372
- Poster: DVFS management in real-processors. Vasileios Spiliopoulos, Georgios Keramidas, Stefanos Kaxiras, Konstantinos Efstathiou. 373
- SRC: enabling petascale data analysis for scientific applications through data reorganization. Yuan Tian. 375
- SRC: virtual I/O caching: dynamic storage cache management for concurrent workloads. Michael R. Frasca, Ramya Prabhakar. 376
- SRC: an automatic code overlaying technique for multicores with explicitly-managed memory hierarchies. Choonki Jang. 377
- Poster: programming clusters of GPUs with OMPSs. Javier Bueno, Alejandro Duran, Xavier Martorell, Eduard Ayguadé, Rosa M. Badia, Jesús Labarta. 378
- Poster: revisiting virtual channel memory for performance and fairness on multi-core architecture. Licheng Chen, Yongbing Huang, Yungang Bao, Onur Mutlu, Guangming Tan, Mingyu Chen. 379
- Poster: implications of merging phases on scalability of multi-core architectures. Madhavan Manivannan, Ben H. H. Juurlink, Per Stenström. 380
- SRC: facilitating efficient parallelization of information storage and retrieval on large data sets. Steven Feldman. 381
- SRC: automatic extraction of SST/macro skeleton models. Amruth Rudraiah Dakshinamurthy. 382