Abstract is missing.
- Parallel programming and code selection in fortressGuy L. Steele Jr.. 1 [doi]
- Collective communication on architectures that support simultaneous communication over multiple linksErnie Chan, Robert A. van de Geijn, William Gropp, Rajeev Thakur. 2-11 [doi]
- Performance evaluation of adaptive MPIChao Huang, Gengbin Zheng, Laxmikant V. Kalé, Sameer Kumar. 12-21 [doi]
- Mobile MPI programs in computational gridsRohit Fernandes, Keshav Pingali, Paul Stodghill. 22-31 [doi]
- RDMA read based rendezvous protocol for MPI over InfiniBand: design alternatives and benefitsSayantan Sur, Hyun-Wook Jin, Lei Chai, Dhabaleswar K. Panda. 32-39 [doi]
- Global-view abstractions for user-defined reductions and scansSteven J. Deitz, David Callahan, Bradford L. Chamberlain, Lawrence Snyder. 40-47 [doi]
- Programming for parallelism and locality with hierarchically tiled arraysGanesh Bikshandi, Jia Guo, Daniel Hoeflinger, Gheorghe Almási, Basilio B. Fraguela, María Jesús Garzarán, David A. Padua, Christoph von Praun. 48-57 [doi]
- Parallel programming in modern web search enginesRaymie Stata. 58 [doi]
- Performance characterization of molecular dynamics techniques for biomolecular simulationsSadaf R. Alam, Jeffrey S. Vetter, Pratul K. Agarwal, Al Geist. 59-68 [doi]
- On-line automated performance diagnosis on thousands of processesPhilip C. Roth, Barton P. Miller. 69-80 [doi]
- A case study in top-down performance estimation for a large-scale parallel applicationIlya Sharapov, Robert Kroeger, Guy Delamarter, Razvan Cheveresan, Matthew Ramsay. 81-89 [doi]
- Hardware profile-guided automatic page placement for ccNUMA systemsJaydeep Marathe, Frank Mueller. 90-99 [doi]
- Adaptive scheduling with parallelism feedbackKunal Agrawal, Yuxiong He, Wen-Jing Hsu, Charles E. Leiserson. 100-109 [doi]
- Predicting bounds on queuing delay for batch-scheduled parallel machinesJohn Brevik, Daniel Nurmi, Richard Wolski. 110-118 [doi]
- Optimizing irregular shared-memory applications for distributed-memory systemsAyon Basumallik, Rudolf Eigenmann. 119-128 [doi]
- Proving correctness of highly-concurrent linearisable objectsViktor Vafeiadis, Maurice Herlihy, Tony Hoare, Marc Shapiro. 129-136 [doi]
- Accurate and efficient runtime detection of atomicity errors in concurrent programsLiqiang Wang, Scott D. Stoller. 137-146 [doi]
- Scalable synchronous queuesWilliam N. Scherer III, Doug Lea, Michael L. Scott. 147-156 [doi]
- POSH: a TLS compiler that exploits program structureWei Liu, James Tuck, Luis Ceze, Wonsun Ahn, Karin Strauss, Jose Renau, Josep Torrellas. 158-167 [doi]
- High-performance IPv6 forwarding algorithm for multi-core and multithreaded network processorXianghui Hu, Xinan Tang, Bei Hua. 168-177 [doi]
- MAMA! : a memory allocator for multithreaded architecturesSimon Kahan, Petr Konecny. 178-186 [doi]
- McRT-STM: a high performance software transactional memory system for a multi-core runtimeBratin Saha, Ali-Reza Adl-Tabatabai, Richard L. Hudson, Chi Cao Minh, Ben Hertzberg. 187-197 [doi]
- Exploiting distributed version concurrency in a transactional memory clusterKaloian Manassiev, Madalin Mihailescu, Cristiana Amza. 198-208 [doi]
- Hybrid transactional memorySanjeev Kumar, Michael Chu, Christopher J. Hughes, Partha Kundu, Anthony D. Nguyen. 209-220 [doi]
- Fast and transparent recovery for continuous availability of cluster-based serversRosalia Christodoulopoulou, Kaloian Manassiev, Angelos Bilas, Cristiana Amza. 221-229 [doi]
- Minimizing execution time in MPI programs on an energy-constrained, power-scalable clusterRobert Springer, David K. Lowenthal, Barry Rountree, Vincent W. Freeh. 230-238 [doi]
- Teaching parallel computing to science faculty: best practices and common pitfallsDavid A. Joiner, Paul Gray, Thomas Murphy, Charles Peck. 239-246 [doi]