- Programming the cloud. James R. Larus. 1-2
- Ordered vs. unordered: a comparison of parallelism and work-efficiency in irregular algorithms. Muhammad Amber Hassaan, Martin Burtscher, Keshav Pingali. 3-12
- Programming the memory hierarchy revisited: supporting irregular parallelism in Sequoia. Michael Bauer, John Clark, Eric Schkufza, Alex Aiken. 13-24
- Compact data structure and scalable algorithms for the sparse grid technique. Alin Florindor Murarasu, Josef Weidendorfer, Gerrit Buse, Daniel Butnaru, Dirk Pflüger. 25-34
- A domain-specific approach to heterogeneous parallelism. Hassan Chafi, Arvind K. Sujeeth, Kevin J. Brown, HyoukJoong Lee, Anand R. Atreya, Kunle Olukotun. 35-46
- Copperhead: compiling an embedded data parallel language. Bryan C. Catanzaro, Michael Garland, Kurt Keutzer. 47-56
- OoOJava: software out-of-order execution. James Christopher Jenista, Yong Hun Eom, Brian Demsky. 57-68
- SpiceC: scalable parallelism via implicit copying and explicit commit. Min Feng, Rajiv Gupta, Yi Hu. 69-80
- Inferring ownership transfer for efficient message passing. Stas Negara, Rajesh K. Karmani, Gul A. Agha. 81-90
- All-window profiling and composable models of cache sharing. Xiaoya Xiang, Bin Bao, Tongxin Bai, Chen Ding, Trishul M. Chilimbi. 91-102
- ULCC: a user-level facility for optimizing shared cache performance on multicores. Xiaoning Ding, Kaibo Wang, Xiaodong Zhang. 103-112
- ScalaExtrap: trace-based communication extrapolation for SPMD programs. Xing Wu, Frank Mueller. 113-122
- How's the parallel computing revolution going? Kathryn S. McKinley. 123-124
- Thread contracts for safe parallelism. Rajesh K. Karmani, P. Madhusudan, Brandon M. Moore. 125-134
- GRace: a low-overhead mechanism for detecting data races in GPU programs. Mai Zheng, Vignesh T. Ravi, Feng Qin, Gagan Agrawal. 135-146
- Cooperative reasoning for preemptive execution. Jaeheon Yi, Caitlin Sadowski, Cormac Flanagan. 147-156
- Communicating memory transactions. Mohsen Lesani, Jens Palsberg. 157-168
- Transaction communicators: enabling cooperation among concurrent transactions. Victor Luchangco, Virendra J. Marathe. 169-178
- Lock-free and scalable multi-version software transactional memory. Sergio Miguel Fernandes, João P. Cachopo. 179-188
- Enhanced speculative parallelization via incremental recovery. Chen Tian, Changhui Lin, Min Feng, Rajiv Gupta. 189-200
- Lifeline-based global load balancing. Vijay A. Saraswat, Prabhanjan Kambadur, Sreedhar B. Kodali, David Grove, Sriram Krishnamoorthy. 201-212
- COREMU: a scalable and portable parallel full-system emulator. Zhaoguo Wang, Ran Liu, Yufei Chen, Xi Wu, Haibo Chen, Weihua Zhang, Binyu Zang. 213-222
- Wait-free queues with multiple enqueuers and dequeuers. Alex Kogan, Erez Petrank. 223-234
- The STAPL parallel container framework. Gabriel Tanase, Antal A. Buss, Adam Fidel, Harshvardhan, Ioannis Papadopoulos, Olga Pearce, Timmie G. Smith, Nathan Thomas, Xiabing Xu, Nedal Mourad, Jeremy Vu, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger. 235-246
- CSX: an extended compression format for SpMV on shared memory systems. Kornilios Kourtis, Vasileios Karakasis, Georgios I. Goumas, Nectarios Koziris. 247-256
- Auto-tuning of fast Fourier transform on graphics processors. Yuri Dotsenko, Sara S. Baghsorkhi, Brandon Lloyd, Naga K. Govindaraju. 257-266
- Accelerating CUDA graph algorithms at maximum warp. Sungpack Hong, Sang-Kyun Kim, Tayo Oguntebi, Kunle Olukotun. 267-276
- Achieving a single compute device image in OpenCL for multiple GPUs. Jungwon Kim, Honggyu Kim, Joo Hwan Lee, Jaejin Lee. 277-288
- QoS aware storage cache management in multi-server environments. Ramya Prabhakar, Shekhar Srikantaiah, Rajat Garg, Mahmut T. Kandemir. 289-290
- Weak atomicity under the x86 memory consistency model. Amitabha Roy, Steven Hand, Timothy L. Harris. 291-292
- Kremlin: like gprof, but for parallelization. Donghwan Jeon, Saturnino Garcia, Christopher M. Louie, Sravanthi Kota Venkata, Michael Bedford Taylor. 293-294
- Time skewing made simple. Robert Strzodka, Mohammed Shaheen, Dawid Pajak. 295-296
- Evaluating graph coloring on GPUs. Andre Vincent Pascal Grosset, Peihong Zhu, Shusen Liu, Suresh Venkatasubramanian, Mary W. Hall. 297-298
- Two examples of parallel programming without concurrency constructs (PP-CC). Chen Ding. 299-300
- A wait-free NCAS library for parallel applications with timing constraints. Philippe Stellwag, Fabian Scheler, Jakob Krainz, Wolfgang Schröder-Preikschat. 301-302
- Algorithm-based recovery for HPL. Teresa Davies, Zizhong Chen, Christer Karlsson, Hui Liu. 303-304
- Active pebbles: a programming model for highly parallel fine-grained data-driven computations. Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine. 305-306
- Symbolically modeling concurrent MCAPI executions. Topher Fischer, Eric Mercer, Neha Rungta. 307-308
- Automatic formal verification of MPI-based parallel programs. Stephen F. Siegel, Timothy K. Zirkel. 309-310
- SCRATCH: a tool for automatic analysis of DMA races. Alastair F. Donaldson, Daniel Kroening, Philipp Rümmer. 311-312
- Automatic safety proofs for asynchronous memory operations. Matko Botincan, Mike Dodds, Alastair F. Donaldson, Matthew J. Parkinson. 313-314