Abstract is missing.
- A peta-scalable CPU-GPU algorithm for global atmospheric simulationsChao Yang, Wei Xue, Haohuan Fu, Lin Gan, Linfeng Li, Yangtong Xu, Yutong Lu, Jiachang Sun, Guangwen Yang, Weimin Zheng. 1-12 [doi]
- Adoption protocols for fanout-optimal fault-tolerant termination detectionJonathan Lifflander, Phil Miller, Laxmikant V. Kalé. 13-22 [doi]
- Array dataflow analysis for polyhedral X10 programsTomofumi Yuki, Paul Feautrier, Sanjay V. Rajopadhye, Vijay Saraswat. 23-34 [doi]
- Betweenness centrality: algorithms and implementationsDimitrios Prountzos, Keshav Pingali. 35-46 [doi]
- Compiler aided manual speculation for high performance concurrent data structuresLingxiang Xiang, Michael Lee Scott. 47-56 [doi]
- Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPUBo Wu, Zhijia Zhao, Eddy Zheng Zhang, Yunlian Jiang, Xipeng Shen. 57-68 [doi]
- Correct and efficient work-stealing for weak memory modelsNhat Minh Lê, Antoniu Pop, Albert Cohen, Francesco Zappa Nardelli. 69-80 [doi]
- Data-only flattening for nested data parallelismLars Bergstrom, Matthew Fluet, Mike Rainey, John H. Reppy, Stephen Rosen, Adam Shaw. 81-92 [doi]
- Distributed merge treesDmitriy Morozov, Gunther H. Weber. 93-102 [doi]
- Fast concurrent queues for x86 processorsAdam Morrison, Yehuda Afek. 103-112 [doi]
- FastLane: improving performance of software transactional memory for low thread countsJons-Tobias Wamhoff, Christof Fetzer, Pascal Felber, Etienne Rivière, Gilles Muller. 113-122 [doi]
- From relational verification to SIMD loop synthesisGilles Barthe, Juan Manuel Crespo, Sumit Gulwani, César Kunz, Mark Marron. 123-134 [doi]
- Ligra: a lightweight graph processing framework for shared memoryJulian Shun, Guy E. Blelloch. 135-146 [doi]
- Morph algorithms on GPUsRupesh Nasre, Martin Burtscher, Keshav Pingali. 147-156 [doi]
- NUMA-aware reader-writer locksIrina Calciu, David Dice, Yossi Lev, Victor Luchangco, Virendra J. Marathe, Nir Shavit. 157-166 [doi]
- Online-ABFT: an online algorithm based fault tolerance scheme for soft error detection in iterative methodsZizhong Chen. 167-176 [doi]
- Ownership passing: efficient distributed memory programming on multi-core systemsAndrew Friedley, Torsten Hoefler, Greg Bronevetsky, Andrew Lumsdaine, Ching-Chen Ma. 177-186 [doi]
- Parallel schedule synthesis for attribute grammarsLeo A. Meyerovich, Matthew E. Torok, Eric Atkinson, Rastislav Bodík. 187-196 [doi]
- Parallel suffix array and least common prefix for the GPUMrinal Deo, Sean Keely. 197-206 [doi]
- Scalable deterministic replay in a parallel full-system emulatorYufei Chen, Haibo Chen. 207-218 [doi]
- Scheduling parallel programs by work stealing with private dequesUmut A. Acar, Arthur Charguéraud, Mike Rainey. 219-228 [doi]
- StreamScan: fast scan algorithms for GPUs without global barrier synchronizationShengen Yan, Guoping Long, Yunquan Zhang. 229-238 [doi]
- The tasks with effects model for safe concurrencyStephen Heumann, Vikram S. Adve, Shengjie Wang. 239-250 [doi]
- TigerQuoll: parallel event-based JavaScriptDaniele Bonetta, Walter Binder, Cesare Pautasso. 251-260 [doi]
- Using hardware transactional memory to correct and simplify and readers-writer lock algorithmDave Dice, Yossi Lev, Yujie Liu, Victor Luchangco, Mark Moir. 261-270 [doi]
- ZOOMM: a parallel web browser engine for multicore mobile devicesCalin Cascaval, Seth Fowler, Pablo Montesinos-Ortego, Wayne Piekarski, Mehrdad Reshadi, Behnam Robatmili, Michael Weber 0002, Vrajesh Bhavsar. 271-280 [doi]
- Automatic problem size sensitive task partitioning on heterogeneous parallel systemsIvan Grasso, Klaus Kofler, Biagio Cosenza, Thomas Fahringer. 281-282 [doi]
- Data layout optimization for GPGPU architecturesJun Liu, Wei Ding, Ohyoung Jang, Mahmut T. Kandemir. 283-284 [doi]
- Decomposition techniques for optimal design-space exploration of streaming applicationsShobana Padmanabhan, Yixin Chen, Roger D. Chamberlain. 285-286 [doi]
- Exploring different automata representations for efficient regular expression matching on GPUsXiaodong Yu, Michela Becchi. 287-288 [doi]
- Expressing graph algorithms using generalized active messagesNick Edmonds, Jeremiah Willcock, Andrew Lumsdaine. 289-290 [doi]
- Multi-level parallel computing of reverse time migration for seismic imaging on blue Gene/QLigang Lu, Karen A. Magerlein. 291-292 [doi]
- Parallel programming with big operatorsChangHee Park, Guy L. Steele Jr., Jean-Baptiste Tristan. 293-294 [doi]
- Programming with hardware lock elisionYehuda Afek, Amir Levy, Adam Morrison. 295-296 [doi]
- RaceFree: an efficient multi-threading model for determinismKai Lu, Xu Zhou, Xiaoping Wang, Wenzhe Zhang, Gen Li 0002. 297-298 [doi]
- Reducing contention through priority updatesJulian Shun, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons. 299-300 [doi]
- Relational algorithms for multi-bulk-synchronous processorsGregory Frederick Diamos, Haicheng Wu, Jin Wang, Ashwin Lele, Sudhakar Yalamanchili. 301-302 [doi]
- Runtime elision of transactional barriers for captured memoryFernando Miguel Carvalho, João P. Cachopo. 303-304 [doi]
- Scalable data race detection for partitioned global address space programsChang-Seo Park, Koushik Sen, Costin Iancu. 305-306 [doi]
- Scalable statistics countersDave Dice, Yossi Lev, Mark Moir. 307-308 [doi]
- Swift/T: scalable data flow programming for many-task applicationsJustin M. Wozniak, Timothy G. Armstrong, Michael Wilde, Daniel S. Katz, Ewing L. Lusk, Ian T. Foster. 309-310 [doi]
- TeamWork: synchronizing threads globally to detect real deadlocks for multithreaded programsYan Cai, Ke Zhai, Shangru Wu, Wing Kwong Chan. 311-312 [doi]
- Towards an energy estimator for fault tolerance protocolsMohammed el Mehdi Diouri, Olivier Glück, Laurent Lefèvre, Franck Cappello. 313-314 [doi]
- Work-stealing with configurable scheduling strategiesMartin Wimmer 0003, Daniel Cederman, Jesper Larsson Träff, Philippas Tsigas. 315-316 [doi]
- WuKong: effective diagnosis of bugs at large system scalesBowen Zhou, Milind Kulkarni, Saurabh Bagchi. 317-318 [doi]