- Improving GPGPU energy-efficiency through concurrent kernel execution and DVFS. Qing Jiao, Mian Lu, Huynh Phung Huynh, Tulika Mitra. 1-11
- Characterizing and enhancing global memory data coalescing on GPUs. Naznin Fauzia, Louis-Noël Pouchet, P. Sadayappan. 12-22
- Automatic data placement into GPU on-chip memory resources. Chao Li, Yi Yang, Zhen Lin, Huiyang Zhou. 23-33
- A parallel abstract interpreter for JavaScript. Kyle Dewey, Vineeth Kashyap, Ben Hardekopf. 34-45
- MemorySanitizer: fast detector of uninitialized memory use in C++. Evgeniy Stepanov, Konstantin Serebryany. 46-55
- On performance debugging of unnecessary lock contentions on multicore processors: a replay-based approach. Long Zheng, Xiaofei Liao, Bingsheng He, Song Wu, Hai Jin. 56-67
- Optimizing binary translation of dynamically generated code. Byron Hawkins, Brian Demsky, Derek Bruening, Qin Zhao. 68-78
- Getting in control of your control flow with control-data isolation. William Arthur, Ben Mehne, Reetuparna Das, Todd M. Austin. 79-90
- Reactive tiling. Jithendra Srinivas, Wei Ding, Mahmut T. Kandemir. 91-102
- Branch prediction and the performance of interpreters: don't trust folklore. Erven Rohou, Bharath Narasimha Swamy, André Seznec. 103-114
- Optimizing the flash-RAM energy trade-off in deeply embedded systems. James Pallister, Kerstin Eder, Simon J. Hollis. 115-124
- EMEURO: a framework for generating multi-purpose accelerators via deep learning. Lawrence C. McAfee, Kunle Olukotun. 125-135
- Optimizing and auto-tuning scale-free sparse matrix-vector multiplication on Intel Xeon Phi. Wai Teng Tang, Ruizhe Zhao, Mian Lu, Yun Liang, Huynh Phung Huynh, Xibai Li, Rick Siow Mong Goh. 136-145
- Data provenance tracking for concurrent programs. Brandon Lucia, Luis Ceze. 146-156
- Locality aware concurrent start for stencil applications. Sunil Shrestha, Guang R. Gao, Joseph Manzano, Andrès Márquez, John Feo. 157-166
- Checking correctness of code generator architecture specifications. Niranjan Hasabnis, Rui Qiao, R. Sekar. 167-178
- Snapshot-based loading-time acceleration for web applications. Jinseok Oh, Soo-Mook Moon. 179-189
- PSLP: padded SLP automatic vectorization. Vasileios Porpodas, Alberto Magni, Timothy M. Jones. 190-201
- A graph-based higher-order intermediate representation. Roland Leißa, Marcel Köster, Sebastian Hack. 202-212
- Scalable conditional induction variables (CIV) analysis. Cosmin E. Oancea, Lawrence Rauchwerger. 213-224
- Approximating flow-sensitive pointer analysis using frequent itemset mining. Vaivaswatha Nagaraj, R. Govindarajan. 225-234
- HELIX-UP: relaxing program semantics to unleash parallelization. Simone Campanoni, Glenn H. Holloway, Gu-Yeon Wei, David M. Brooks. 235-245
- HERMES: a fast cross-ISA binary translator with post-optimization. Xiaochun Zhang, Qi Guo, Yunji Chen, Tianshi Chen, Weiwu Hu. 246-256
- Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures. Hee-Seok Kim, Izzat El Hajj, John A. Stratton, Steven S. Lumetta, Wen-mei W. Hwu. 257-268