Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2015, San Francisco, CA, USA, February 07 - 11, 2015 - researchr publication

researchr

You are not signed in
Sign in
Sign up

Kunle Olukotun, Aaron Smith, Robert Hundt, Jason Mars, editors, Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2015, San Francisco, CA, USA, February 07 - 11, 2015. ACM, 2015. [doi]

Conference: CGO2015

Abstract is missing.

Improving GPGPU energy-efficiency through concurrent kernel execution and DVFSQing Jiao, Mian Lu, Huynh Phung Huynh, Tulika Mitra. 1-11 [doi]

Characterizing and enhancing global memory data coalescing on GPUsNaznin Fauzia, Louis-Noël Pouchet, P. Sadayappan. 12-22 [doi]

Automatic data placement into GPU on-chip memory resourcesChao Li, Yi Yang, Zhen Lin, Huiyang Zhou. 23-33 [doi]

A parallel abstract interpreter for JavaScriptKyle Dewey, Vineeth Kashyap, Ben Hardekopf. 34-45 [doi]

MemorySanitizer: fast detector of uninitialized memory use in C++Evgeniy Stepanov, Konstantin Serebryany. 46-55 [doi]

On performance debugging of unnecessary lock contentions on multicore processors: a replay-based approachLong Zheng, Xiaofei Liao, Bingsheng He, Song Wu, Hai Jin. 56-67 [doi]

Optimizing binary translation of dynamically generated codeByron Hawkins, Brian Demsky, Derek Bruening, Qin Zhao. 68-78 [doi]

Getting in control of your control flow with control-data isolationWilliam Arthur, Ben Mehne, Reetuparna Das, Todd M. Austin. 79-90 [doi]

Reactive tilingJithendra Srinivas, Wei Ding, Mahmut T. Kandemir. 91-102 [doi]

Branch prediction and the performance of interpreters: don't trust folkloreErven Rohou, Bharath Narasimha Swamy, André Seznec. 103-114 [doi]

Optimizing the flash-RAM energy trade-off in deeply embedded systemsJames Pallister, Kerstin Eder, Simon J. Hollis. 115-124 [doi]

EMEURO: a framework for generating multi-purpose accelerators via deep learningLawrence C. McAfee, Kunle Olukotun. 125-135 [doi]

Optimizing and auto-tuning scale-free sparse matrix-vector multiplication on Intel Xeon PhiWai Teng Tang, Ruizhe Zhao, Mian Lu, Yun Liang, Huynh Phung Huyng, Xibai Li, Rick Siow Mong Goh. 136-145 [doi]

Data provenance tracking for concurrent programsBrandon Lucia, Luis Ceze. 146-156 [doi]

Locality aware concurrent start for stencil applicationsSunil Shrestha, Guang R. Gao, Joseph Manzano, Andrès Márquez, John Feo. 157-166 [doi]

Checking correctness of code generator architecture specificationsNiranjan Hasabnis, Rui Qiao, R. Sekar. 167-178 [doi]

Snapshot-based loading-time acceleration for web applicationsJinseok Oh, Soo-Mook Moon. 179-189 [doi]

PSLP: padded SLP automatic vectorizationVasileios Porpodas, Alberto Magni, Timothy M. Jones. 190-201 [doi]

A graph-based higher-order intermediate representationRoland Leißa, Marcel Köster, Sebastian Hack. 202-212 [doi]

Scalable conditional induction variables (CIV) analysisCosmin E. Oancea, Lawrence Rauchwerger. 213-224 [doi]

Approximating flow-sensitive pointer analysis using frequent itemset miningVaivaswatha Nagaraj, R. Govindarajan. 225-234 [doi]

HELIX-UP: relaxing program semantics to unleash parallelizationSimone Campanoni, Glenn H. Holloway, Gu-Yeon Wei, David M. Brooks. 235-245 [doi]

HERMES: a fast cross-ISA binary translator with post-optimizationXiaochun Zhang, Qi Guo, Yunji Chen, Tianshi Chen, Weiwu Hu. 246-256 [doi]

Locality-centric thread scheduling for bulk-synchronous programming models on CPU architecturesHee-Seok Kim, Izzat El Hajj, John A. Stratton, Steven S. Lumetta, Wen-mei W. Hwu. 257-268 [doi]

runs on WebDSL