IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2021, Seoul, South Korea, February 27 - March 3, 2021 - researchr publication

researchr

You are not signed in
Sign in
Sign up

Jae W. Lee, Mary Lou Soffa, Ayal Zaks, editors, IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2021, Seoul, South Korea, February 27 - March 3, 2021. IEEE, 2021. [doi]

Conference: CGO2021

Abstract is missing.

Message from the General ChairJae W. Lee. [doi]

Message from the Program ChairsMary Lou Soffa, Ayal Zaks. [doi]

Report from the Artifact Evaluation CommitteeJubi Taneja, Michel Steuwer. [doi]

Data Layout and Data Representation Optimizations to Reduce Data Movement KeynoteMary Hall. 1 [doi]

MLIR: Scaling Compiler Infrastructure for Domain Specific ComputationChris Lattner, Mehdi Amini, Uday Bondhugula, Albert Cohen 0001, Andy Davis, Jacques A. Pienaar, River Riddle, Tatiana Shpeisman, Nicolas Vasilache, Oleksandr Zinenko. 2-14 [doi]

Progressive Raising in Multi-level IRLorenzo Chelini, Andi Drebes, Oleksandr Zinenko, Albert Cohen 0001, Nicolas Vasilache, Tobias Grosser, Henk Corporaal. 15-26 [doi]

Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUsThomas Koehler, Michel Steuwer. 27-38 [doi]

BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++Ajay Brahmakshatriya, Saman P. Amarasinghe. 39-51 [doi]

An Interval Compiler for Sound Floating-Point ComputationsJoao Rivera, Franz Franchetti, Markus Püschel. 52-64 [doi]

Seamless Compiler Integration of Variable Precision Floating-Point ArithmeticTiago Trevisan Jost, Yves Durand, Christian Fabre, Albert Cohen 0001, Frédéric Pétrot. 65-76 [doi]

UNIT: Unifying Tensorized Instruction CompilationJian Weng 0002, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki. 77-89 [doi]

Unleashing the Low-Precision Computation Potential of Tensor Cores on GPUsGuangli Li, Jingling Xue, Lei Liu, Xueying Wang, Xiu Ma, Xiao Dong, Jiansong Li, Xiaobing Feng 0002. 90-102 [doi]

Cinnamon: A Domain-Specific Language for Binary Profiling and MonitoringMahwish Arif, Ruoyu Zhou, Hsi-Ming Ho, Timothy M. Jones 0001. 103-114 [doi]

GPA: A GPU Performance Advisor Based on Instruction SamplingKeren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey. 115-125 [doi]

ELFies: Executable Region Checkpoints for Performance Analysis and SimulationHarish Patil, Alexander Isaev, Wim Heirman, Alen Sabu, Ali Hajiabadi, Trevor E. Carlson. 126-136 [doi]

Vulkan Vision: Ray Tracing Workload Characterization using Automatic Graphics InstrumentationDavid Pankratz, Tyler Nowicki, Ahmed ElTantawy, José Nelson Amaral. 137-149 [doi]

Loop Parallelization using Dynamic Commutativity AnalysisChristos Vasiladiotis, Roberto Castañeda Lozano, Murray Cole, Björn Franke. 150-161 [doi]

Fine-Grained Pipeline Parallelization for Network Function ProgramsSeungbin Song, Heelim Choi, Hanjun Kim 0001. 162-173 [doi]

YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern ArchitecturesChristie L. Alappat, Johannes Seiferth, Georg Hager, Matthias Korch, Thomas Rauber, Gerhard Wellein. 174-186 [doi]

GoBench: A Benchmark Suite of Real-World Go Concurrency BugsTing Yuan, Guangwei Li, Jie Lu, Chen Liu, Lian Li, Jingling Xue. 187-199 [doi]

Memory-Safe Elimination of Side ChannelsLuigi Soares, Fernando Magno Quintão Pereira. 200-210 [doi]

Variable-Sized Blocks for Locality-Aware SpMVNaveen Namashivavam, Sanyam Mehta, Pen-Chung Yew. 211-221 [doi]

Object Versioning for Flow-Sensitive Pointer AnalysisMohamad Barbar, Yulei Sui, Shiping Chen 0001. 222-235 [doi]

Scaling Up the IFDS Algorithm with Efficient Disk-Assisted ComputingHaofeng Li, Haining Meng, Hengjie Zheng, Liqing Cao, Jie Lu, Lian Li 0002, Lin Gao 0002. 236-247 [doi]

Compiling Graph Applications for GPU s with GraphItAjay Brahmakshatriya, Yunming Zhang, Changwan Hong, Shoaib Kamil, Julian Shun, Saman P. Amarasinghe. 248-261 [doi]

Efficient Execution of Graph Algorithms on CPU with SIMD ExtensionsRuohuang Zheng, Sreepathi Pai. 262-276 [doi]

r3d3: Optimized Query Compilation on GPUsAlexander Krolik, Clark Verbrugge, Laurie J. Hendren. 277-288 [doi]

C-for-Metal: High Performance Simd Programming on Intel GPUsGuei-Yuan Lueh, Kaiyu Chen, Gang Chen, Joel Fuentes, Wei-Yu Chen, Fangwen Fu, Hong Jiang, Hongzheng Li, Daniel Rhee. 289-300 [doi]

Relaxed Peephole Optimization: A Novel Compiler Optimization for Quantum CircuitsJi Liu, Luciano Bello, Huiyang Zhou. 301-314 [doi]

StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing SystemsJohannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler. 315-326 [doi]

Thread-Aware Area-Efficient High-Level Synthesis Compiler for Embedded DevicesChangsu Kim, Shinnung Jeong, Sungjun Cho, Yongwoo Lee, William Song, Youngsok Kim, Hanjun Kim 0001. 327-339 [doi]

HHVM Jump-Start: Boosting Both Warmup and Steady-State Performance at ScaleGuilherme Ottoni, Bin Liu. 340-350 [doi]

Enhancing Atomic Instruction Emulation for Cross-ISA Dynamic Binary TranslationZiyi Zhao, Zhang Jiang, Ying Chen, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew. 351-362 [doi]

An Experience with Code-Size Optimization for Production iOS Mobile ApplicationsMilind Chabbi, Jin Lin, Raj Barik. 363-377 [doi]

ANGHABENCH: A Suite with One Million Compilable C Benchmarks for Code-Size ReductionAnderson Faustino da Silva, Bruno Conde Kind, José Wesley de Souza Magalhães, Jerônimo Nunes Rocha, Breno Campos Ferreira Guimarães, Fernando Magno Quintão Pereira. 378-390 [doi]

runs on WebDSL