Abstract is missing.
- Message from the General ChairJae W. Lee. [doi]
- Message from the Program ChairsMary Lou Soffa, Ayal Zaks. [doi]
- Report from the Artifact Evaluation CommitteeJubi Taneja, Michel Steuwer. [doi]
- Data Layout and Data Representation Optimizations to Reduce Data Movement KeynoteMary Hall. 1 [doi]
- MLIR: Scaling Compiler Infrastructure for Domain Specific ComputationChris Lattner, Mehdi Amini, Uday Bondhugula, Albert Cohen 0001, Andy Davis, Jacques A. Pienaar, River Riddle, Tatiana Shpeisman, Nicolas Vasilache, Oleksandr Zinenko. 2-14 [doi]
- Progressive Raising in Multi-level IRLorenzo Chelini, Andi Drebes, Oleksandr Zinenko, Albert Cohen 0001, Nicolas Vasilache, Tobias Grosser, Henk Corporaal. 15-26 [doi]
- Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUsThomas Koehler, Michel Steuwer. 27-38 [doi]
- BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++Ajay Brahmakshatriya, Saman P. Amarasinghe. 39-51 [doi]
- An Interval Compiler for Sound Floating-Point ComputationsJoao Rivera, Franz Franchetti, Markus Püschel. 52-64 [doi]
- Seamless Compiler Integration of Variable Precision Floating-Point ArithmeticTiago Trevisan Jost, Yves Durand, Christian Fabre, Albert Cohen 0001, Frédéric Pétrot. 65-76 [doi]
- UNIT: Unifying Tensorized Instruction CompilationJian Weng 0002, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki. 77-89 [doi]
- Unleashing the Low-Precision Computation Potential of Tensor Cores on GPUsGuangli Li, Jingling Xue, Lei Liu, Xueying Wang, Xiu Ma, Xiao Dong, Jiansong Li, Xiaobing Feng 0002. 90-102 [doi]
- Cinnamon: A Domain-Specific Language for Binary Profiling and MonitoringMahwish Arif, Ruoyu Zhou, Hsi-Ming Ho, Timothy M. Jones 0001. 103-114 [doi]
- GPA: A GPU Performance Advisor Based on Instruction SamplingKeren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey. 115-125 [doi]
- ELFies: Executable Region Checkpoints for Performance Analysis and SimulationHarish Patil, Alexander Isaev, Wim Heirman, Alen Sabu, Ali Hajiabadi, Trevor E. Carlson. 126-136 [doi]
- Vulkan Vision: Ray Tracing Workload Characterization using Automatic Graphics InstrumentationDavid Pankratz, Tyler Nowicki, Ahmed ElTantawy, José Nelson Amaral. 137-149 [doi]
- Loop Parallelization using Dynamic Commutativity AnalysisChristos Vasiladiotis, Roberto Castañeda Lozano, Murray Cole, Björn Franke. 150-161 [doi]
- Fine-Grained Pipeline Parallelization for Network Function ProgramsSeungbin Song, Heelim Choi, Hanjun Kim 0001. 162-173 [doi]
- YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern ArchitecturesChristie L. Alappat, Johannes Seiferth, Georg Hager, Matthias Korch, Thomas Rauber, Gerhard Wellein. 174-186 [doi]
- GoBench: A Benchmark Suite of Real-World Go Concurrency BugsTing Yuan, Guangwei Li, Jie Lu, Chen Liu, Lian Li, Jingling Xue. 187-199 [doi]
- Memory-Safe Elimination of Side ChannelsLuigi Soares, Fernando Magno Quintão Pereira. 200-210 [doi]
- Variable-Sized Blocks for Locality-Aware SpMVNaveen Namashivavam, Sanyam Mehta, Pen-Chung Yew. 211-221 [doi]
- Object Versioning for Flow-Sensitive Pointer AnalysisMohamad Barbar, Yulei Sui, Shiping Chen 0001. 222-235 [doi]
- Scaling Up the IFDS Algorithm with Efficient Disk-Assisted ComputingHaofeng Li, Haining Meng, Hengjie Zheng, Liqing Cao, Jie Lu, Lian Li 0002, Lin Gao 0002. 236-247 [doi]
- Compiling Graph Applications for GPU s with GraphItAjay Brahmakshatriya, Yunming Zhang, Changwan Hong, Shoaib Kamil, Julian Shun, Saman P. Amarasinghe. 248-261 [doi]
- Efficient Execution of Graph Algorithms on CPU with SIMD ExtensionsRuohuang Zheng, Sreepathi Pai. 262-276 [doi]
- r3d3: Optimized Query Compilation on GPUsAlexander Krolik, Clark Verbrugge, Laurie J. Hendren. 277-288 [doi]
- C-for-Metal: High Performance Simd Programming on Intel GPUsGuei-Yuan Lueh, Kaiyu Chen, Gang Chen, Joel Fuentes, Wei-Yu Chen, Fangwen Fu, Hong Jiang, Hongzheng Li, Daniel Rhee. 289-300 [doi]
- Relaxed Peephole Optimization: A Novel Compiler Optimization for Quantum CircuitsJi Liu, Luciano Bello, Huiyang Zhou. 301-314 [doi]
- StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing SystemsJohannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler. 315-326 [doi]
- Thread-Aware Area-Efficient High-Level Synthesis Compiler for Embedded DevicesChangsu Kim, Shinnung Jeong, Sungjun Cho, Yongwoo Lee, William Song, Youngsok Kim, Hanjun Kim 0001. 327-339 [doi]
- HHVM Jump-Start: Boosting Both Warmup and Steady-State Performance at ScaleGuilherme Ottoni, Bin Liu. 340-350 [doi]
- Enhancing Atomic Instruction Emulation for Cross-ISA Dynamic Binary TranslationZiyi Zhao, Zhang Jiang, Ying Chen, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew. 351-362 [doi]
- An Experience with Code-Size Optimization for Production iOS Mobile ApplicationsMilind Chabbi, Jin Lin, Raj Barik. 363-377 [doi]
- ANGHABENCH: A Suite with One Million Compilable C Benchmarks for Code-Size ReductionAnderson Faustino da Silva, Bruno Conde Kind, José Wesley de Souza Magalhães, Jerônimo Nunes Rocha, Breno Campos Ferreira Guimarães, Fernando Magno Quintão Pereira. 378-390 [doi]