Abstract is missing.
- Welcome from the Program ChairsGuilherme Ottoni, Fernando Pereira, Michel Steuwer, Jingling Xue. [doi]
- A Tensor Algebra Compiler for Sparse DifferentiationAmir Shaikhha, Mathieu Huot, Shideh Hashemian. 1-12 [doi]
- Energy-Aware Tile Size Selection for Affine Programs on GPUsMalith Jayaweera, Martin Kong, Yanzhi Wang, David R. Kaeli. 13-27 [doi]
- PolyTOPS: Reconfigurable and Flexible Polyhedral SchedulerGianpietro Consolaro, Zhen Zhang, Harenome Razanajato, Nelson Lossing, Nassim Tchoulak, Adilla Susungi, Artur Cesar Araujo Alves, Renwei Zhang, Denis Barthou, Corinne Ancourt, Cédric Bastoul. 28-40 [doi]
- AskIt: Unified Programming Interface for Programming with Large Language ModelsKatsumi Okuda, Saman P. Amarasinghe. 41-54 [doi]
- Revealing Compiler Heuristics Through Automated Discovery and OptimizationVolker Seeker, Chris Cummins, Murray Cole, Björn Franke, Kim M. Hazelwood, Hugh Leather. 55-66 [doi]
- SLaDe: A Portable Small Language Model Decompiler for Optimized AssemblyJordi Armengol-Estapé, Jackson Woodruff, Chris Cummins, Michael F. P. O'Boyle. 67-80 [doi]
- TapeFlow: Streaming Gradient Tapes in Automatic DifferentiationMilad Hakimi, Arrvindh Shriraman. 81-92 [doi]
- A Framework for Fine-Grained Synchronization of Dependent GPU KernelsAbhinav Jangda, Saeed Maleki, Maryam Mehri Dehnavi, Madan Musuvathi, Olli Saarikivi. 93-105 [doi]
- Enhancing Performance Through Control-Flow Unmerging and Loop Unrolling on GPUsAlnis Murtovi, Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, Ignacio Laguna, Bernhard Steffen. 106-118 [doi]
- Retargeting and Respecializing GPU Workloads for Performance PortabilityIvan R. Ivanov, Oleksandr Zinenko, Jens Domke, Toshio Endo, William S. Moses. 119-132 [doi]
- Seer: Predictive Runtime Kernel Selection for Irregular ProblemsRyan Swann, Muhammad Osama, Karthik Sangaiah, Jalal Mahmud. 133-142 [doi]
- AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based AcceleratorsNicolas Bohm Agostini, Jude Haris, Perry Gibson, Malith Jayaweera, Norm Rubin, Antonino Tumeo, José L. Abellán, José Cano 0001, David R. Kaeli. 143-157 [doi]
- Ecmas: Efficient Circuit Mapping and Scheduling for Surface CodeMingzheng Zhu, Hao Fu, Jun Wu, Chi Zhang, Wei Xie, Xiang-Yang Li. 158-169 [doi]
- PresCount: Effective Register Allocation for Bank Conflict ReductionXiaofeng Guan, Hao Zhou, Guoqing Bao, Handong Li, Liang Zhu, Jianguo Yao. 170-181 [doi]
- Tackling the Matrix Multiplication Micro-Kernel Generation with ExoAdrián Castelló 0001, Julian Bellavita, Grace Dinh, Yuka Ikarashi, Héctor Martínez. 182-193 [doi]
- One Automaton to Rule Them All: Beyond Multiple Regular Expressions ExecutionLuisa Cicolini, Filippo Carloni, Marco D. Santambrogio, Davide Conficconi. 193-206 [doi]
- Whose Baseline Compiler is it Anyway?Ben L. Titzer. 207-220 [doi]
- Enabling Fine-Grained Incremental Builds by Making Compiler StatefulRuobing Han, Jisheng Zhao, Hyesoon Kim. 221-232 [doi]
- Compile-Time Analysis of Compiler Frameworks for Query CompilationAlexis Engelke, Tobias Schwarz. 233-244 [doi]
- DrPy: Pinpointing Inefficient Memory Usage in Multi-Layer Python ApplicationsJinku Cui, Qidong Zhao, Yueming Hao, Xu Liu 0001. 245-257 [doi]
- SCHEMATIC: Compile-Time Checkpoint Placement and Memory Allocation for Intermittent SystemsHugo Reymond, Jean-Luc Béchennec, Mikaël Briday, Sébastien Faucou, Isabelle Puaut, Erven Rohou. 258-269 [doi]
- Latent Idiom Recognition for a Minimalist Functional Array Language Using Equality SaturationJonathan Van der Cruysse, Christophe Dubach. 270-282 [doi]
- BEC: Bit-Level Static Analysis for Reliability against Soft ErrorsYousun Ko 0001, Bernd Burgstaller. 283-295 [doi]
- Boosting the Performance of Multi-Solver IFDS Algorithms with Flow-Sensitivity OptimizationsHaofeng Li, Jie Lu 0009, Haining Meng, Liqing Cao, Lian Li 0002, Lin Gao 0002. 296-307 [doi]
- Representing Data Collections in an SSA FormTommy McMichen, Nathan Greiner, Peter Zhong, Federico Sossai, Atmn Patel, Simone Campanoni. 308-321 [doi]
- Revamping Sampling-Based PGO with Context-Sensitivity and Pseudo-instrumentationWenlei He, Hongtao Yu, Lei Wang, Taewook Oh. 322-333 [doi]
- Compiler Testing with Relaxed Memory ModelsLuke Geeson, Lee Smith. 334-348 [doi]
- High-Throughput, Formal-Methods-Assisted Fuzzing for LLVMYuyou Fan, John Regehr. 349-358 [doi]
- EasyTracker: A Python Library for Controlling and Inspecting Program ExecutionThéo Barollet, Christophe Guillon, Manuel Selva, François Broquedis, Florent Bouchez-Tichadou, Fabrice Rastello. 359-372 [doi]
- OptiWISE: Combining Sampling and Instrumentation for Granular CPI AnalysisYuxin Guo, Alex W. Chadwick, Márton Erdos, Utpal Bora, Ilias Vougioukas, Giacomo Gabrielli, Timothy M. Jones 0001. 373-385 [doi]
- EasyView: Bringing Performance Profiles into Integrated Development EnvironmentsQidong Zhao, Milind Chabbi, Xu Liu 0001. 386-398 [doi]
- Experiences Building an MLIR-Based SYCL CompilerEttore Tiotto, Victor Perez 0001, Whitney Tsang, Lukas Sommer, Julian Oppermann, Victor Lomüller, Mehdi Goli 0001, James Brodman. 399-410 [doi]
- Unveiling and Vanquishing Goroutine Leaks in Enterprise Microservices: A Dynamic Analysis ApproachGeorgian-Vlad Saioc, Dmitriy Shirchenko, Milind Chabbi. 411-422 [doi]
- A System-Level Dynamic Binary Translator Using Automatically-Learned Translation RulesJinhu Jiang, Chaoyi Liang, Rongchao Dong, Zhaohui Yang, Zhongjun Zhou, Wenwen Wang 0001, Pen-Chung Yew, Weihua Zhang. 423-434 [doi]
- Instruction Scheduling for the GPU on the GPUGhassan Shobaki, Pinar Muyan-Özçelik, Josh Hutton, Bruce Linck, Vladislav Malyshenko, Austin Kerbow, Ronaldo Ramirez-Ortega, Vahl Scott Gordon. 435-447 [doi]
- JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix MultiplicationQiang Fu, Thomas B. Rolinger, H. Howie Huang. 448-459 [doi]
- oneDNN Graph Compiler: A Hybrid Approach for High-Performance Deep Learning CompilationJianhui Li, Zhennan Qin, Yijie Mei, Jingze Cui, Yunfei Song, Ciyong Chen, Yifei Zhang, Longsheng Du, Xianhang Cheng, Baihui Jin, Yan Zhang, Jason Ye, Eric Lin, Dan Lavery. 460-470 [doi]