- PInTE: Probabilistic Induction of Theft Evictions. Cesar Gomes, Xuesi Chen, Mark Hempstead. 1-13
- GRANITE: A Graph Neural Network Model for Basic Block Throughput Estimation. Ondrej Sýkora, Phitchaya Mangpo Phothilimthana, Charith Mendis, Amir Yazdanbakhsh. 14-26
- UVM Discard: Eliminating Redundant Memory Transfers for Accelerators. Weixi Zhu, Guilherme Cox, Ján Veselý, Mark Hairgrove, Alan L. Cox, Scott Rixner. 27-38
- FPChecker: Floating-Point Exception Detection Tool and Benchmark for Parallel and Distributed HPC. Ignacio Laguna, Tanmay Tirpankar, Xinyi Li, Ganesh Gopalakrishnan. 39-50
- Splash-4: A Modern Benchmark Suite with Lock-Free Constructs. Eduardo José Gómez-Hernández, Juan M. Cebrian, Stefanos Kaxiras, Alberto Ros. 51-64
- Characterizing Molecular Dynamics Simulation on Commodity Platforms. Francesco Peverelli, Davide Conficconi, Davide Basilio Bartolini, Alberto Scolari, Marco Domenico Santambrogio. 65-78
- An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks. Kiran Seshadri, Berkin Akin, James Laudon, Ravi Narayanaswami, Amir Yazdanbakhsh. 79-91
- Accelerating Transformer Networks through Recomposing Softmax Layers. Jaewan Choi, Hailong Li, Byeongho Kim, Seunghwan Hwang, Jung Ho Ahn. 92-103
- A Slice and Dice Approach to Accelerate Compound Sparse Attention on GPU. Hailong Li, Jaewan Choi, Jung Ho Ahn. 104-116
- FedGPO: Heterogeneity-Aware Global Parameter Optimization for Efficient Federated Learning. Young-geun Kim, Carole-Jean Wu. 117-129
- Bottleneck Analysis of Dynamic Graph Neural Network Inference on CPU and GPU. Hanqiu Chen, Yahya Alhinai, Yihan Jiang, Eunjee Na, Cong Hao. 130-145
- gSuite: A Flexible and Framework Independent Benchmark Suite for Graph Neural Network Inference on GPUs. Taha Tekdogan, Serkan Göktas, Ayse Yilmazer-Metin. 146-159
- Characterizing the Efficiency of Graph Neural Network Frameworks with a Magnifying Glass. Xin Huang, Jongryool Kim, Bradley Rees, Chul-Ho Lee. 160-170
- Performance Characterization of AutoNUMA Memory Tiering on Graph Analytics. Diego Moura, Daniel Mossé, Vinicius Petrucci. 171-184
- Understanding the Power of Evolutionary Computation for GPU Code Optimization. Jhe-Yu Liou, Muaaz Awan, Steven A. Hofmeyr, Stephanie Forrest, Carole-Jean Wu. 185-198
- The Implications of Page Size Management on Graph Analytics. Aninda Manocha, Zi Yan, Esin Tureci, Juan L. Aragón, David W. Nellans, Margaret Martonosi. 199-214
- Revisiting Temporal Storage I/O Behaviors of Smartphone Applications: Analysis and Synthesis. Qiang Zou, Bo Mao. 215-227
- How Far We've Come - A Characterization Study of Standalone WebAssembly Runtimes. Wenwen Wang. 228-241
- SpotLake: Diverse Spot Instance Dataset Archive Service. Sungjae Lee, Jaeil Hwang, Kyungyong Lee. 242-255
- Leaps and bounds: Analyzing WebAssembly's performance with a focus on bounds checking. Raven Szewczyk, Kimberley Stonehouse, Antonio Barbalace, Tom Spink. 256-268
- Demystifying Map Space Exploration for NPUs. Sheng-Chun Kao, Angshuman Parashar, Po-An Tsai, Tushar Krishna. 269-281
- LongTail-Bench: A Benchmark Suite for Domain-Specific Operators in Deep Learning. Xiuhong Li, Shengen Yan, Lijuan Jiang, Ping Xu, Jinming Ma, Xingcheng Zhang, Dahua Lin. 282-295
- Demystifying BERT: System Design Implications. Suchita Pati, Shaizeen Aga, Nuwan Jayasena, Matthew D. Sinclair. 296-309