AutoScratch: ML-Optimized Cache Management for Inference-Oriented GPUs

Yaosheng Fu, Evgeny Bolotin, Aamer Jaleel, Gal Dalal, Shie Mannor, Jacob Subag, Noam Korem, Michael Behar, David W. Nellans. AutoScratch: ML-Optimized Cache Management for Inference-Oriented GPUs. In Dawn Song, Michael Carbin, Tianqi Chen, editors, Proceedings of the Sixth Conference on Machine Learning and Systems, MLSys 2023, Miami, FL, USA, June 4-8, 2023. mlsys.org, 2023.
