Architecture-Aware Models of AI Engines for High-Performance Matrix Matrix Multiplication

Elliott D. Binder, Jeffrey Low, Tze Meng Low. Architecture-Aware Models of AI Engines for High-Performance Matrix Matrix Multiplication. In Proceedings of the 54th International Conference on Parallel Processing, ICPP 2025, San Diego, CA, USA, September 8-11, 2025. pages 531-540, ACM, 2025. [doi]

Abstract

Abstract is missing.