Towards Using Partitioned GPU Virtual Functions for Mixture of Experts

Vignesh Chander, Tony Yi, Jerry Jiang, Vamsi Alla. Towards Using Partitioned GPU Virtual Functions for Mixture of Experts. In Silvina Caíno-Lores, Demetris Zeinalipour, Thaleia Dimitra Doudali, David E. Singh, Gracia Ester Martín Garzón, Leonel Sousa, Diego Andrade, Tommaso Cucinotta, Donato D'Ambrosio, Patrick Diehl, Manuel F. Dolz, Admela Jukan, Raffaele Montella, Matteo Nardelli, Marta Garcia-Gasulla, Sarah Neuwirth, editors, Euro-Par 2024: Parallel Processing Workshops - Euro-Par 2024 International Workshops, Madrid, Spain, August 26-30, 2024, Proceedings, Part I. Volume 15385 of Lecture Notes in Computer Science, pages 163-172, Springer, 2024.
