DAFTA: Distributed Architecture for Fusion-Transformer training Acceleration

Shailesh Shankar Deshpande, Shruti Kunde, Ravi Singh, Chaman Banolia, Rekha Singhal, Balamurlidhar P.. DAFTA: Distributed Architecture for Fusion-Transformer training Acceleration. In Sven Groppe, Le Gruenwald, Ching-Hsien Hsu, editors, Proceedings of the International Workshop on Big Data in Emergent Distributed Environments, BiDEDE 2023, Seattle, WA, USA, 18 June 2023. ACM, 2023. [doi]

Abstract

Abstract is missing.