SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration

Jinming Zhuang, Zhuoping Yang, Shixin Ji, Heng Huang, Alex K. Jones, Jingtong Hu, Yiyu Shi 0001, Peipei Zhou 0001. SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration. In Zhiru Zhang, Andrew Putnam, editors, Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, FPGA 2024, Monterey, CA, USA, March 3-5, 2024. pages 55-66, ACM, 2024. [doi]

Authors

Jinming Zhuang

This author has not been identified. Look up 'Jinming Zhuang' in Google

Zhuoping Yang

This author has not been identified. Look up 'Zhuoping Yang' in Google

Shixin Ji

This author has not been identified. Look up 'Shixin Ji' in Google

Heng Huang

This author has not been identified. Look up 'Heng Huang' in Google

Alex K. Jones

This author has not been identified. Look up 'Alex K. Jones' in Google

Jingtong Hu

This author has not been identified. Look up 'Jingtong Hu' in Google

Yiyu Shi 0001

This author has not been identified. Look up 'Yiyu Shi 0001' in Google

Peipei Zhou 0001

This author has not been identified. Look up 'Peipei Zhou 0001' in Google