The following publications are possibly variants of this publication:
- MALMM: A multi-array architecture for large-scale matrix multiplication on FPGA. You Huang, Junzhong Shen, Yuran Qiao, Mei Wen, Chunyuan Zhang. ieiceee, 15(10):20180286, 2018. [doi]
- Domain-specific library generation for parallel software and hardware platforms. Franz Franchetti, Yevgen Voronenko, Peter A. Milder, Srinivas Chellappa, Marek R. Telgarsky, Hao Shen, Paolo D'Alberto, Frédéric de Mesmay, James C. Hoe, José M. F. Moura, Markus Püschel. ipps 2008: 1-5 [doi]
- FPGA-Based Multi-precision Architecture for Accelerating Large-Scale Floating-Point Matrix Computing. Longlong Zhang, Yuanxi Peng, Xiao Hu, Ahui Huang, Tian Tian. npc 2021: 191-202 [doi]