Hengliang Guo, Haolei Wang, Wanting Chen, Congxiang Zhang, Yubo Han, Shengguang Zhu, Dujuan Zhang, Yang Guo, Jiandong Shang, Tao Wan, Qingyang Li, Gang Wu. Optimizing sparse general matrix-matrix multiplication for DCUs. The Journal of Supercomputing, 80(14):20176-20200, September 2024. [doi]