AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs

Chendi Li, Haipeng Jia, Hang Cao, Jianyu Yao, Boqian Shi, Chunyang Xiang, Jinbo Sun, Pengqi Lu, Yunquan Zhang. AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs. In 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30 - October 3, 2021. Pages 159-166, IEEE, 2021. doi:10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00034

@inproceedings{LiJCYSXSLZ21,
  title = {AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs},
  author = {Chendi Li and Haipeng Jia and Hang Cao and Jianyu Yao and Boqian Shi and Chunyang Xiang and Jinbo Sun and Pengqi Lu and Yunquan Zhang},
  year = {2021},
  doi = {10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00034},
  url = {https://doi.org/10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00034},
  researchr = {https://researchr.org/publication/LiJCYSXSLZ21},
  cites = {0},
  citedby = {0},
  pages = {159--166},
  booktitle = {2021 IEEE Intl Conf on Parallel \& Distributed Processing with Applications, Big Data \& Cloud Computing, Sustainable Computing \& Communications, Social Computing \& Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30 - October 3, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-3574-1},
}