AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs

Chendi Li, Haipeng Jia, Hang Cao, Jianyu Yao, Boqian Shi, Chunyang Xiang, Jinbo Sun, Pengqi Lu, Yunquan Zhang. AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs. In 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, NY, USA, September 30 - October 3, 2021. Pages 159-166, IEEE, 2021.

Authors

Chendi Li

Haipeng Jia

Hang Cao

Jianyu Yao

Boqian Shi

Chunyang Xiang

Jinbo Sun

Pengqi Lu

Yunquan Zhang
