tcFFT: A Fast Half-Precision FFT Library for NVIDIA Tensor Cores

Bin-Rui Li, Shenggan Cheng, James Lin. tcFFT: A Fast Half-Precision FFT Library for NVIDIA Tensor Cores. In IEEE International Conference on Cluster Computing, CLUSTER 2021, Portland, OR, USA, September 7-10, 2021. pages 1-11, IEEE, 2021. [doi]

Abstract

Abstract is missing.