Improving GPU Throughput through Parallel Execution Using Tensor Cores and CUDA Cores

Khoa Ho, Hui Zhao 0013, Adwait Jog, Saraju P. Mohanty. Improving GPU Throughput through Parallel Execution Using Tensor Cores and CUDA Cores. In IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2022, Nicosia, Cyprus, July 4-6, 2022. pages 223-228, IEEE, 2022. [doi]

Abstract

Abstract is missing.