Improving GPU Throughput through Parallel Execution Using Tensor Cores and CUDA Cores

Khoa Ho, Hui Zhao 0013, Adwait Jog, Saraju P. Mohanty. Improving GPU Throughput through Parallel Execution Using Tensor Cores and CUDA Cores. In IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2022, Nicosia, Cyprus, July 4-6, 2022. pages 223-228, IEEE, 2022. [doi]

Authors

Khoa Ho

This author has not been identified. Look up 'Khoa Ho' in Google

Hui Zhao 0013

This author has not been identified. Look up 'Hui Zhao 0013' in Google

Adwait Jog

This author has not been identified. Look up 'Adwait Jog' in Google

Saraju P. Mohanty

This author has not been identified. Look up 'Saraju P. Mohanty' in Google