Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors

Wei Sun, Ang Li, Tong Geng, Sander Stuijk, Henk Corporaal. Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors. IEEE Trans. Parallel Distrib. Syst., 34(1):246-261, 2023. doi:10.1109/TPDS.2022.3217824

@article{SunLGSC23,
  title = {Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors},
  author = {Wei Sun and Ang Li and Tong Geng and Sander Stuijk and Henk Corporaal},
  year = {2023},
  doi = {10.1109/TPDS.2022.3217824},
  url = {https://doi.org/10.1109/TPDS.2022.3217824},
  researchr = {https://researchr.org/publication/SunLGSC23},
  journal = {IEEE Trans. Parallel Distrib. Syst.},
  volume = {34},
  number = {1},
  pages = {246--261},
}