A software-defined tensor streaming multiprocessor for large-scale machine learning

Dennis Abts, Garrin Kimmell, Andrew C. Ling, John Kim, Matthew Boyd, Andrew Bitar, Sahil Parmar, Ibrahim Ahmed, Roberto DiCecco, David Han, John Thompson, Michael Bye, Jennifer Hwang, Jeremy Fowers, Peter Lillian, Ashwin Murthy, Elyas Mehtabuddin, Chetan Tekur, Thomas Sohmers, Kris Kang, Stephen Maresh, Jonathan Ross. A software-defined tensor streaming multiprocessor for large-scale machine learning. In Valentina Salapura, Mohamed Zahran, Fred Chong, Lingjia Tang, editors, ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18-22, 2022, pages 567-580. ACM, 2022.
