Understanding Soft Error Sensitivity of Deep Learning Models and Frameworks through Checkpoint Alteration

Elvis Rojas, Diego PĂ©rez, Jon C. Calhoun, Leonardo Bautista-Gomez, Terry Jones, Esteban Meneses. Understanding Soft Error Sensitivity of Deep Learning Models and Frameworks through Checkpoint Alteration. In IEEE International Conference on Cluster Computing, CLUSTER 2021, Portland, OR, USA, September 7-10, 2021. pages 492-503, IEEE, 2021. [doi]

Abstract

Abstract is missing.