What does fault tolerant deep learning need from MPI?

Vinay Amatya, Abhinav Vishnu, Charles Siegel, Jeff Daily. What does fault tolerant deep learning need from MPI?. In Antonio J. Peña, Pavan Balaji, William Gropp, Rajeev Thakur, editors, Proceedings of the 24th European MPI Users' Group Meeting, EuroMPI/USA 2017, Chicago, IL, USA, September 25-28, 2017. ACM, 2017. [doi]

Authors

Vinay Amatya

This author has not been identified. Look up 'Vinay Amatya' in Google

Abhinav Vishnu

This author has not been identified. Look up 'Abhinav Vishnu' in Google

Charles Siegel

This author has not been identified. Look up 'Charles Siegel' in Google

Jeff Daily

This author has not been identified. Look up 'Jeff Daily' in Google