Identifying the Right Replication Level to Detect and Correct Silent Errors at Scale

Anne Benoit, Aurélien Cavelan, Franck Cappello, Padma Raghavan, Yves Robert, Hongynag Sun. Identifying the Right Replication Level to Detect and Correct Silent Errors at Scale. In Proceedings of the ACM Workshop on Fault-Tolerance for HPC at Extreme Scale, FTXS@HPDC 2017, Washington, DC, USA, June, 2017. pages 31-38, ACM, 2017. [doi]

Authors

Anne Benoit

This author has not been identified. Look up 'Anne Benoit' in Google

Aurélien Cavelan

This author has not been identified. Look up 'Aurélien Cavelan' in Google

Franck Cappello

This author has not been identified. Look up 'Franck Cappello' in Google

Padma Raghavan

This author has not been identified. Look up 'Padma Raghavan' in Google

Yves Robert

This author has not been identified. Look up 'Yves Robert' in Google

Hongynag Sun

This author has not been identified. Look up 'Hongynag Sun' in Google