Extending checksum-based ABFT to tolerate soft errors online in iterative methods

Longxiang Chen, Dingwen Tao, Panruo Wu, Zizhong Chen. Extending checksum-based ABFT to tolerate soft errors online in iterative methods. In 20th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2014, Hsinchu, Taiwan, December 16-19, 2014. pages 344-351, IEEE Computer Society, 2014. [doi]

Abstract

Abstract is missing.