A 'cool' way of improving the reliability of HPC machines

Osman Sarood, Esteban Meneses, Laxmikant V. Kalé. A 'cool' way of improving the reliability of HPC machines. In William Gropp, Satoshi Matsuoka, editors, International Conference for High Performance Computing, Networking, Storage and Analysis, SC'13, Denver, CO, USA - November 17 - 21, 2013. pages 58, ACM, 2013. [doi]

Abstract

Abstract is missing.