Proactive Resource Management for Failure Resilient High Performance Computing Clusters

Song Fu, Cheng-Zhong Xu. Proactive Resource Management for Failure Resilient High Performance Computing Clusters. In Proceedings of the The Forth International Conference on Availability, Reliability and Security, ARES 2009, March 16-19, 2009, Fukuoka, Japan. pages 257-264, IEEE Computer Society, 2009. [doi]

Abstract

Abstract is missing.