Beyond Availability: Towards a Deeper Understanding of Machine Failure Characteristics in Large Distributed Systems

Praveen Yalagandula, Suman Nath. Beyond Availability: Towards a Deeper Understanding of Machine Failure Characteristics in Large Distributed Systems. In David E. Culler, Timothy Roscoe, editors, First USENIX Workshop on Real, Large Distributed Systems, WORLDS'04, San Francisco, CA, USA, December 5, 2004. USENIX Association, 2004. [doi]

Abstract

Abstract is missing.