Jan Calta, Miroslaw Malek. Formal Analysis of Fault Recovery in Self-Organizing Systems. In Proceedings of the 2009 Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing. DASC '09, pages 11-18, IEEE Computer Society, Washington, DC, USA, 2009. [doi]
The members of a self-organizing distributed system have ability to automatically organize themselves into a specific structure. The functionality of the system is achieved by collaboration of the members in this structure. Through automatic (re)organization, such a system is able to recover from various temporary faults which may disturb the established structure. In this paper, we propose a technique to identify all recoverable faults as well as to analyze fault tolerance and recovery from temporary faults by reorganization in self-organizing systems.