Capturing and Enhancing In Situ System Observability for Failure Detection

Peng Huang, Chuanxiong Guo, Jacob R. Lorch, Lidong Zhou, Yingnong Dang. Capturing and Enhancing In Situ System Observability for Failure Detection. In Andrea C. Arpaci-Dusseau, Geoff Voelker, editors, 13th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2018, Carlsbad, CA, USA, October 8-10, 2018. pages 1-16, USENIX Association, 2018. [doi]

Abstract

Abstract is missing.