Peng Huang, Chuanxiong Guo, Jacob R. Lorch, Lidong Zhou, Yingnong Dang. Capturing and Enhancing In Situ System Observability for Failure Detection. In Andrea C. Arpaci-Dusseau, Geoff Voelker, editors, 13th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2018, Carlsbad, CA, USA, October 8-10, 2018. pages 1-16, USENIX Association, 2018. [doi]
Abstract is missing.