Capturing and Enhancing In Situ System Observability for Failure Detection

Peng Huang, Chuanxiong Guo, Jacob R. Lorch, Lidong Zhou, Yingnong Dang. Capturing and Enhancing In Situ System Observability for Failure Detection. In Andrea C. Arpaci-Dusseau, Geoff Voelker, editors, 13th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2018, Carlsbad, CA, USA, October 8-10, 2018. pages 1-16, USENIX Association, 2018. [doi]

@inproceedings{HuangGLZD18,
  title = {Capturing and Enhancing In Situ System Observability for Failure Detection},
  author = {Peng Huang and Chuanxiong Guo and Jacob R. Lorch and Lidong Zhou and Yingnong Dang},
  year = {2018},
  url = {https://www.usenix.org/conference/osdi18/presentation/huang},
  researchr = {https://researchr.org/publication/HuangGLZD18},
  cites = {0},
  citedby = {0},
  pages = {1-16},
  booktitle = {13th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2018, Carlsbad, CA, USA, October 8-10, 2018},
  editor = {Andrea C. Arpaci-Dusseau and Geoff Voelker},
  publisher = {USENIX Association},
}