A Survey on Failure Prediction in Large-scale Computing Systems

Fei Xia, Hu Song, Long-Chuan Yan, Yan Li, Li-jun Wang. A Survey on Failure Prediction in Large-scale Computing Systems. In 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys), Haikou, Hainan, China, December 20-22, 2021. pages 2028-2033, IEEE, 2021. [doi]

Abstract

Abstract is missing.