KAIOPS: A Platform Solution of End-to-End Multi-Modal AIOps for AI Training at Scale

Zeying Wang, Junhong Liu, Penghao Zhang, Xiaoyang Sun, Xu Wang, Tianyu Wo, Chunming Hu, Chengru Song, Jin Ouyang, Renyu Yang. KAIOPS: A Platform Solution of End-to-End Multi-Modal AIOps for AI Training at Scale. In 40th IEEE/ACM International Conference on Automated Software Engineering, ASE 2025, Seoul, Korea, Republic of, November 16-20, 2025. pages 3192-3203, IEEE, 2025. [doi]

Abstract

Abstract is missing.