Fault Localization for Microservice Applications with System Logs and Monitoring Metrics

Qixun Zhang,Tong Jia,Zhonghai Wu, Qingxin Wu, Lichun Jia, Donglei Li, Yuqing Tao, Yutong Xiao

2022 7th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA)(2022)

引用 1|浏览0
暂无评分
摘要
Microservices have been widely used in enterprises due to their excellent scalability and timely update capabilities. However, while the fine-grained modularity and service orientation decrease the complexity of system development, the complexity of system operation and maintenance has been greatly increased because system faults are becoming very frequent and complex. Therefore, fault localization, that is, diagnosing fault service and its root cause, is very important yet challenging for microservice applications. One of the most challenges of fault localization is fusing multiple data sources because system faults will exhibit different features in multiple data sources. Therefore, it is necessary to fuse multiple data sources and build a unified model for fault localization. In this paper, we propose a fault localization approach with the fusion of system logs and monitoring metrics. Our approach first discovers service dependencies and then utilizes system logs and monitoring metrics to detect anomalies for each microservice. Finally, it locates the fault service and recommends the root cause system metrics based on the service dependencies and detected anomalies. The experiment results show that the average precision of our approach is ∼75%.
更多
查看译文
关键词
microservice,fault localization,logs,metrics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要