Adaptive algorithms for diagnosing large-scale failures in computer networks

IEEE Trans. Parallel Distrib. Syst.(2015)

引用 21|浏览28
暂无评分
摘要
In this paper, we propose an algorithm to efficiently diagnose large-scale clustered failures. The algorithm, Cluster-MAX-COVERAGE (CMC), is based on greedy approach. We address the challenge of determining faults with incomplete symptoms. CMC makes novel use of both positive and negative symptoms to output a hypothesis list with a low number of false negatives and false positives quickly. CMC requires reports from about half as many nodes as other existing algorithms to determine failures with 100% accuracy. Moreover, CMC accomplishes this gain significantly faster (sometimes by two orders of magnitude) than an algorithm that matches its accuracy. Furthermore, we propose an adaptive algorithm called Adaptive-MAX-COVERAGE (AMC) that performs efficiently during both kinds of failures, i.e., independent and clustered. During a series of failues that include both independent and clustered, AMC results in a reduced number of false negatives and false positives.
更多
查看译文
关键词
fault diagnosis,large-scale cluster failure diagnosis,pattern clustering,incomplete information,large-scale failures,negative symptoms,adaptive-max-coverage,greedy approach,positive symptoms,clustered failures,false negatives,greedy algorithm,amc,cmc algorithm,large-scale clustered failure diagnosis,computer networks,greedy algorithms,large-scale systems,computer network reliability,cluster-max-coverage,cmc,workstation clusters,cluster-max-coverage algorithm,amc algorithm,adaptive-max-coverage algorithm,false positives,adaptive algorithms,clustering algorithms,network topology,accuracy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要