Pythia: detection, localization, and diagnosis of performance problems

Communications Magazine, IEEE(2013)

引用 18|浏览12
暂无评分
摘要
Performance problem diagnosis is a critical part of network operations in ISPs. Service providers use a combination of approaches to troubleshoot performance of their networks, such as active monitoring infrastructure and data collection (SNMP, Netflow, router logs, table dumps, etc.) along with customer trouble tickets. Some of these approaches, however, do not scale to wide area inter-domain networks due to unavailability of such data; moreover, troubleshooting is either reactive (e.g., driven by customer complaints) or (typically) automated using static thresholds. In this article, we describe the design and implementation of a system for root cause analysis and localization of performance problems in ISP networks. Our approach works with legacy monitoring infrastructure (e.g., perfSONAR deployments) and does not need specialized active probing tools or network data. Our system provides a language for network operators to define performance problem signatures, and provides near-real-time performance diagnosis and localization. We describe our deployment of Pythia in perfSONAR monitors in production networks in Georgia, covering over 250 inter-domain paths.
更多
查看译文
关键词
monitoring,performance evaluation,real-time systems,telecommunication network routing,wide area networks,ISP networks,Netflow,Pythia,SNMP,active monitoring infrastructure,active probing tools,customer complaints,customer trouble tickets,data collection,legacy monitoring infrastructure,localization,near-real-time performance diagnosis,network data,network operations,network operators,network performance,perfSONAR deployments,performance problem diagnosis,performance problem signatures,performance problems,root cause analysis,router logs,service providers,static thresholds,table dumps,wide area inter-domain networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要