Detecting Transient Bottlenecks in n-Tier Applications through Fine-Grained Analysis

Distributed Computing Systems(2013)

引用 67|浏览1
暂无评分
摘要
Identifying the location of performance bottlenecks is a non-trivial challenge when scaling n-tier applications in computing clouds. Specifically, we observed that an n-tier application may experience significant performance loss when there are transient bottlenecks in component servers. Such transient bottlenecks arise frequently at high resource utilization and often result from transient events (e.g., JVM garbage collection) in an n-tier system and bursty workloads. Because of their short lifespan (e.g., milliseconds), these transient bottlenecks are difficult to detect using current system monitoring tools with sampling at intervals of seconds or minutes. We describe a novel transient bottleneck detection method that correlates throughput (i.e., request service rate) and load (i.e., number of concurrent requests) of each server in an n-tier system at fine time granularity. Both throughput and load can be measured through passive network tracing at millisecond-level time granularity. Using correlation analysis, we can identify the transient bottlenecks at time granularities as short as 50ms. We validate our method experimentally through two case studies on transient bottlenecks caused by factors at the system software layer (e.g., JVM garbage collection) and architecture layer (e.g., Intel SpeedStep).
更多
查看译文
关键词
correlation analysis,component servers,system software layer,fine time granularity,resource utilization,passive network tracing,detecting transient bottlenecks,n-tier application,jvm garbage collection,performance evaluations,current system monitoring tool,resource allocation,n-tier applications,scalability,transient bottleneck detection,transient events,fine-grained analysis,file servers,web-facing applications,millisecond-level time granularity,system monitoring tools,novel transient bottleneck detection,computing clouds,bottleneck,cloud computing,n-tier system,architecture layer,transient bottleneck,transient event,throughput,servers,time measurement
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要