Label-Less: A Semi-automatic Labeling Tool for KPI Anomalies
IEEE International Conference Computer and Communications (2019)
Abstract
KPI (Key Performance Indicator) anomaly detection is critical for Internet-based services to ensure quality and reliability. However, existing algorithms perform far from satisfactorily in practice because there is not enough KPI anomaly data to train and evaluate them. In this paper, we argue that labeling overhead is the main hurdle to obtaining such datasets. We therefore propose a semi-automatic labeling tool called Label-Less, which minimizes labeling overhead in order to enable an ImageNet-like large-scale KPI anomaly dataset with high-quality ground truth. One novel technique in Label-Less is robust and rapid anomaly similarity search, which saves operators from scanning long KPIs back and forth to find abnormal patterns or check label consistency. In our evaluations using 30 real KPIs from a large Internet company, our anomaly similarity search achieves the best F-score of 0.95 on average, with a real-time per-KPI response time (less than 0.5 seconds). Overall, feedback from deployment in practice shows that Label-Less can reduce operators' labeling overhead by more than 90%.
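To make the idea of anomaly similarity search concrete, the following is a minimal illustrative sketch, not the method used in Label-Less (the abstract does not specify the algorithm). It assumes a simple z-normalized sliding-window Euclidean distance; the function names `znorm` and `similar_segments` and the parameter `top_k` are hypothetical. Given one labeled anomaly segment as a template, it returns the positions in a KPI whose shape is closest to that template.

```python
import math

def znorm(xs):
    """Z-normalize a window so that matching depends on shape, not scale."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    if var == 0:
        return [0.0] * n  # flat window: no shape information
    std = math.sqrt(var)
    return [(x - mean) / std for x in xs]

def similar_segments(kpi, template, top_k=3):
    """Return up to top_k (start_index, distance) pairs, closest first.

    Assumption: similarity is z-normalized Euclidean distance over a
    sliding window; matches are kept non-overlapping.
    """
    m = len(template)
    t = znorm(template)
    dists = []
    for i in range(len(kpi) - m + 1):
        w = znorm(kpi[i:i + m])
        dists.append(math.sqrt(sum((a - b) ** 2 for a, b in zip(w, t))))
    chosen = []
    for i in sorted(range(len(dists)), key=lambda j: dists[j]):
        # skip candidates that overlap an already selected match
        if all(abs(i - j) >= m for j in chosen):
            chosen.append(i)
        if len(chosen) == top_k:
            break
    return [(i, dists[i]) for i in chosen]

# Hypothetical data: one labeled spike and a scaled copy of it elsewhere.
template = [0, 5, 10, 5, 0]
kpi = [0.0] * 50
kpi[10:15] = template
kpi[30:35] = [2 * v for v in template]
matches = similar_segments(kpi, template, top_k=2)
```

In this toy series the two returned start indices are the labeled spike and its scaled copy, which is the kind of candidate list an operator could confirm or reject instead of scanning the whole KPI by hand.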
Keywords
Anomaly detection, Labeling, Key performance indicator, Time series analysis, Time factors, Companies, Forestry