LogInsights - Understanding and Extracting Information from Logs for Fast Fault Classification by Weak Supervision

Suranjana Samanta, Prateeti Mohapatra, Fabian Lim, Meenakshi Madugula, Xiaotong Liu, Sarasi Lalithsena

2023 IEEE International Conference on Software Services Engineering (SSE), 2023

Abstract
In many real-world text classification applications, labeled training data is hard to come by. These tasks are often domain specific, with an input vocabulary that differs from the general-language vocabulary. In this paper, we address one such task: automating a software monitoring system in which logs are analyzed in real time. We describe a weakly supervised method that processes incoming log streams to identify the fault types they contain. We propose hand-crafted feature extraction designed specifically for classifiers that take logs as input. To keep processing time low and to generalize across log sources, we rely on a weakly supervised fault classifier in which domain knowledge is incorporated through a word embedding model built on a domain-specific corpus. Experiments on logs obtained from various applications show the efficacy of the proposed method.
Keywords
erroneous log, negative sentiment, fault categorization, domain specific embedding, domain dictionary, dependency parsing
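
The abstract does not spell out how the weakly supervised classifier uses the domain embedding, so the following is only a minimal illustrative sketch of one common weak-supervision pattern, not the authors' method: each fault category is described by a few seed words, and a log line is assigned to the category whose seed-word centroid is closest in embedding space. Everything named here (`TOY_EMBEDDINGS`, `SEED_WORDS`, `classify_fault`, the category names, and the three-dimensional vectors) is hypothetical; in practice the vectors would come from a word embedding model trained on a domain-specific log corpus.

```python
import math
import re

# Hypothetical toy embeddings (assumption): stand-ins for vectors that a
# word embedding model trained on a domain-specific corpus would provide.
TOY_EMBEDDINGS = {
    "timeout": (0.9, 0.1, 0.0),
    "unreachable": (0.8, 0.2, 0.0),
    "connection": (0.85, 0.15, 0.0),
    "disk": (0.0, 0.9, 0.1),
    "full": (0.1, 0.8, 0.1),
    "space": (0.05, 0.9, 0.05),
    "denied": (0.0, 0.1, 0.9),
    "unauthorized": (0.1, 0.0, 0.9),
    "permission": (0.05, 0.1, 0.85),
}

# Hypothetical seed words per fault category (assumption): the only
# supervision signal, so no labeled log lines are needed.
SEED_WORDS = {
    "network": ["timeout", "connection"],
    "storage": ["disk", "space"],
    "auth": ["denied", "permission"],
}


def tokenize(line):
    """Lowercase a log line and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", line.lower())


def mean_vector(words):
    """Average the embeddings of the known words; None if none are known."""
    vectors = [TOY_EMBEDDINGS[w] for w in words if w in TOY_EMBEDDINGS]
    if not vectors:
        return None
    dim = len(vectors[0])
    return tuple(sum(v[i] for v in vectors) / len(vectors) for i in range(dim))


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify_fault(line):
    """Return the fault category whose seed centroid is most similar to the
    line's mean embedding, or None when no token has an embedding."""
    line_vec = mean_vector(tokenize(line))
    if line_vec is None:
        return None
    scores = {
        category: cosine(line_vec, mean_vector(seeds))
        for category, seeds in SEED_WORDS.items()
    }
    return max(scores, key=scores.get)


if __name__ == "__main__":
    for log_line in [
        "ERROR: host unreachable after timeout",
        "Disk is full",
        "Permission denied for user",
    ]:
        print(log_line, "->", classify_fault(log_line))
```

Because the category descriptions are just word lists, covering a new log source in this sketch would only require new seed words and an embedding trained on that source's vocabulary, rather than newly labeled log lines.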