Using classification methods to label tasks in process mining

Journal of Software Maintenance(2010)

引用 26|浏览15
暂无评分
摘要
We investigate a method designed to improve the accuracy of process mining in scenarios where the identification of task labels for log events is uncertain. Such situations are prevalent in business processes where events consist of communications between people, such as email messages. We examine how the accuracy of an independent task identifier, such as a classification or clustering engine, can be improved by examining the currently mined process model. First, a classification scheme based on identifying the keywords in each message is presented to provide an initial labeling. We then demonstrate how these labels can be refined by considering the likelihood that the event represents a particular task as obtained via an analysis of the current representation of the process model. This process is then repeated a number of times until the model is sufficiently refined. Results show that both keyword classification and the current process model analysis can be significantly effective on their own, and when combined have the potential to correct virtually all errors when noise is low (less than 20%), and can reduce the error rate by about 85% when noise is in the 30–40% range. Copyright © 2010 Crown in the right of Canada. We propose a technique for improving the accuracy of a process model in situations where errors in the task labels for log events may be present, causing the corresponding model to become highly erratic. We demonstrate how an initial labeling can be iteratively refined by considering the likelihood that each event represents a particular task, as obtained via analysis of the current representation of the process model. Labeling errors can be consequently reduced by 85% when noise is less than 40%. Copyright © 2010 Crown in the right of Canada.
更多
查看译文
关键词
workflow,process mining,bayesian classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要