uRule: A Rule-Based Classification System for Uncertain Data

Data Mining Workshops(2010)

引用 12|浏览0
暂无评分
摘要
Data uncertainty is common in real-world applications. Various reasons lead to data uncertainty, including imprecise measurements, network latency, outdated sources and sampling errors. These kinds of uncertainties have to be handled cautiously, or else the data mining results could be unreliable or wrong. In this demo, we will show uRule, a new rule-based classification and prediction system for uncertain data. This system uses new measures for generating, pruning and optimizing classification rules. These new measures are computed considering uncertain data intervals and probability distribution functions. Based on the new measures, the optimal splitting attributes and splitting values can be identified and used in classification rules. uRule can process uncertainty in both numerical and categorical data. It has satisfactory classification performance even when data is highly uncertain.
更多
查看译文
关键词
uncertain data,new measure,rule-based classification system,satisfactory classification performance,data uncertainty,new rule-based classification,uncertain data interval,categorical data,optimizing classification rule,data mining result,classification rule,probability distribution function,classification,probabilistic logic,sampling error,classification system,knowledge based systems,measurement uncertainty,data mining,rule based,accuracy,probability,uncertainty,cancer,probability distribution
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要