A novel method for inducing ID3 decision trees based on variable precision rough set.

ICNC(2011)

引用 2|浏览5
暂无评分
摘要
Classification is the main research target of many algorithms in data mining. Of all the algorithms, decision trees are more preferred by researchers due to their clarity and readability. ID3, as a heuristic algorithm, is fairly classic and popular in the induction of decision trees. The key of ID3 is to choose information gain as the standard for testing attributes. ID3 algorithm, however, tends to choose the attribute with more attribute values as the splitting node, and this attribute is often not the best attribute. In this paper, the improved information gain based on dependency degree of condition attributes on decision attribute is used as a heuristic for selecting the optimal splitting attribute in order to overcome above-stated shortcoming of the traditional ID3 algorithm. Experiments prove that the tree size and classification accuracy of the decision trees generated by the improved algorithm is superior to the ID3 algorithm. © 2011 IEEE.
更多
查看译文
关键词
classification,condition attribute,decision attribute,decision tree,dependency degree,id3 algorithm,variable precision rough set (vprs),data mining,decision trees,classification algorithms,rough set,rough set theory,set theory,information systems,information system,algorithm design,algorithm design and analysis,accuracy,noise,information gain,heuristic algorithm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要