A Metric Approach to Building Decision Trees Based on Goodman-Kruskal Association Index

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS(2004)

引用 17|浏览8
暂无评分
摘要
We introduce a numerical measure on sets of partitions of finite sets that is linked to the Goodman-Kruskal association index commonly used in statistics. This measure allows us to define a metric on such partions used for constructing decision trees. Experimental results suggest that by replacing the usual splitting criterion used in C4.5 by a metric criterion based on the Goodman-Kruskal coefficient it is possible, in most cases, to obtain smaller decision trees without sacrificing accuracy.
更多
查看译文
关键词
Goodman-Kruskal association index,metric,partition,decision tree
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要