Communication Efficient Construction Of Decision Trees Over Heterogeneously Distributed Data

ICDM '04: Proceedings of the Fourth IEEE International Conference on Data Mining(2004)

引用 32|浏览31
暂无评分
摘要
We present an algorithm designed to efficiently construct a decision tree over heterogeneously distributed data without centralizing. We compare our algorithm against a standard centralized decision tree implementation in terms of accuracy as well as the communication complexity. Our experimental results show that by using only 20% of the communication cost necessary to centralize the data we can achieve trees with accuracy at least 80% of the trees produced by the centralized version.
更多
查看译文
关键词
decision trees,distributed data mining,random projection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要