gpuRF and gpuERT: Efficient and Scalable GPU Algorithms for Decision Tree Ensembles

Parallel & Distributed Processing Symposium Workshops(2014)

引用 24|浏览2
暂无评分
摘要
We present two new parallel implementations of the ensemble learning methods Random Forests (RF) and Extremely Randomized Trees (ERT), called gpuRF and gpuERT, for emerging many-core platforms, e.g., contemporary graphics cards suitable for general-purpose computing (GPGPU). RF and ERT are two ensemble methods for generating predictive models that are of high importance within machine learning. They operate by constructing a multitude of decision trees at training time and outputting a prediction by comparing the outputs of the individual trees. Thanks to the inherent parallelism of the task, an obvious platform for its computation is to employ contemporary GPUs with a large number of processing cores. Previous parallel algorithms for RF in the literature are either designed for traditional multi-core CPU platforms or early history GPUs with simpler architecture and relatively few cores. For ERT, only briefly sketched parallelization attempts exist in the literature. The new parallel algorithms are designed for contemporary GPUs with a large number of cores and take into account aspects of the newer hardware architectures, such as memory hierarchy and thread scheduling. They are implemented using the C/C++ language and the CUDA interface to attain the best possible performance on NVidia-based GPUs. An experimental study comparing the most important previous solutions for CPU and GPU platforms to the novel implementations shows significant advantages in the aspect of efficiency for the latter, often with several orders of magnitude.
更多
查看译文
关键词
decision trees,graphics processing units,learning (artificial intelligence),mathematics computing,multi-threading,parallel algorithms,parallel architectures,C/C++ language,CUDA interface,GPGPU,GPU algorithms,NVidia-based GPUs,contemporary graphics cards,decision tree ensembles,decision trees,ensemble learning methods,ensemble methods,extremely randomized trees,general-purpose computing,gpuERT,gpuRF,machine learning,many-core platforms,memory hierarchy,parallel algorithms,parallel implementations,predictive model generation,random forests,thread scheduling,CUDA,GPGPU,Machine Learning,Parallel Algorithms,Random Forest
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要