HMDE-FS: A homogeneous distributed ensemble feature selection framework based on resampling with/without replacement.

Concurr. Comput. Pract. Exp.(2023)

引用 0|浏览6
暂无评分
摘要
Applying ensemble feature selection (EFS) models in various problems has not been actively discussed, and there has been a lack of effort to make it applicable in the situations such as distributed environments. Due to restrictions of centralized algorithms such as their poor scalability in the high dimension data and also distributed nature of some data, using the traditional centralized computing for dealing with such problems may be inevitable. This paper aims to develop a homogenous distributed ensemble feature selection (HMDE-FS) framework through a distributed resampling approach rather than a centralized one. The homogenous ensembles mainly operate along with a resampling process, so applying various methods to resampling can affect the performance of the model. Among various strategies, those with and without replacement are two of the main technique families, hence we investigated the efficiency of two well-known with/without replacement techniques: bootstrapping (BS) and cross-validation (CV) inspired method that we named crisscross (CC). The proposed HMDE-FS approaches are tested on eight datasets, and the heavy experimental results illustrate that these methods considerably reduce runtime, while classification accuracy maintains its competitiveness.
更多
查看译文
关键词
bootstrap resampling,cross validation resampling,distributed computing,feature selection,homogeneous ensemble learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要