Distributed support vector machines.

IEEE Transactions on Neural Networks (2006)

Cited by 99
Abstract
A truly distributed (as opposed to parallelized) support vector machine (SVM) algorithm is presented. Training data are assumed to come from the same distribution and to be stored locally at a number of different locations with processing capabilities (nodes). In several examples it has been found that exchanging a reasonably small amount of information among nodes yields an SVM solution that is better than the one obtained when classifiers are trained only on local data, and comparable (although slightly worse) to that of the centralized approach, in which all the training data are available in one place. We propose and analyze two distributed schemes: a "naïve" distributed chunking approach, in which raw data (support vectors) are communicated, and the more elaborate distributed semiparametric SVM, which aims to further reduce the total amount of information passed between nodes while providing a privacy-preserving mechanism for information sharing. We show the feasibility of our proposal by evaluating the performance of the algorithms on benchmarks with both synthetic and real-world datasets.
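To make the "naïve" distributed chunking scheme concrete, here is a minimal sketch of the idea the abstract describes: each node trains a local SVM, then forwards only its support vectors to another node, which appends them to its own data and retrains. The ring topology, fixed pass count, RBF kernel, and use of scikit-learn's SVC are illustrative assumptions, not the paper's exact protocol.

```python
# Hypothetical sketch of distributed chunking: nodes in a ring exchange
# only support vectors, not their full local datasets.
import numpy as np
from sklearn.svm import SVC

def distributed_chunking(node_X, node_y, passes=3, C=1.0):
    """node_X, node_y: per-node local datasets (each must contain both classes)."""
    n_nodes = len(node_X)
    n_features = node_X[0].shape[1]
    # Support vectors received from the previous node in the ring.
    recv_X = [np.empty((0, n_features)) for _ in range(n_nodes)]
    recv_y = [np.empty((0,)) for _ in range(n_nodes)]
    model = None
    for _ in range(passes):
        for i in range(n_nodes):
            # Train on local data plus whatever support vectors arrived.
            X = np.vstack([node_X[i], recv_X[i]])
            y = np.concatenate([node_y[i], recv_y[i]])
            model = SVC(kernel="rbf", C=C).fit(X, y)
            # Forward only the support vectors: this is the interchanged
            # information whose volume the paper aims to keep small.
            sv = model.support_
            nxt = (i + 1) % n_nodes
            recv_X[nxt] = X[sv]
            recv_y[nxt] = y[sv]
    return model  # the last node's model approximates the centralized SVM

# Toy usage: a linearly separable problem split across three nodes.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
parts = np.array_split(np.arange(300), 3)
model = distributed_chunking([X[p] for p in parts], [y[p] for p in parts])
```

Because only support vectors circulate, the per-round communication cost is bounded by the (typically small) number of support vectors rather than by the size of each node's dataset; the semiparametric variant in the paper reduces this further.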
Keywords
raw data, local data, centralized approach, semiparametric SVM, SVM solution, small amount, support vector, support vector machine, chunking approach, information sharing, training data, data mining, collaborative, computer networks, compact, learning (artificial intelligence), information analysis, processing capability, support vector machines, data privacy