Finding Potential Support Vectors in Separable Classification Problems

IEEE Transactions on Neural Networks and Learning Systems (2013)

Abstract
This paper considers the classification problem using support vector (SV) machines and investigates how to maximally reduce the size of the training set without losing information. Under separable data set assumptions, we derive the exact conditions stating which observations can be discarded without diminishing the overall information content. For this purpose, we introduce the concept of potential SVs, i.e., those data that can become SVs when future data become available. To complement this, we also characterize the set of discardable vectors (DVs), i.e., those data that, given the current data set, can never become SVs. These vectors are therefore useless for future training purposes and can eventually be removed without loss of information. We then provide an efficient algorithm based on linear programming that returns the potential SVs and the DVs by constructing a simplex tableau. Finally, we compare this algorithm with alternative algorithms available in the literature on some synthetic data as well as on data sets from standard repositories.
Keywords
linear programming, pattern classification, support vector machines, DV, SVM, discardable vectors, information content, potential SV concept, separable classification problem, separable data set assumptions, simplex tableau, data discardability conditions, potential support vectors, separable data sets
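
Illustrative sketch (not the paper's algorithm). The abstract does not give the exact conditions or the simplex-tableau construction, so the code below only illustrates the idea under a stated assumption: a point is treated as a potential SV when some hyperplane separates the current data with unit margin and that point lies exactly on its margin, and as a DV otherwise. Each point is tested with a separate linear-programming feasibility problem solved by `scipy.optimize.linprog`. The function name `potential_sv_mask` and the per-point loop are hypothetical choices made for this sketch.

```python
import numpy as np
from scipy.optimize import linprog


def potential_sv_mask(X, y):
    """Return a boolean mask: True = potential SV, False = discardable (DV).

    ASSUMPTION (not taken from the paper): point i is a potential SV iff
    there exist (w, b) with y_j * (w . x_j + b) >= 1 for every j and
    y_i * (w . x_i + b) == 1. Labels y must be +1 / -1 and the data
    must be linearly separable.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, d = X.shape
    # Row j encodes y_j * (x_j . w + b) in the unknowns z = (w, b).
    G = y[:, None] * np.hstack([X, np.ones((n, 1))])
    c = np.zeros(d + 1)  # zero objective: a pure feasibility problem
    mask = np.zeros(n, dtype=bool)
    for i in range(n):
        res = linprog(
            c,
            A_ub=-G, b_ub=-np.ones(n),        # G z >= 1 for all points
            A_eq=G[i:i + 1], b_eq=[1.0],      # point i sits on the margin
            bounds=[(None, None)] * (d + 1),  # w and b are free variables
            method="highs",
        )
        mask[i] = res.status == 0  # status 0: a feasible point was found
    return mask
```

This naive check solves one LP per observation, so it is only meant to make the potential SV / DV distinction concrete on small data sets; the paper's single-tableau algorithm is presumably more efficient.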