【24h】

An Effective K-means Clustering Based SVM Algorithm

机译:基于有效的K-Means聚类的SVM算法

获取原文

摘要

Support Vector Machine (SVM) is one of the most popular and effective data mining algorithms which can be used to resolve classification or regression problems, and has attracted much attention these years. SVM could find the optimal separating hyperplane between classes, which afford outstanding generalization ability with it. Usually all the labeled records are used as training set. However, the optimal separating hyperplane only depends on a few crucial samples (Support Vectors, SVs), we needn't train SVM model on the whole training set. In this paper a novel SVM model based on K-means clustering is presented, in which only a small subset of the original training set is selected to constitute the final training set, and the SVM classifier is built through training on these selected samples. This greatly decrease the scale of the training set, and effectively saves the training and predicting cost of SVM, meanwhile guarantees its generalization performance.
机译:支持向量机(SVM)是最受欢迎和有效的数据挖掘算法之一,可以用于解决分类或回归问题,并且这些年来吸引了很多关注。 SVM可以在类之间找到最佳分离超平面,这提供了出色的泛化能力。通常所有标记的记录都用作培训集。但是,最佳分离超平面仅取决于几个重要的样本(支持向量,SV),我们不需要在整个训练集上培训SVM模型。在本文中,提出了一种基于K-Means群集的新型SVM模型,其中选择了原始训练集的小子集来构成最终训练集,并且SVM分类器是通过对这些所选样本的培训构建的。这大大降低了训练集的规模,并有效地节省了SVM的培训和预测成本,同时保证其泛化性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号