首页> 外文期刊>Journal of Computers >K-SVM: An Effective SVM Algorithm Based on K-means Clustering
【24h】

K-SVM: An Effective SVM Algorithm Based on K-means Clustering

机译:K-SVM:基于K-means聚类的有效SVM算法

获取原文
       

摘要

—Support Vector Machine (SVM) is one of the most popular and effective classification algorithms and has attracted much attention in recent years. As an important large margin classifier, SVM dedicates to find the optimal separating hyperplane between two classes, thus can give outstanding generalization ability for it. In order to find the optimal hyperplane, we commonly take most of the labeled records as our training set. However, the separating hyperplane is only determined by a few crucial samples (Support Vectors, SVs), we needn’t train SVM model on the whole training set. This paper presents a novel approach based on clustering algorithm, in which only a small subset was selected from the original training set to act as our final training set. Our algorithm works to select the most informative samples using K-means clustering algorithm, and the SVM classifier is built through training on those selected samples. Experiments show that our approach greatly reduces the scale of training set, thus effectively saves the training and predicting time of SVM, and at the same time guarantees the generalization performance.
机译:-Support向量机(SVM)是最受欢迎和最有效的分类算法之一,近年来引起了很多关注。作为一个重要的大型保证金分类器,SVM致力于找到两个类之间的最佳分离超平面,因此可以为其提供出色的泛化能力。为了找到最佳的超平面,我们通常将最大部分标记的记录作为我们的培训集。然而,分离超平面仅由几个重要的样本(支持向量,SV)决定,我们不需要在整个训练集上培训SVM模型。本文提出了一种基于聚类算法的新方法,其中仅选中了从原始培训集中选择的小型子集,以充当我们的最终培训集。我们的算法可以使用K-Means Clustering算法选择最具信息性的样本,并且SVM分类器通过对选定样本的培训构建。实验表明,我们的方法大大降低了训练集的规模,从而有效地节省了SVM的训练和预测时间,同时保证了泛化性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号