...
首页> 外文期刊>Intelligent data analysis >Fast classification for large data sets via random selection clustering and Support Vector Machines
【24h】

Fast classification for large data sets via random selection clustering and Support Vector Machines

机译:通过随机选择聚类和支持向量机对大型数据集进行快速分类

获取原文
获取原文并翻译 | 示例

摘要

Support Vector Machines (SVMs) are high-accuracy classifiers. However, normal SVM algorithms are unsuitable for classification of large data sets because of their training complexity. In this paper, we propose a novel SVM classification approach for large data sets. We first use the random selection to select a small group of training data for the first-stage SVM. Then a de-clustering technique is proposed to recover the training data for the second-stage SVM. This two-stage SVM classifier has distinctive advantages on dealing with huge data sets such as those in bioinformatics. The performance analysis is also given in this paper. Finally, we apply the proposed method on several benchmark problems. Experimental results demonstrate that this approach has good classification accuracy while the training is significantly faster than other SVM classifiers.
机译:支持向量机(SVM)是高精度分类器。但是,普通的SVM算法由于训练复杂,因此不适合用于大数据集的分类。在本文中,我们针对大数据集提出了一种新颖的SVM分类方法。我们首先使用随机选择为第一阶段SVM选择一小组训练数据。然后提出了一种去聚类技术来恢复第二阶段支持向量机的训练数据。这种两阶段的SVM分类器在处理海量数据集(如生物信息学中的数据集)方面具有显着优势。本文还对性能进行了分析。最后,我们将提出的方法应用于几个基准问题。实验结果表明,该方法具有良好的分类精度,而训练速度明显快于其他SVM分类器。

著录项

  • 来源
    《Intelligent data analysis》 |2012年第6期|897-914|共18页
  • 作者单位

    Departamento de Computation, Cinvestav-Ipn, Mexico City, Mexico;

    Departamento de Computation, Cinvestav-Ipn, Mexico City, Mexico;

    Departamento de Control Automatico, Cinvestav-Ipn, Mexico City, Mexico;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    large data set; random selection; SVM;

    机译:大数据集;随机选择;支持向量机;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号