首页> 外文期刊>Neurocomputing >Relative density based support vector machine
【24h】

Relative density based support vector machine

机译:基于相对密度的支持向量机

获取原文
获取原文并翻译 | 示例
           

摘要

As a support vector machine (SVM) has good generalization ability, it has been implemented in various applications. Yet in the process of resolving the mathematical model, it needs to compute the kernel matrix, the dimension of which is equal to the number of data points in the dataset, thereby costing a very high amount of memory. Some improved algorithms are proposed to extract the boundary of the dataset so that the number of data points participating in the training process decreases and the training process can be accelerated. But the prediction accuracy of most of these algorithms is so low that many support vectors are discarded. Moreover, those methods all need to perform the main computation by the kernel function in linear feature space, which increases the computational cost. In this paper, the concept "relative density" is proposed to extract the subset containing support vectors. The measure "relative density" is designed to be more meticulous so that the new method performs more precisely than existing methods. The proposed method makes use of the fact that it has good local characteristics to perform the computations in original space without having to use any kernel function. Therefore, efficiency is also improved. Furthermore, the proposed method can be used to detect noise data, by which an inseparable problem can be transformed into a separable problem so that cross validation can be avoided in various SVM algorithms. This is an advantage that none of the existing SVM methods has. Yet another advantage of this method is that it can be considered as a framework to be used in various SVM methods. This paper presents the details of the proposed accelerated algorithm, having a time complexity of O(n log n), that decreases training time significantly without decreasing prediction accuracy. The effectiveness and efficiency of the method is demonstrated through experiments on artificial and public datasets.
机译:由于支持向量机(SVM)具有良好的泛化能力,因此已在各种应用中实现。然而,在解析数学模型的过程中,它需要计算核矩阵,该核矩阵的维数等于数据集中数据点的数量,从而花费大量内存。提出了一些改进的算法来提取数据集的边界,从而减少参与训练过程的数据点的数量,并可以加速训练过程。但是大多数这些算法的预测精度都非常低,以至于许多支持向量都被丢弃了。此外,这些方法都需要通过核函数在线性特征空间中执行主要计算,这增加了计算成本。在本文中,提出了“相对密度”概念来提取包含支持向量的子集。测量“相对密度”时要更加谨慎,以便新方法比现有方法执行得更精确。所提出的方法利用以下事实:它具有良好的局部特性,可以在原始空间中执行计算,而不必使用任何内核函数。因此,效率也得到提高。此外,所提出的方法可以用于检测噪声数据,由此可以将不可分割的问题转换为可分离的问题,从而可以在各种SVM算法中避免交叉验证。这是现有SVM方法所没有的优势。该方法的另一个优点是可以将其视为要在各种SVM方法中使用的框架。本文介绍了所提出的加速算法的细节,该算法的时间复杂度为O(n log n),可显着减少训练时间而不会降低预测精度。通过在人工和公共数据集上进行的实验证明了该方法的有效性和效率。

著录项

  • 来源
    《Neurocomputing》 |2015年第ptac期|1424-1432|共9页
  • 作者单位

    College of Computer Sci ence, Chongqing University, Chongqing 400044, China;

    College of Computer Sci ence, Chongqing University, Chongqing 400044, China;

    Network Information Center, Yangtze Normal University, Chongqing 408100, China;

    Institute of Electrical Engineering and Information, Sichuan University, Chengdu 400015, China;

    College of Computer Sci ence, Chongqing University, Chongqing 400044, China,Network Information Center, Yangtze Normal University, Chongqing 408100, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Relative density; Support vector machine; Boundary;

    机译:相对密度;支持向量机;边界;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号