...
首页> 外文期刊>Neural Networks, IEEE Transactions on >An Improved Algorithm for the Solution of the Regularization Path of Support Vector Machine
【24h】

An Improved Algorithm for the Solution of the Regularization Path of Support Vector Machine

机译:支持向量机正则化路径求解的一种改进算法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper describes an improved algorithm for the numerical solution to the support vector machine (SVM) classification problem for all values of the regularization parameter $C$ . The algorithm is motivated by the work of Hastie and follows the main idea of tracking the optimality conditions of the SVM solution for ascending value of $C$ . It differs from Hastie's approach in that the tracked path is not assumed to be 1-D. Instead, a multidimensional feasible space for the optimality condition is used to solve the tracking problem. Such a treatment allows the algorithm to properly handle data sets which Hastie's approach fails. These data sets are characterized by the presence of linearly dependent points (in the kernel space), duplicate points, or nearly duplicate points. Such data sets are quite common among many real-world data, especially those with nominal features. Other contributions of this paper include a unifying formulation of the tracking process in the form of a linear programming problem, update formula for the linear programs, considerations that guard against accumulation of errors resulting from the use of incremental updates, and routines to speed up the algorithm. The algorithm is implemented under the Matlab environment and is available for download. Experiments with several data sets including data set having up to several thousand data points are reported.
机译:本文描述了一种针对正则化参数$ C $的所有值的支持向量机(SVM)分类问题数值解的改进算法。该算法受Hastie的推动,遵循的主要思想是跟踪$ C $的SVM解决方案的最优条件。它与Hastie的方法的不同之处在于,跟踪路径不假定为一维。取而代之的是,将用于最优条件的多维可行空间用于解决跟踪问题。这种处理方式使算法能够正确处理Hastie方法失败的数据集。这些数据集的特征是存在线性相关的点(在内核空间中),重复的点或几乎重复的点。这样的数据集在许多现实世界的数据中非常普遍,尤其是那些具有名义特征的数据。本文的其他贡献包括以线性编程问题的形式统一描述跟踪过程,线性程序的更新公式,防止因使用增量更新而导致的错误累积的注意事项以及加快程序运行速度的例程算法。该算法在Matlab环境下实现,可以下载。报告了几个数据集的实验,包括具有多达数千个数据点的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号