Two-stage Incremental Working Set Selection for Fast Support Vector Training on Large Datasets

IEEE International Conference on Research, Innovation and Vision for the Future


Abstract

We propose iSVM, an incremental algorithm that achieves high training speed for support vector machines (SVMs) on large datasets. Within the common decomposition framework, iSVM starts from a minimal working set (WS) and then iteratively selects one training example to update the WS in each optimization loop. iSVM processes the training data in two stages. In the first stage, the most prominent vector among randomly sampled data is added to the WS; this stage yields an approximate SVM solution. The second stage uses the intermediate solution to scan the whole training data once more and pick up the remaining support vectors (SVs). We show that iSVM is especially efficient for training SVMs in applications where the data size is much larger than the number of SVs. On the KDD-CUP 1999 network intrusion detection dataset, with nearly five million training examples, iSVM takes less than one hour to train an SVM with 94% testing accuracy, compared to seven hours with LibSVM, one of the state-of-the-art SVM implementations. We also provide analysis and experimental comparisons between iSVM and related algorithms.
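The abstract only sketches the selection loop at a high level. The Python fragment below is a rough reconstruction of that two-stage idea, not the authors' implementation: it assumes labels in {-1, +1}, uses scikit-learn's SVC as a stand-in for the working-set QP solver, and approximates "most prominent" by the strongest margin violator within a random sample under the current intermediate solution.

```python
# Minimal sketch of two-stage incremental working-set selection (assumptions noted above).
import numpy as np
from sklearn.svm import SVC

def isvm_sketch(X, y, sample_size=64, max_rounds=200, rng=None):
    rng = np.random.default_rng(rng)
    n = len(y)

    # Stage 0: start from a minimal working set (here, one example per class).
    ws = [int(np.flatnonzero(y == c)[0]) for c in np.unique(y)]
    model = SVC(kernel="rbf", C=1.0).fit(X[ws], y[ws])

    # Stage 1: grow the WS one example per loop, picking the worst margin
    # violator among a small random sample of the training data.
    for _ in range(max_rounds):
        candidates = rng.choice(n, size=min(sample_size, n), replace=False)
        margins = y[candidates] * model.decision_function(X[candidates])
        if float(margins.min()) >= 1.0:   # sampled data already consistent: stop early
            break
        worst = int(candidates[int(np.argmin(margins))])
        if worst not in ws:
            ws.append(worst)
            model = SVC(kernel="rbf", C=1.0).fit(X[ws], y[ws])

    # Stage 2: one full scan with the intermediate solution to collect the
    # remaining support-vector candidates, then retrain once on the final WS.
    margins_all = y * model.decision_function(X)
    ws = sorted(set(ws) | set(np.flatnonzero(margins_all <= 1.0).tolist()))
    return SVC(kernel="rbf", C=1.0).fit(X[ws], y[ws]), ws
```

The point of the sketch is the cost profile the abstract claims: the per-loop work in stage 1 depends on the sample size and the current WS, not on the full dataset, and the full data is touched only once in stage 2, which is why the approach pays off when the number of SVs is far smaller than the number of training examples.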

