Selecting valuable training samples for SVMs via data structure analysis

Defeng Wang; Lin Shi

Abstract

Despite their salient properties and wide acceptance, support vector machines (SVMs) still face scalability difficulties, because solving the quadratic programming (QP) problem in SVM training is especially costly for large training sets. This paper presents a new algorithm, sample reduction by data structure analysis (SR-DSA), that improves the scalability of SVMs. SR-DSA uses data structure information to select the data points that are valuable for learning the separating plane. Because the method runs entirely before SVM training, it avoids a problem suffered by most sample reduction methods, which select samples through repeated training of SVMs. Experiments on both synthetic and real-world datasets show that SR-DSA reduces both the number of samples and the SVM training time while maintaining high testing accuracy.
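
The abstract and keywords mention hierarchical clustering and the Mahalanobis distance but do not spell out the SR-DSA procedure, so the following is only a minimal sketch of the general idea and not the authors' algorithm. It assumes a two-class problem, clusters each class with Ward linkage, and keeps the points of each class that lie closest, in Mahalanobis distance, to a cluster of the opposite class. The function names (`cluster_stats`, `min_mahalanobis`, `select_samples`) and the parameters `n_clusters`, `keep`, and `ridge` are hypothetical choices made for illustration.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage


def cluster_stats(X, n_clusters, ridge=1e-3):
    """Ward-linkage clustering of one class (needs at least 2 rows).

    Returns a list of (mean, inverse covariance) pairs, one per cluster.
    The ridge term is an assumption that keeps each covariance invertible.
    """
    labels = fcluster(linkage(X, method="ward"), t=n_clusters, criterion="maxclust")
    dim = X.shape[1]
    stats = []
    for k in np.unique(labels):
        pts = X[labels == k]
        cov = np.cov(pts, rowvar=False) if len(pts) > 1 else np.zeros((dim, dim))
        cov = np.atleast_2d(cov) + ridge * np.eye(dim)
        stats.append((pts.mean(axis=0), np.linalg.inv(cov)))
    return stats


def min_mahalanobis(X, stats):
    """Mahalanobis distance from each row of X to its closest cluster in `stats`."""
    dists = []
    for mean, inv_cov in stats:
        diff = X - mean
        sq = np.einsum("ij,jk,ik->i", diff, inv_cov, diff)
        dists.append(np.sqrt(np.maximum(sq, 0.0)))
    return np.min(dists, axis=0)


def select_samples(X, y, n_clusters=3, keep=0.3):
    """Per class, keep the fraction `keep` of points nearest the other class's clusters.

    Hypothetical selection rule: points near the opposite class are assumed
    to be the ones most likely to matter for the separating plane.
    """
    classes = np.unique(y)
    if len(classes) != 2:
        raise ValueError("this sketch assumes exactly two classes")
    selected = []
    for c in classes:
        own = np.flatnonzero(y == c)
        other_stats = cluster_stats(X[y != c], n_clusters)
        d = min_mahalanobis(X[own], other_stats)
        n_keep = max(1, int(np.ceil(keep * len(own))))
        selected.append(own[np.argsort(d)[:n_keep]])
    return np.sort(np.concatenate(selected))


# Synthetic two-class data: two Gaussian blobs.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 1, (100, 2)), rng.normal(2, 1, (100, 2))])
y = np.repeat([0, 1], 100)
idx = select_samples(X, y)
print(f"kept {len(idx)} of {len(X)} samples")
```

The selection runs once, before any classifier is fitted, which mirrors the property the abstract emphasizes. The reduced set `X[idx], y[idx]` would then be handed to whatever SVM trainer is in use.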

Bibliographic details

Source
Neurocomputing, 2008, issue 15, pp. 2772-2781 (10 pages)
Authors
Defeng Wang; Lin Shi
Affiliation

Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong

Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
Original format: PDF
Language: English
CLC classification: Computing technology, computer technology
Keywords
sample reduction; support vector machines; hierarchical clustering; Mahalanobis distance

Similar literature

1. Adaptive memetic algorithm enhanced with data geometry analysis to select training data for SVMs [J]. Nalepa Jakub, Kawulok Michal. Neurocomputing. 2016, issue apra12
2. Training the SVM to Larger Dataset Applications using the SVM Sampling Technique [J]. G. M. Sangeetha, Prashanth. Indian Journal of Science and Technology. 2015, issue 15
3. Feature Selection Based on SVM in Photo-Thermal Infrared (IR) Imaging Spectroscopy Classification With Limited Training Samples [J]. Nian Zhang, Keenan Leatham. WSEAS Transactions on Signal Processing. 2017, issue Pta2
4. Automatic selection of informative samples for SVM-based classification of hyperspectral data using limited training sets [C]. Plaza Antonio, Plaza Javier. 2nd Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing. 2010
5. Selected Problems in Survey Sampling Design and Survey Missing Data Analysis [D]. Stubblefield, Alexander Ross. 2019
6. Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data [O]. Xuegong Zhang, Xin Lu, Qian Shi. 2006
7. Sample reduction for SVMs via data structure analysis [O]. Wang DF, Yeung DS, Tsang ECC. 2007
8. Analysis and Interpretation of Sinai Test Structure Data and Investigation of the Applicability of Selected Dynamic Response Programs to Such Structures [R]. Paul, S. L., Haltiwangeer, J. D., Hayes, J. R. 1995