首页> 外文期刊>Neurocomputing >Selecting valuable training samples for SVMs via data structure analysis
【24h】

Selecting valuable training samples for SVMs via data structure analysis

机译:通过数据结构分析为SVM选择有价值的培训样本

获取原文
获取原文并翻译 | 示例
           

摘要

In spite of its salient properties and wide acceptance, support vector machines (SVMs) still face difficulties in scalability, because solving the quadratic programming (QP) problems in SVMs training is especially costly when dealing with large sets of training data. This paper presents a new algorithm named sample reduction by data structure analysis (SR-DSA) for SVMs to improve their scalability. The SR-DSA utilizes data structure information in selecting the data points valuable in learning the separating plane. As this method is performed completely before SVMs training, it avoids the problem suffered by most sample reduction methods that choose samples heavily depending on repeated training of SVMs. Experiments on both synthetic and real world datasets show that the SR-DSA is capable of reducing the number of samples as well as the time for SVMs training while maintaining high testing accuracy.
机译:尽管支持向量机(SVM)具有突出的特性和广泛的接受性,但在扩展性方面仍然面临困难,因为在处理大量的训练数据时,解决SVM训练中的二次编程(QP)问题特别昂贵。本文提出了一种新的算法,用于支持向量机的数据结构分析(SR-DSA)样本减少以提高其可扩展性。 SR-DSA利用数据结构信息来选择对学习分离平面有价值的数据点。由于此方法是在SVM训练之前完全执行的,因此它避免了大多数样本缩减方法所遇到的问题,这些方法大量依赖SVM的反复训练来选择样本。在合成和真实数据集上进行的实验表明,SR-DSA能够减少样本数量以及SVM训练的时间,同时保持较高的测试精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号