【24h】

Data Selection Using SASH Trees for Support Vector Machines

机译:使用支持向量机的SASH树进行数据选择

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a data preprocessing procedure to select support vector (SV) candidates. We select decision boundary region vectors (BRVs) as SV candidates. Without the need to use the decision boundary, BRVs can be selected based on a vector's nearest neighbor of opposite class (NNO). To speed up the process, two spatial approximation sample hierarchical (SASH) trees are used for estimating the BRVs. Empirical results show that our data selection procedure can reduce a full dataset to the number of SVs or only slightly higher. Training with the selected subset gives performance comparable to that of the full dataset. For large datasets, overall time spent in selecting and training on the smaller dataset is significantly lower than the time used in training on the full dataset.
机译:本文提出了一种数据预处理程序,以选择支持向量(SV)候选对象。我们选择决策边界区域向量(BRV)作为SV候选者。无需使用决策边界,可以基于向量的相反类别的最近邻居(NNO)选择BRV。为了加快该过程,使用了两个空间近似样本分层(SASH)树来估计BRV。实证结果表明,我们的数据选择程序可以将整个数据集减少到SV的数量,或者仅略高一些。使用选定的子集进行训练可获得与整个数据集相当的性能。对于大型数据集,在较小数据集上进行选择和训练所花费的总时间明显少于在整个数据集上进行训练所花费的时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号