...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A multiple criteria active learning method for support vector regression
【24h】

A multiple criteria active learning method for support vector regression

机译:支持向量回归的多准则主动学习方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper presents a novel active learning method developed in the framework of e-insensitive support vector regression (SVR) for the solution of regression problems with small size initial training data. The proposed active learning method selects iteratively the most informative as well as representative unlabeled samples to be included in the training set by jointly evaluating three criteria: (i) relevancy, (ii) diversity, and (iii) density of samples. All three criteria are implemented according to the SVR properties and are applied in two clustering-based consecutive steps. In the first step, a novel measure to select the most relevant samples that have high probability to be located either outside or on the boundary of the e-tube of SVR is defined. To this end, initially a clustering method is applied to all unlabeled samples together with the training samples that are inside the e-tube (those that are not support vectors, i.e., non-SVs); then the clusters with non-SVs are eliminated. The unlabeled samples in the remaining clusters are considered as the most relevant patterns. In the second step, a novel measure to select diverse samples among the relevant patterns from the high density regions in the feature space is defined to better model the SVR learning function. To this end, initially clusters with the highest density of samples are chosen to identify the highest density regions in the feature space. Then,the sample from each selected cluster that is associated with the portion of feature space having the highest density (i.e., the most representative of the underlying distribution of samples contained in the related cluster) is selected to be included in the training set. In this way diverse samples taken from high density regions are efficiently identified. Experimental results obtained on four different data sets show the robustness of the proposed technique particularly when a small-size initial training set are available.
机译:本文提出了一种在电子不敏感支持向量回归(SVR)框架下开发的新颖主动学习方法,用于解决小规模初始训练数据的回归问题。拟议的主动学习方法通​​过联合评估三个标准:(i)相关性,(ii)多样性和(iii)样本密度,迭代地选择信息量最大和代表性的未标记样本包括在训练集中。所有这三个条件都是根据SVR属性实现的,并在两个基于群集的连续步骤中应用。在第一步中,定义了一种新颖的方法来选择最相关的样本,这些样本最有可能位于SVR电子管的外部或边界上。为此,最初将聚类方法应用于所有未标记的样本以及电子管内的训练样本(不支持向量的样本,即非SV);然后排除具有非SV的群集。其余簇中未标记的样本被认为是最相关的模式。在第二步中,定义了一种从特征空间中高密度区域的相关模式中选择不同样本的新颖方法,以更好地对SVR学习功能进行建模。为此,首先选择具有最高密度样本的聚类,以识别特征空间中最高密度的区域。然后,从每个选定聚类中选择与特征空间中具有最高密度(即,最能代表相关聚类中样本的基础分布的部分)相关联的样本,以将其包含在训练集中。通过这种方式,可以有效地识别从高密度区域采集的各种样本。在四个不同的数据集上获得的实验结果表明了所提出技术的鲁棒性,尤其是在有小尺寸的初始训练集的情况下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号