...
首页> 外文期刊>Neurocomputing >Efficient semi-supervised feature selection with noise insensitivetrace ratio criterion
【24h】

Efficient semi-supervised feature selection with noise insensitivetrace ratio criterion

机译:噪声不敏感迹线比率准则的高效半监督特征选择

获取原文
获取原文并翻译 | 示例

摘要

Feature selection is an effective method to deal with high-dimensional data. While in many applications such as multimedia and web mining, the data are often high-dimensional and very large scale, but the labeled data are often very limited. On these kind of applications, it is important that the feature selection algorithm is efficient and can explore labeled data and unlabeled data simultaneously. In this paper, we target on this problem and propose an efficient semi-supervised feature selection algorithm to select relevant features using both labeled and unlabeled data. First, we analyze a popular trace ratio criterion in the dimensionality reduction, and point out that the trace ratio criterion tends to select features with very small variance. To solve this problem, we propose a noise insensitive trace ratio criterion for feature selection with a re-scale preprocessing. Interestingly, the feature selection with the noise insensitive trace ratio criterion can be much more efficiently solved. Based on the noise insensitive trace ratio criterion, we propose a new semi-supervised feature selection algorithm. The algorithm fully explores the distribution of the labeled and unlabeled data with a special label propagation method. Experimental results verify the effectiveness of the proposed algorithm, and show improvement over traditional supervised feature selection algorithms.
机译:特征选择是处理高维数据的有效方法。尽管在诸如多媒体和Web挖掘之类的许多应用程序中,数据通常是高维且规模很大的,但是标记的数据通常非常有限。在这类应用程序中,重要的是特征选择算法必须高效并且可以同时浏览标记的数据和未标记的数据。在本文中,我们针对此问题并提出了一种有效的半监督特征选择算法,以使用标记和未标记的数据选择相关特征。首先,我们分析了降维中常用的迹线比率准则,并指出迹线比率准则倾向于选择方差很小的特征。为了解决这个问题,我们提出了一种噪声不敏感的跟踪比率准则,用于具有重新缩放预处理的特征选择。有趣的是,可以使用噪声不敏感迹线比率准则更有效地解决特征选择。基于噪声不敏感迹线比率准则,我们提出了一种新的半监督特征选择算法。该算法使用一种特殊的标签传播方法来充分探索标记和未标记数据的分布。实验结果验证了该算法的有效性,并表明了对传统监督特征选择算法的改进。

著录项

  • 来源
    《Neurocomputing 》 |2013年第1期| 12-18| 共7页
  • 作者单位

    Department of Computer Science and Engineering, University of Texas, Arlington, USA;

    Department of Computer Science and Engineering, University of Texas, Arlington, USA;

    Department of Computer Science, Tianjin Polytechnic University, Tianjin, China;

    School of Electrical ss Electronic Engineering, Nanyang Technological University, Singapore;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    feature selection; semi-supervised; trace ratio; graph based learning; re-scale preprocessing;

    机译:特征选择;半监督痕量比基于图的学​​习重新缩放预处理;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号