首页> 外文期刊>Pattern recognition letters >Feature subset selection from positive and unlabelled examples
【24h】

Feature subset selection from positive and unlabelled examples

机译:从阳性和未标记的示例中选择特征子集

获取原文
获取原文并翻译 | 示例
           

摘要

The feature subset selection problem has a growing importance in many machine learning applications where the amount of variables is very high. There is a great number of algorithms that can approach this problem in supervised databases but, when examples from one or more classes are not available, supervised feature subset selection algorithms cannot be directly applied. One of these algorithms is the correlation based filter selection (CFS). In this work we propose an adaptation of this algorithm that can be applied when only positive and unlabelled examples are available. As far as we know, this is the first time the feature subset selection problem is studied in the positive unlabelled learning context. We have tested this adaptation on synthetic datasets obtained by sampling Bayesian network models where we know which variables are (in)dependent of the class. We have also tested our adaptations on real-life databases where the absence of negative examples has been simulated. The results show that, having enough positive examples, it is possible to obtain good solutions to the feature subset selection problem when only positive and unlabelled instances are available.
机译:在子集非常多的许多机器学习应用中,特征子集选择问题变得越来越重要。有很多算法可以在监督数据库中解决此问题,但是,当一个或多个类的示例不可用时,监督特征子集选择算法将无法直接应用。这些算法之一是基于相关的滤波器选择(CFS)。在这项工作中,我们提出了该算法的一种改编,当只有肯定和未标记的示例可用时可以应用。据我们所知,这是第一次在积极的未标记学习环境中研究特征子集选择问题。我们已经对通过采样贝叶斯网络模型获得的合成数据集测试了这种适应性,在贝叶斯网络模型中我们知道哪些变量与该类相关。我们还在真实数据库中测试了我们的改编,其中模拟了没有负面例子的情况。结果表明,有了足够多的肯定实例,当只有肯定实例和未标记实例可用时,就可以获得特征子集选择问题的良好解决方案。

著录项

  • 来源
    《Pattern recognition letters》 |2009年第11期|1027-1036|共10页
  • 作者单位

    Intelligent Systems Group, Department of Computer Science and Artificial Intelligence, University of the Basque Country, Paseo Manuel de Lardizabal, I, E-20018 Donostia-San Sebastian, Spain;

    Departamento de lngeligencia Artificial, Universidad Politecnica de Madrid, E-28660 Boadilla del Monte, Spain;

    Intelligent Systems Group, Department of Computer Science and Artificial Intelligence, University of the Basque Country, Paseo Manuel de Lardizabal, I, E-20018 Donostia-San Sebastian, Spain;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    positive unlabelled learning; partially supervised classification; feature subset selection; filter methods;

    机译:积极的;没有标签的学习;部分监督分类;特征子集选择;过滤方法;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号