【24h】

Positive and Unlabeled Examples Hlep learning

机译:正面和未标记的例子提升学习

获取原文
获取原文并翻译 | 示例

摘要

In many learning problems, labeled examples are rare or expensive while numerous unlabeled and positive examples are available. However, most learnign algorithms only use labeled examples. Thus we address the prolem of learning with the help of positive and unlabeled data given a small number of labeled examples. We present both theoretical and empirical arguments showing that learning algorithms from statistics for monotone conjunctions in the presence of classification noise and give empirical evidence of our assumptions. We give theoretical results for the improvement of Statistical Query learning algorithms from positive and unlabeled data. Lastly, we apply these ideas to tree induction algorithms. We modify the code of C4.5 to get an algorithm which takes as input a set LAB of labeled examples, a set POS of positive examples and a set UNL of unlabeled data and which uses these three sets to construct the decision tree. We provide experimental results based on data taken from UCI repository which confirm the relevance of this approach.
机译:在许多学习问题中,带标签的示例很少或很昂贵,而有大量未标记的积极示例可用。但是,大多数学习算法仅使用标记的示例。因此,在给出少量标记示例的情况下,我们借助正面和未标记的数据来解决学习的难题。我们同时提供理论和经验论证,这些论证表明在存在分类噪声的情况下从统计数据中学习单调连词的算法,并为我们的假设提供经验证据。我们给出了从阳性和未标记数据中改进统计查询学习算法的理论结果。最后,我们将这些思想应用于树归纳算法。我们修改C4.5的代码,以获得一种算法,该算法将一组带标签的示例的LAB,一组正示例的POS和一组无标签的数据的UNL作为输入,并使用这三组构造决策树。我们基于从UCI资料库中获取的数据提供实验结果,证实了此方法的相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号