...
首页> 外文期刊>International journal of systems science >Learning naive Bayes classifiers from positive and unlabelled examples with uncertainty
【24h】

Learning naive Bayes classifiers from positive and unlabelled examples with uncertainty

机译:从带有不确定性的阳性和未标记示例中学习朴素贝叶斯分类器

获取原文
获取原文并翻译 | 示例
           

摘要

Traditional classification algorithms require a large number of labelled examples from all the predefined classes, which is generally difficult and time-consuming to obtain. Furthermore, data uncertainty is prevalent in many real-world applications, such as sensor network, market analysis and medical diagnosis. In this article, we explore the issue of classification on uncertain data when only positive and unlabelled examples are available. We propose an algorithm to build naive Bayes classifier from positive and unlabelled examples with uncertainty. However, the algorithm requires the prior probability of positive class, and it is generally difficult for the user to provide this parameter in practice. Two approaches are proposed to avoid this user-specified parameter. One approach is to use a validation set to search for an appropriate value for this parameter, and the other is to estimate it directly. Our extensive experiments show that the two approaches can basically achieve satisfactory classification performance on uncertain data. In addition, our algorithm exploiting uncertainty in the dataset can potentially achieve better classification performance comparing to traditional naive Bayes which ignores uncertainty when handling uncertain data.
机译:传统的分类算法需要来自所有预定义类的大量带标签的示例,这通常很难且耗时。此外,数据不确定性在许多实际应用中都很普遍,例如传感器网络,市场分析和医疗诊断。在本文中,我们将探讨只有不确定的示例可用的情况下才能对不确定数据进行分类的问题。我们提出了一种从不确定的正样本和未标记样本构建朴素贝叶斯分类器的算法。但是,该算法要求先验的概率为肯定等级,因此用户通常很难在实践中提供此参数。提出了两种避免该用户指定参数的方法。一种方法是使用验证集为该参数搜索适当的值,另一种方法是直接估计它。我们广泛的实验表明,这两种方法基本上可以在不确定数据上实现令人满意的分类性能。此外,与传统的朴素贝叶斯算法(在处理不确定数据时忽略不确定性)相比,我们利用数据集中不确定性的算法可以潜在地实现更好的分类性能。

著录项

  • 来源
    《International journal of systems science》 |2012年第12期|p.1805-1825|共21页
  • 作者单位

    College of Information Engineering, Northwest A&F University, Yangling, China;

    College of Information Engineering, Northwest A&F University, Yangling, China,State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China;

    School of Information Technology and Electrical Engineering,The University of Queensland, Brisbane, Australia;

    Department of Computing and Mathematical Sciences,University of Glamorgan, Pontypridd, UK,School of Engineering and Science, Victoria University, Melbourne, Australia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    positive unlabelled learning; uncertain data; naive bayes; positive naive bayes;

    机译:积极的;没有标签的学习;不确定的数据;天真的贝叶斯朴素的贝叶斯;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号