首页> 外文会议> >Learning Bayesian network classifiers from data with missing values
【24h】

Learning Bayesian network classifiers from data with missing values

机译:从缺少值的数据中学习贝叶斯网络分类器

获取原文
获取外文期刊封面目录资料

摘要

Learning accurate Bayesian network (BN) classifiers from complete databases is a very active research topic in data mining and machine learning. However, in practice, databases are rarely complete. This affects their real world data mining applications. This paper investigates the methods for learning four types well-known Bayesian network classifiers from incomplete databases. These four types BN classifiers are: Naive-Bayes, tree augmented Naive-Bayes, BN augmented Naive-Bayes, and general BN, where the latter two are learned using dependency analysis based algorithms that work only on the database completeness assumption. In order to enable this kind of algorithms to handle with missing data, this paper introduces a novel deterministic method to estimate the (conditional) mutual information from incomplete databases, which can be used to do CI tests, a fundamental step in the dependency analysis based algorithms. The experimental results show that our algorithm is efficient and reliable.
机译:从完整的数据库中学习准确的贝叶斯网络(BN)分类器是数据挖掘和机器学习中非常活跃的研究主题。但是,实际上,数据库很少是完整的。这影响了他们在现实世界中的数据挖掘应用程序。本文研究了从不完整数据库中学习四种类型的著名贝叶斯网络分类器的方法。这四种类型的BN分类器是:朴素贝叶斯,树形增强朴素贝叶斯,BN增强朴素贝叶斯和普通BN,其中后两种是使用仅基于数据库完整性假设的基于依赖关系分析的算法学习的。为了使这种算法能够处理丢失的数据,本文介绍了一种新颖的确定性方法,可从不完整的数据库中估计(条件)互信息,该方法可用于进行CI测试,这是基于依赖关系分析的基本步骤算法。实验结果表明,该算法是有效且可靠的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号