首页> 外文会议>International Conference on Data Mining >Learning Bayesian Networks from Incomplete Data: An Efficient Method for Generating Approximate Predictive Distributions
【24h】

Learning Bayesian Networks from Incomplete Data: An Efficient Method for Generating Approximate Predictive Distributions

机译:从不完整的数据学习贝叶斯网络:一种有效的方法,用于产生近似预测分布

获取原文

摘要

We present an efficient method for learning Bayesian network models and parameters from incomplete data. With our approach an approximation is obtained of the predictive distribution. By way of this distribution any learning algorithm that works for complete data can be easily adapted to work for incomplete data as well. Our method exploits the dependence relations between the variables explicitly given by the Bayesian network model to predict missing values. Based on strength of influence and predictive quality, a subset of those predictor variables is selected, from which an approximate predictive distribution is generated by taking the observed part of the data into consideration. The approximate predictive distribution is obtained by traversing the data sample only twice and no iteration is required. Therefore our algorithm is more efficient than iterative algorithms such as EM and SEM. Our experiments show that the method performs well both for parameter learning and model learning compared to EM and SEM.
机译:我们提出了一种学习贝叶斯网络模型和来自不完整数据的参数的有效方法。通过我们的方法,获得预测分布的近似。通过这种分布,任何用于完整数据的学习算法都可以很容易地适应为不完整的数据工作。我们的方法利用贝叶斯网络模型明确给出的变量之间的依赖关系来预测缺失值。基于影响力和预测质量的强度,选择了那些预测变量的子集,从中考虑了观察到的数据,从中选择了近似预测分布。通过仅遍历数据样本仅两次并且不需要迭代来获得近似预测分布。因此,我们的算法比迭代算法更有效,例如EM和SEM。我们的实验表明,与EM和SEM相比,该方法对参数学习和模型学习效果均匀。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号