首页> 外文会议>German Conference on Artificial Intelligence >Bayesian Treatment of Incomplete Discrete Data Applied to Mutual Information and Feature Selection
【24h】

Bayesian Treatment of Incomplete Discrete Data Applied to Mutual Information and Feature Selection

机译:贝叶斯治疗不完整的离散数据应用于相互信息和特征选择

获取原文

摘要

Given the joint chances of a pair of random variables one can compute quantities of interest, like the mutual information. The Bayesian treatment of unknown chances involves computing, from a second order prior distribution and the data likelihood, a posterior distribution of the chances. A common treatment of incomplete data is to assume ignorability and determine the chances by the expectation maximization (EM) algorithm. The two different methods above are well established but typically separated. This paper joins the two approaches in the case of Dirichlet priors, and derives efficient approximations for the mean, mode and the (co) variance of the chances and the mutual information. Further-more, we prove the unimodality of the posterior distribution, whence the important property of convergence of EM to the global maximum in the chosen framework. These results are applied to the problem of selecting features for incremental learning and naive Bayes classification. A fast filter based on the distribution of mutual information is shown to out-perform the traditional filter based on empirical mutual information on a number of incomplete real data sets.
机译:鉴于一对随机变量的关节机会,可以计算利息的数量,如互信息。贝叶斯治疗未知机会涉及计算,从二阶现有分发和数据可能性,机会的后部分布。不完全数据的常见处理是假设无知性并通过期望最大化(EM)算法来确定机会。上面的两种不同的方法是很好的,但通常是分开的。本文在Dirichlet Priors的情况下加入了两种方法,并且可以为机会的平均值,模式和(CO)方差和相互信息产生有效近似。更重要的是,我们证明了后部分布的单位性,何种在所选框架中将EM与全球最大值的重要性。这些结果适用于选择增量学习和幼稚贝叶斯分类的特征问题。基于互相信息分布的快速滤波器基于关于许多不完整的真实数据集的经验互信息,为传统滤波器出出传统滤波器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号