...
首页> 外文期刊>Statistical papers >Mutual information and redundancy for categorical data
【24h】

Mutual information and redundancy for categorical data

机译:相互信息和分类数据的冗余

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Most methods for describing the relationship among random variables require specific probability distributions and some assumptions concerning random variables. Mutual information, based on entropy to measure the dependency among random variables, does not need any specific distribution and assumptions. Redundancy, which is an analogous version of mutual information, is also proposed as a method. In this paper, the concepts of redundancy and mutual information are explored as applied to multi-dimensional categorical data. We found that mutual information and redundancy for categorical data can be expressed as a function of the generalized likelihood ratio statistic under several kinds of independent log-linear models. As a consequence, mutual information and redundancy can also be used to analyze contingency tables stochastically. Whereas the generalized likelihood ratio statistic to test the goodness-of-fit of the log-linear models is sensitive to the sample size, the redundancy for categorical data does not depend on sample size but depends on its cell probabilities.
机译:描述随机变量之间关系的大多数方法都需要特定的概率分布和一些有关随机变量的假设。互信息基于熵来度量随机变量之间的依赖性,不需要任何特定的分布和假设。还提出了一种冗余方法,它是互信息的一种类似形式。本文探讨了冗余和互信息的概念,并将其应用于多维分类数据。我们发现,在几种独立的对数线性模型下,分类数据的互信息和冗余可以表示为广义似然比统计的函数。结果,相互信息和冗余也可以用于随机分析列联表。尽管用于检验对数线性模型的拟合优度的广义似然比统计量对样本大小敏感,但分类数据的冗余度并不取决于样本大小,而是取决于其像元概率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号