首页> 外文期刊>AStA Advances in statistical analysis >Discriminant analysis with Gaussian graphical tree models
【24h】

Discriminant analysis with Gaussian graphical tree models

机译:高斯图形树模型的判别分析

获取原文
获取原文并翻译 | 示例
       

摘要

We consider Gaussian graphical tree models in discriminant analysis for two populations. Both the parameters and the structure of the graph are assumed to be unknown. For the estimation of the parameters maximum likelihood is used, and for the estimation of the structure of the tree graph we propose three methods; in these, the function to be optimized is the J-divergence for one and the empirical log-likelihood ratio for the two others. The main contribution of this paper is the introduction of these three computationally efficient methods. We show that the optimization problem of each proposed method is equivalent to one of finding a minimum weight spanning tree, which can be solved efficiently even if the number of variables is large. This property together with the existence of the maximum likelihood estimators for small group sample sizes is the main advantage of the proposed methods. A numerical comparison of the classification performance of discriminant analysis using these methods, as well as three other existing ones, is presented. This comparison is based on the estimated error rates of the corresponding plug-in allocation rules obtained from real and simulated data. Diagonal discriminant analysis is considered as a benchmark, as well as quadratic and linear discriminant analysis whenever the sample size is sufficient. The results show that discriminant analysis with Gaussian tree models, using these methods for selecting the graph structure, is competitive with diagonal discriminant analysis in high-dimensional settings.
机译:在对两个总体进行判别分析时,我们考虑使用高斯图形树模型。假定图的参数和结构都是未知的。为了估计参数,使用了最大似然,对于树形图的结构,我们提出了三种方法:在这些函数中,要优化的函数是一个函数的J散度和两个函数的经验对数似然比。本文的主要贡献是介绍了这三种计算有效的方法。我们表明,每种提出的方​​法的优化问题都等同于找到最小权重生成树之一,即使变量数量很大,该树也可以有效解决。对于小规模的样本量,此属性以及最大似然估计器的存在是所提出方法的主要优势。给出了使用这些方法以及其他三个现有方法进行判别分析的分类性能的数值比较。该比较基于从真实数据和模拟数据获得的相应插件分配规则的估计错误率。只要样本量足够,对角判别分析以及二次和线性判别分析都将被视为基准。结果表明,使用高斯树模型的判别分析,使用这些方法选择图结构,在高维环境下与对角判别分析相比具有竞争优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号