首页> 外文学位 >Testing Independence in High Dimensions & Identifiability of Graphical Models.
【24h】

Testing Independence in High Dimensions & Identifiability of Graphical Models.

机译:在图形模型的高维和可识别性中测试独立性。

获取原文
获取原文并翻译 | 示例

摘要

In this thesis two problems in multivariate statistics will be studied.;In the first chaper, we treat the problem of testing independence between m continuous observations when m can be larger than the available sample size n. We consider three types of test statistics that are constructed as sums of many pairwise rank correlation signals. In the asymptotic regime where both m and n converge to infinity, a martingale central limit theorem is applied to show that the null distributions of these statistics converge to Gaussian limits, which are valid with no specific distributional or moment assumptions on the data. Using the framework of U-statistics, our result covers a variety of rank correlations including Kendall's tau and a dominating term of Spearman's rank correlation coefficient (rho), but also degenerate U-statistics such as Hoeffding's D, or the tau* of Bergsma and Dassios. Like the classical theory for U-statistics, the test statistics need to be scaled differently when the rank correlations used to construct them are degenerate U-statistics. The power of the considered tests is explored in rate-optimality theory under a Gaussian equicorrelation alternative as well as in numerical experiments for specific cases of more general alternatives.;In the second chapter, we study parameter identifiability of directed Gaussian graphical models with one latent variable. In the scenario we consider, the latent variable is a confounder that forms a source node of the graph and is a parent to all other nodes, which correspond to the observed variables. We give a graphical condition that is sufficient for the Jacobian matrix of the parametrization map to be full rank, which entails that the parametrization is generically finite-to-one, a fact that is sometimes also referred to as local identifiability. We also derive a graphical condition that is necessary for such identifiability. Finally, we give a condition under which generic parameter identifiability can be determined from identifiability of a model associated with a subgraph. The power of these criteria is assessed via an exhaustive algebraic computational study for small models with 4, 5, and 6 observable variables, and a simulation study for large models with 25 or 35 observable variables.
机译:本文将研究多元统计中的两个问题。在第一章中,当m大于可用样本量n时,我们处理m个连续观测值之间的检验独立性问题。我们考虑了三种类型的测试统计数据,它们被构造为许多成对的秩相关信号之和。在m和n都收敛到无穷大的渐近状态下,应用mar中心极限定理表明这些统计量的零分布收敛到高斯极限,这在数据上没有特定分布或矩假设的情况下是有效的。使用U统计的框架,我们的结果涵盖了各种秩相关,包括肯德尔的tau和Spearman秩相关系数(rho)的主导项,还退化了U统计,例如Hoeffding's D或Bergsma的tau *和达索斯像经典的U统计量理论一样,当用于构造检验量级的相关性是退化的U统计量时,需要对测试统计量进行不同的缩放。在高斯等相关方案下的速率最优理论以及针对更一般方案的特殊情况的数值实验中,探索了所考虑检验的能力。第二章,我们研究了具有一个潜伏的有向高斯图形模型的参数可识别性变量。在我们考虑的场景中,潜在变量是一个混杂因素,它构成了图的源节点,并且是所有其他节点(与观察到的变量相对应)的父级。我们给出一个足以使参数化图的雅可比矩阵达到满秩的图形条件,这意味着参数化通常是有限的一对一,这一事实有时也称为局部可识别性。我们还得出了这种可识别性所必需的图形条件。最后,我们给出了一个条件,在该条件下可以根据与子图相关联的模型的可识别性确定通用参数的可识别性。这些标准的功效是通过对4个,5个和6个可观察变量的小型模型进行详尽的代数计算研究以及对25个或35个可观察变量的大型模型进行模拟研究来评估的。

著录项

  • 作者

    Leung, Dennis.;

  • 作者单位

    University of Washington.;

  • 授予单位 University of Washington.;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2016
  • 页码 112 p.
  • 总页数 112
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:50:19

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号