首页> 外文学位 >Data -analytic modeling for high -dimensional statistical problems.
【24h】

Data -analytic modeling for high -dimensional statistical problems.

机译:高维统计问题的数据分析建模。

获取原文
获取原文并翻译 | 示例

摘要

With advancement of technology and huge investment in various forms of data gathering, high dimensional problems have become increasingly important in quantitative research of many disciplines. Large sample sizes with high dimensions are important characteristics of many challenging problems in science, engineering and medicine. To reduce possible modeling biases, many flexible models are introduced at the initial phase of study. It poses a new theoretical and methodological challenge to study whether methods used for finite-dimensional problems are still useful in high-dimensional problems. This forms the theme of this thesis. In this thesis, three inter-related topics on high-dimensional statistical modeling are studied, which are detailed below.;A class of variable selection procedures for parametric models via nonconcave penalized likelihood is proposed by Fan and Li (2001) to simultaneously estimate parameters and select important variables. They demonstrate that this class of procedures has an oracle property when the number of parameters is finite. However, in most model selection problems, the number of parameters should be large, and grow with the sample size. In this thesis, some asymptotic properties of the nonconcave penalized likelihood are established for situations in which the number of parameters tends to infinity as the sample size increases. Under regularity conditions, we have established an oracle property and the asymptotic normality of the penalized likelihood estimators. Furthermore, the consistency of the sandwich formula of the covariance matrix is demonstrated. Nonconcave penalized likelihood ratio statistics are discussed, and their asymptotic distributions under the null hypothesis are obtained by imposing some mild conditions on the penalty functions. The asymptotic results are augmented by a simulation study and the newly developed methodology is illustrated by an analysis of the court case on the sexual discrimination of salary. (Abstract shortened by UMI.).
机译:随着技术的进步和对各种形式的数据收集的巨大投资,高维问题在许多学科的定量研究中变得越来越重要。高尺寸的大样本量是科学,工程和医学领域许多难题的重要特征。为了减少可能的建模偏差,在研究的初始阶段引入了许多灵活的模型。研究用于有限维问题的方法在高维问题中是否仍然有用,这提出了新的理论和方法挑战。这构成了本文的主题。本论文研究了与高维统计建模有关的三个相互关联的主题,下面将对此进行详细介绍。; Fan和Li(2001)提出了一类基于非凹惩罚性的参数模型变量选择程序,以同时估计参数并选择重要的变量。他们证明,当参数数量有限时,此类过程具有oracle属性。但是,在大多数模型选择问题中,参数的数量应该很大,并且会随样本量的增加而增加。在本文中,针对参数数量随着样本数量的增加而趋于无穷大的情况,建立了非凹惩罚可能性的一些渐近性质。在正则条件下,我们建立了被罚似然估计量的预言性质和渐近正态性。此外,证明了协方差矩阵的三明治公式的一致性。讨论了非凹惩罚似然比统计量,并通过在惩罚函数上施加一些温和条件,得到了原假设下它们的渐近分布。通过模拟研究来增加渐近结果,并通过对法院关于工资性别歧视的案例的分析来说明新开发的方法。 (摘要由UMI缩短。)。

著录项

  • 作者

    Peng, Heng.;

  • 作者单位

    The Chinese University of Hong Kong (Hong Kong).;

  • 授予单位 The Chinese University of Hong Kong (Hong Kong).;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2003
  • 页码 100 p.
  • 总页数 100
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号