首页> 外文学位 >Statistical inference for functional and longitudinal data.
【24h】

Statistical inference for functional and longitudinal data.

机译:功能和纵向数据的统计推断。

获取原文
获取原文并翻译 | 示例

摘要

Advances in modern technology have facilitated the collection of high-dimensional functional and low dimensional longitudinal data. For these data, it is often of interest to describe the key signals of the data (mean functions, covariance functions, derivative functions, etc.). Functional data analysis (FDA) and longitudinal data analysis (LDA) techniques have played a central role in the analysis of these data. The primary goal of this dissertation is to provide some novel statistical inference methods for FDA and LDA.;In Chapter 1, we describe the structure (design, notations, etc.) of functional data and describe the spline smoothing technique as a tool to analysis these data. Longitudinal data analysis with missing not at random response is also discussed.;In Chapter 2, a polynomial spline estimator is proposed for the mean function of dense functional data together with a simultaneous confidence band which is asymptotically correct. In addition, the spline estimator and its accompanying confidence band enjoy semiparametric efficiency in the sense that they are asymptotically the same as if all random trajectories are observed entirely and without errors. The confidence band is also extended to the difference of mean functions of two populations of functional data. Simulation experiments provide strong evidence that corroborates the asymptotic theory while computing is efficient. The confidence band procedure is illustrated by analyzing the near infrared spectroscopy data.;A nonparametric estimation of the covariance function for dense functional data using tensor product B-splines is considered in Chapter 3. We develop both local and global asymptotic distributions for the proposed estimator, and show that our estimator is as efficient as an "oracle'' estimator. Monte Carlo simulation experiments and two real data examples are also provided to illustrate the proposed method in this chapter. In Chapter 4, we develop a new procedure to construct simultaneous confidence bands for derivatives of mean curves in FDA. The technique involves polynomial splines that provide an approximation to the derivatives of the mean functions, the covariance functions and the associated eigenfunctions. The confidence band procedure is illustrated through numerical simulation studies and a real life example.;In Chapter 5, we consider data generated from a longitudinal study with potentially non random missing data. For these data, a joint model for the missing data process and the outcome process, is found to be at best weakly identifiable. Due to this identifiability concerns, tests concerning the parameters of interest may not be able to use conventional theories and it may not be clear how to assess statistical significance. We extend the literature by developing a testing procedure that can be used to evaluate hypotheses under non and weakly identifiable semiparametric models. We derive the limiting distribution of this statistic and propose theoretically justified resampling approaches to approximate its asymptotic distribution. The methodology's practical utility is illustrated in simulations and an analysis of quality-of-life outcomes from a longitudinal study on breast cancer.
机译:现代技术的进步促进了高维功能和低维纵向数据的收集。对于这些数据,通常需要描述数据的关键信号(均值函数,协方差函数,导数函数等)。功能数据分析(FDA)和纵向数据分析(LDA)技术在这些数据的分析中发挥了核心作用。本文的主要目的是为FDA和LDA提供一些新颖的统计推断方法。在第一章中,我们描述了功能数据的结构(设计,符号等),并描述了样条平滑技术作为分析工具。这些数据。在第二章中,提出了一种多项式样条估计器,用于密集函数数据的均值函数和同时渐近正确的同时置信带。此外,样条估计器及其伴随的置信带在渐近相同的意义上享有半参数效率,就好像完全观察到所有随机轨迹且没有错误一样。置信带也扩展到两个功能数据总体的平均功能之差。仿真实验提供了有力的证据,可以证明渐近理论,同时计算效率很高。通过分析近红外光谱数据来说明置信带过程。;在第3章中考虑了使用张量积B样条的密集函数数据的协方差函数的非参数估计。我们为拟议的估计量开发了局部和全局渐近分布,并证明我们的估计器与“ oracle”估计器一样有效,还提供了蒙特卡洛模拟实验和两个实际数据示例来说明本章中提出的方法;在第四章中,我们开发了一种新的同时构造方法FDA中均值曲线导数的置信带,该技术涉及多项式样条,提供了均值函数,协方差函数和相关特征函数的导数的近似值,并通过数值模拟研究和实际例子说明了置信带过程。;在第5章中,我们考虑了使用po进行纵向研究得出的数据潜在的非随机丢失数据。对于这些数据,发现缺少数据的过程和结果过程的联合模型充其量是很难识别的。由于存在这种可识别性,有关目标参数的测试可能无法使用常规理论,并且不清楚如何评估统计显着性。我们通过开发一种可用于评估不可识别的半参数模型下的假设的测试程序来扩展文献。我们得出该统计量的极限分布,并提出理论上合理的重采样方法以近似其渐近分布。通过对乳腺癌的纵向研究进行的模拟和对生活质量结果的分析说明了该方法的实际实用性。

著录项

  • 作者

    Cao, Guanqun.;

  • 作者单位

    Michigan State University.;

  • 授予单位 Michigan State University.;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 191 p.
  • 总页数 191
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号