首页> 外文学位 >Clustering and Registration of Functional Data with Applications in Time Course Genomics Data.
【24h】

Clustering and Registration of Functional Data with Applications in Time Course Genomics Data.

机译:功能数据的聚类和注册以及时程基因组数据中的应用程序。

获取原文
获取原文并翻译 | 示例

摘要

Functional data analysis aims to provide statistical inference for stochastic processes defined over a functional space. Typical data sources, often modeled using functional data analytic techniques, include: nonlinear longitudinal data in biomedicine, image and spatial data, space-time data, etc. This dissertation will be mainly concerned with the analysis of data arising from bio-molecular processes evolving over time. Specifically, we will consider functional data conceptualized as random curves defined over a time domain. Two important techniques used in the analysis of functional data are clustering and registration.;Functional data clustering aims to identify subgroups of curves with similar shapes and estimate representative mean curves in each cluster. When applied to time course genomics data, functional data clustering identifies clusters of genes sharing similar temporal profiles. These clusters are likely to consist of genes involved in the same biological processes and functions. Functional data registration methods align curves exhibiting phase variability (e.g. variation among timings of features of different curves). After alignment, a common shape function can be estimated consistently to represent the overall pattern shared by all curves. However, when curves show both systematic shape differences and phase variability, neither functional data registration nor clustering alone is appropriate for data analysis.;Motivated by applications in time course genomics data, we propose a joint model for functional data clustering and registration. The proposed method integrates reproducing representations of functions in the framework on Dirichlet process mixtures. Simulation and case studies on real datasets show that our model is able to correctly cluster and register curves simultaneously. We explore several methodological alternatives in both synthetic and case study scenarios and show that jointly accounting for registration and clustering produces more accurate and interpretable inference.
机译:功能数据分析旨在为在功能空间上定义的随机过程提供统计推断。通常使用功能数据分析技术建模的典型数据源包括:生物医学中的非线性纵向数据,图像和空间数据,时空数据等。本论文将主要涉及对生物分子过程演化过程中的数据进行分析。随着时间的推移。具体来说,我们将功能化数据概念化为在时域上定义的随机曲线。功能数据分析中使用的两种重要技术是聚类和配准。功能数据聚类旨在识别形状相似的曲线子组,并估计每个聚类中的代表性平均曲线。当应用于时程基因组数据时,功能数据聚类可识别共享相似时间分布图的基因簇。这些簇可能由参与相同生物学过程和功能的基因组成。功能数据配准方法将表现出相位可变性的曲线对齐(例如,不同曲线的特征的时序之间存在差异)。对齐后,可以一致地估计一个通用的形状函数,以表示所有曲线共享的整体图案。但是,当曲线同时显示出系统的形状差异和相变性时,功能数据的注册和聚类都不适合进行数据分析。;由于在时程基因组学数据中的应用,我们提出了一种功能数据聚类和注册的联合模型。所提出的方法在Dirichlet过程混合物的框架中集成了功能的再现表示。对真实数据集的仿真和案例研究表明,我们的模型能够同时正确地聚类和注册曲线。我们在综合案例研究和案例研究案例中探索了几种方法论替代方法,并表明联合考虑注册和聚类可以得出更准确和可解释的推论。

著录项

  • 作者

    Zhang, Yafeng.;

  • 作者单位

    University of California, Los Angeles.;

  • 授予单位 University of California, Los Angeles.;
  • 学科 Biology Biostatistics.;Biology Bioinformatics.;Statistics.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 112 p.
  • 总页数 112
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号