首页> 外文学位 >Estimating cumulative distribution functions from JPS data with empty strata and analyzing high-throughput data via hierarchical Bayesian spatial modeling.
【24h】

Estimating cumulative distribution functions from JPS data with empty strata and analyzing high-throughput data via hierarchical Bayesian spatial modeling.

机译:从具有空层的JPS数据估计累积分布函数,并通过分层贝叶斯空间建模分析高通量数据。

获取原文
获取原文并翻译 | 示例

摘要

This research contains three parts: (1) estimating cumulative distribution functions from JPS data with empty strata; (2) analysis of ChIP-chip data via hierarchical Bayesian spatial modeling; (3) analysis of time course gene expression data.;Judgment Post-Stratification (JPS), similar to ranked set sampling (RSS), is an efficient data collection method by incorporating ranking information. In practice, the sample sizes of JPS data are often small, so empty strata might occur. In the literature, existing methods do not perform well when simply ignoring empty strata. The original isotonized estimator (Ozturk 2007) can handle empty strata automatically through two methods, MinMax and MaxMin. However, blindly using them can result in undesirable results in either tail of the CDF. In this work, we propose modified isotonized estimators to address the empty strata issue and improve estimation efficiency.;ChIP-on-chip is a high throughput technology used to investigate interaction be- tween regulatory proteins and DNA sequences in a genome-wide scope. In molecular psychiatry research, detecting epigenetic changes through the analysis of ChIP-chip data would enable us to gain a better understanding of neurobiological mechanisms and the genetic and environmental factors contributing to major mental disorders. However, analyzing these data is often very challenging due to spatial dependence of the data, with high noise levels and only a few replicates available under each experiment. We propose ANOVA models with spatially varying coefficients, combined with a hierarchical Bayes approach, to explicitly model spatial correlation caused by location-dependent biological effects (i.e., epigenetic changes).;In time course microarray experiments, gene expression levels are measured over multiple time points, allowing us to study dynamic gene regulation. Typically, the sample sizes of time course data are very small, and noise level is high. Furthermore, gene expression levels are often measured at only a few time points. All these make it difficult to model temporal gene expression patterns. In this work, we develop a hierarchical Bayesian approach to model spatial correlation in temporal gene expression patterns among neighboring genes. The proposed method borrows information among different genes and thus can improve the efficiency of the analysis.
机译:这项研究包括三个部分:(1)从空层的JPS数据估计累积分布函数; (2)通过分层贝叶斯空间建模分析ChIP芯片数据; (3)时程基因表达数据的分析。判断后分层(JPS)与排序集抽样(RSS)类似,是一种通过合并排序信息的有效数据收集方法。实际上,JPS数据的样本大小通常很小,因此可能会出现空层。在文献中,仅忽略空白层时,现有方法效果不佳。原始的等渗估算器(Ozturk 2007)可以通过MinMax和MaxMin这两种方法自动处理空层。但是,盲目使用它们会在CDF的任一尾部导致不良结果。在这项工作中,我们提出了改进的等渗估计器,以解决空白层问题并提高估计效率。芯片上芯片是一种高通量技术,用于研究全基因组范围内调节蛋白与DNA序列之间的相互作用。在分子精神病学研究中,通过对ChIP芯片数据进行分析来检测表观遗传学变化将使我们对神经生物学机制以及导致重大精神障碍的遗传和环境因素有更好的了解。但是,由于数据的空间依赖性,高噪声水平以及每个实验只能进行少量重复,因此分析这些数据通常非常困难。我们提出具有空间变化系数的ANOVA模型,并结合分层贝叶斯方法,以显式地建模由位置相关的生物学效应(即表观遗传变化)引起的空间相关性。在时程微阵列实验中,多次测量基因表达水平点,使我们能够研究动态基因调控。通常,时程数据的样本大小非常小,并且噪声级别很高。此外,通常仅在几个时间点测量基因表达水平。所有这些使得难以对时间基因表达模式进行建模。在这项工作中,我们开发了一种分层贝叶斯方法来对相邻基因之间的时间基因表达模式中的空间相关性进行建模。所提出的方法在不同基因之间借用信息,从而可以提高分析效率。

著录项

  • 作者

    Wang, Ke.;

  • 作者单位

    Southern Methodist University.;

  • 授予单位 Southern Methodist University.;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2012
  • 页码 132 p.
  • 总页数 132
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:43:28

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号