首页> 外文学位 >Resampling-based multiple testing with applications to microarray data analysis.
【24h】

Resampling-based multiple testing with applications to microarray data analysis.

机译:基于重采样的多重测试及其在微阵列数据分析中的应用。

获取原文
获取原文并翻译 | 示例

摘要

In microarray data analysis, resampling methods are widely used to discover significantly differentially expressed genes under different biological conditions when the distributions of test statistics are unknown. When sample size is small, however, simultaneous testing of thousands, or even millions, of null hypotheses in microarray data analysis brings challenges to the multiple hypothesis testing field. We study small sample behavior of three commonly used resampling methods, including permutation tests, post-pivot resampling methods, and pre-pivot resampling methods in multiple hypothesis testing. We show the model-based pre-pivot resampling methods have the largest maximum number of unique resampled test statistic values, which tend to produce more reliable P-values than the other two resampling methods. To avoid problems with the application of the three resampling methods in practice, we propose new conditions, based on the Partitioning Principle, to control the multiple testing error rates in fixed-effects general linear models. Meanwhile, from both theoretical results and simulation studies, we show the discrepancies between the true expected values of order statistics and the expected values of order statistics estimated by permutation in the Significant Analysis of Microarrays (SAM) procedure. Moreover, we show the conditions for SAM to control the expected number of false rejections in the permutation-based SAM procedure. We also propose a more powerful adaptive two-step procedure to control the expected number of false rejections with larger critical values than the Bonferroni procedure.
机译:在微阵列数据分析中,当测试统计数据的分布未知时,重采样方法被广泛用于发现在不同生物学条件下显着差异表达的基因。但是,当样本量较小时,同时测试微阵列数据分析中成千上万甚至数百万个无效假设会给多重假设测试领域带来挑战。我们研究了三种假设的重采样方法中的三种采样方法的小样本行为,包括排列检验,数据透视后重采样方法和数据透视前重采样方法。我们显示,基于模型的数据透视前重采样方法具有最大数量的最大唯一重采样测试统计值,与其他两种重采样方法相比,它们倾向于产生更可靠的P值。为了避免在实践中应用这三种重采样方法带来的问题,我们基于划分原理提出了新的条件,以控制固定效果通用线性模型中的多个测试错误率。同时,从理论结果和模拟研究中,我们都显示了顺序统计的真实期望值与通过微阵列的重要分析(SAM)程序通过置换估计的顺序统计的期望值之间的差异。此外,我们显示了在基于置换的SAM过程中SAM控制预期的错误拒绝数量的条件。我们还提出了一种功能更强大的自适应两步过程,以比Bonferroni过程更大的临界值来控制预期的误剔除次数。

著录项

  • 作者

    Li, Dongmei.;

  • 作者单位

    The Ohio State University.;

  • 授予单位 The Ohio State University.;
  • 学科 Biology Biostatistics.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 132 p.
  • 总页数 132
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号