...
【24h】

Estimation of false discovery rates in multiple testing: application to gene microarray data.

机译:多重测试中错误发现率的估计:应用于基因芯片数据。

获取原文
获取原文并翻译 | 示例
           

摘要

Testing for significance with gene expression data from DNA microarray experiments involves simultaneous comparisons of hundreds or thousands of genes. If R denotes the number of rejections (declared significant genes) and V denotes the number of false rejections, then V/R, if R > 0, is the proportion of false rejected hypotheses. This paper proposes a model for the distribution of the number of rejections and the conditional distribution of V given R, V / R. Under the independence assumption, the distribution of R is a convolution of two binomials and the distribution of V / R has a noncentral hypergeometric distribution. Under an equicorrelated model, the distributions are more complex and are also derived. Five false discovery rate probability error measures are considered: FDR = E(V/R), pFDR = E(V/R / R > 0) (positive FDR), cFDR = E(V/R / R = r) (conditional FDR), mFDR = E(V)/E(R) (marginal FDR), and eFDR = E(V)/r (empirical FDR). The pFDR, cFDR, and mFDR are shown to be equivalent under the Bayesian framework, in which the number of true null hypotheses is modeled as a random variable. We present a parametric and a bootstrap procedure to estimate the FDRs. Monte Carlo simulations were conducted to evaluate the performance of these two methods. The bootstrap procedure appears to perform reasonably well, even when the alternative hypotheses are correlated (rho = .25). An example from a toxicogenomic microarray experiment is presented for illustration.
机译:使用来自DNA微阵列实验的基因表达数据测试重要性,涉及同时比较数百或数千个基因。如果R表示拒绝(声明的重要基因)的数量,V表示错误拒绝的数量,则如果R> 0,则V / R是错误拒绝假设的比例。本文提出了一个给定R,V / R的拒绝数分布和V的条件分布模型。在独立性假设下,R的分布是两个二项式的卷积,而V / R的分布具有非中心超几何分布。在等相关模型下,分布更加复杂并且也被推导。考虑了五个错误发现率概率误差度量:FDR = E(V / R),pFDR = E(V / R / R> 0)(正FDR),cFDR = E(V / R / R = r)(有条件的) FDR),mFDR = E(V)/ E(R)(边际FDR)和eFDR = E(V)/ r(经验FDR)。在贝叶斯框架下,pFDR,cFDR和mFDR被证明是等效的,其中真实零假设的数量被建模为随机变量。我们提出一个参数和引导程序来估计FDR。进行了蒙特卡洛模拟,以评估这两种方法的性能。引导程序似乎表现得相当不错,即使其他假设相互关联(rho = .25)。提出了毒理基因组微阵列实验的一个例子用于说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号