首页> 外文期刊>Bioinformatics >Assessing the significance of consistently mis-regulated genes in cancer associated gene expression matrices
【24h】

Assessing the significance of consistently mis-regulated genes in cancer associated gene expression matrices

机译:评估持续错误调节的基因在癌症相关基因表达矩阵中的重要性

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: The simplest level of statistical analysis of cancer associated gene expression matrices is aimed at finding consistently up- or down-regulated genes within a given set of tumor samples. Considering the high level of gene expression diversity detected in cancer, one needs to assess the probability that the consistent mis-regulation of a given gene is due to chance. Furthermore, it is important to determine the required sample number that will ensure the meaningful statistical analysis of massively parallel gene expression measurements. Results: The probability of consistent mis-regulation is calculated in this paper for binarized gene expression data, using combinatorial considerations. For practical purposes, we also provide a set of accurate approximate formulas for determining the same probability in a computationally less intensive way. When the pool of mis-regulatable genes is restricted, the probability of consistent mis-regulation can be overestimated. We show, however, that this effect has little practical consequences for cancer associated gene expression measurements published in the literature. Finally, in order to aid experimental design, we have provided estimates on the required sample number that will ensure that the detected consistent mis-regulation is not due to chance. Our results suggest that less than 20 sufficiently diverse tumor samples may be enough to identify consistently mis-regulated genes in a statistically significant manner.
机译:动机:最简单的癌症相关基因表达矩阵的统计分析水平旨在在给定的一组肿瘤样本中找到一致上调或下调的基因。考虑到在癌症中检测到的高水平的基因表达多样性,需要评估给定基因的持续错误调节归因于偶然性的可能性。此外,重要的是确定所需的样本数量,以确保对大规模并行基因表达测量进行有意义的统计分析。结果:本文结合组合考虑因素,针对二进制化的基因表达数据计算了一致错误调节的可能性。出于实际目的,我们还提供了一组准确的近似公式,用于以较低的计算强度确定相同的概率。当错误调节基因库受到限制时,一致错误调节的可能性可能会被高估。但是,我们表明,这种影响对于文献中发表的癌症相关基因表达测量几乎没有实际影响。最后,为了帮助进行实验设计,我们提供了所需样本数量的估计值,以确保检测到的一致错误调节不是偶然的。我们的结果表明,少于20个足够多样化的肿瘤样本可能足以以统计学上显着的方式鉴定出持续错误调节的基因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号