Genome Research (U.S. National Institutes of Health literature)

An Efficient and Robust Statistical Modeling Approach to Discover Differentially Expressed Genes Using Genomic Expression Profiles

Abstract

We have developed a statistical regression modeling approach to discover genes that are differentially expressed between two predefined sample groups in DNA microarray experiments. Our model is based on well-defined assumptions, uses rigorous and well-characterized statistical measures, and accounts for the heterogeneity and genomic complexity of the data. In contrast to cluster analysis, which attempts to define groups of genes and/or samples that share common overall expression profiles, our modeling approach uses known sample group membership to focus on expression profiles of individual genes in a sensitive and robust manner. Further, this approach can be used to test statistical hypotheses about gene expression. To demonstrate this methodology, we compared the expression profiles of 11 acute myeloid leukemia (AML) and 27 acute lymphoblastic leukemia (ALL) samples from a previous study () and found 141 genes differentially expressed between AML and ALL with a 1% significance at the genomic level. Using this modeling approach to compare different sample groups within the AML samples, we identified a group of genes whose expression profiles correlated with that of thrombopoietin and found that genes whose expression associated with AML treatment outcome lie in recurrent chromosomal locations. Our results are compared with those obtained using t-tests or Wilcoxon rank sum statistics.
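
The abstract does not spell out the regression model, so the following is only a minimal sketch of the general idea: regress each gene's expression on a 0/1 group indicator and flag genes whose group effect is significant at a genome-wide level. The function names (`slope_z`, `significant_genes`), the ordinary least squares fit, the normal approximation for p-values, and the Bonferroni correction are all assumptions made for illustration; they are not the authors' method.

```python
import math
from statistics import NormalDist


def slope_z(expr, group):
    """Fit expr = a + b * group by ordinary least squares.

    Returns (b, z), where z = b / SE(b). `group` holds 0/1 labels.
    Hypothetical helper, not taken from the paper.
    """
    n = len(expr)
    mean_g = sum(group) / n
    mean_y = sum(expr) / n
    sxx = sum((g - mean_g) ** 2 for g in group)
    sxy = sum((g - mean_g) * (y - mean_y) for g, y in zip(group, expr))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_g
    # Residual sum of squares, then the standard error of the slope.
    rss = sum((y - intercept - slope * g) ** 2 for g, y in zip(group, expr))
    se = math.sqrt(rss / (n - 2) / sxx)
    return slope, slope / se


def significant_genes(profiles, group, genome_alpha=0.01):
    """Return genes whose group effect passes a Bonferroni-adjusted threshold.

    Assumption: two-sided p-values from a normal approximation, and a
    per-gene threshold of genome_alpha / (number of genes tested).
    """
    per_gene_alpha = genome_alpha / len(profiles)
    hits = []
    for name, expr in profiles.items():
        _, z = slope_z(expr, group)
        p = 2 * (1 - NormalDist().cdf(abs(z)))
        if p < per_gene_alpha:
            hits.append(name)
    return hits


# Synthetic toy data (not from the study): 4 samples per group.
group = [0, 0, 0, 0, 1, 1, 1, 1]
profiles = {
    "gene_a": [1.0, 1.2, 0.9, 1.1, 5.0, 5.2, 4.9, 5.1],  # clear group shift
    "gene_b": [1.0, 2.0, 1.5, 2.5, 1.2, 2.2, 1.4, 2.4],  # no real shift
}
print(significant_genes(profiles, group))
```

With a 0/1 indicator, the fitted slope is simply the difference between the two group means, which is why a regression framing of this kind can be compared directly with a t-test. The normal approximation is crude for small samples, and a robust approach such as the one the abstract describes would presumably handle heterogeneity more carefully than this sketch does.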
