首页> 外文期刊>Journal of Bioinformatics and Computational Biology >COMPARING PEARSON, SPEARMAN AND HOEFFDING'S D MEASURE FOR GENE EXPRESSION ASSOCIATION ANALYSIS
【24h】

COMPARING PEARSON, SPEARMAN AND HOEFFDING'S D MEASURE FOR GENE EXPRESSION ASSOCIATION ANALYSIS

机译:培生,皮尔曼和霍夫丁的基因表达关联分析的D量度的比较

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

DNA microarrays have become a powerful tool to describe gene expression profiles associated with different cellular states, various phenotypes and responses to drugs and other extra- or intra-cellular perturbations. In order to cluster co-expressed genes and/or to construct regulatory networks, definition of distance or similarity between measured gene expression data is usually required, the most common choices being Pearson's and Spearman's correlations. Here, we evaluate these two methods and also compare them with a third one, namely Hoeffding's D measure, which is used to infer nonlinear and non-monotonic associations, i.e. independence in a general sense. By comparing three different variable association approaches, namely Pearson's correlation, Spearman's cor_relation and Hoeffding's D measure, we aimed at assessing the most approppriate one for each purpose. Using simulations, we demonstrate that the Hoeffding's D measure outperforms Pearson's and Spearman's approaches in identifying nonlinear associations. Our results demonstrate that Hoeffding's D measure is less sensitive to outliers and is a more powerful tool to identify nonlinear and non-monotonic associations. We have also applied Hoeffding's D measure in order to identify new putative genes associated with tp53. Therefore, we propose the Hoeffding's D measure to identify nonlinear associations between gene expression profiles.
机译:DNA微阵列已成为描述与不同细胞状态,各种表型和对药物的反应以及其他细胞外或细胞内扰动相关的基因表达谱的有力工具。为了聚类共表达的基因和/或构建调控网络,通常需要定义被测基因表达数据之间的距离或相似性,最常见的选择是皮尔逊氏和斯皮尔曼氏相关。在这里,我们评估这两种方法,并将它们与第三种方法进行比较,即Hoeffding的D度量,该度量用于推断非线性和非单调关联,即一般意义上的独立性。通过比较三种不同的变量关联方法,即Pearson相关,Spearman相关和Hoeffding D测度,我们旨在评估每种目的最合适的一种。通过仿真,我们证明了霍夫丁D度量在识别非线性关联方面优于Pearson和Spearman的方法。我们的结果表明,霍夫丁的D量度对异常值较不敏感,是识别非线性和非单调关联的更强大工具。我们还应用了霍夫丁氏D量度,以鉴定与tp53相关的新推定基因。因此,我们提出了Hoeffding的D度量来识别基因表达谱之间的非线性关联。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号