...
首页> 外文期刊>BMC Bioinformatics >Assessment of weighted topological overlap (wTO) to improve fidelity of gene co-expression networks
【24h】

Assessment of weighted topological overlap (wTO) to improve fidelity of gene co-expression networks

机译:评估加权拓扑重叠(WTO),以提高基因共表达网络的保真度

获取原文
           

摘要

For more than a decade, gene expression data sets have been used as basis for the construction of co-expression networks used in systems biology investigations, leading to many important discoveries in a wide range of subjects spanning human disease to evolution and the development of organisms. A commonly encountered challenge in such investigations is first that of detecting, then subsequently removing, spurious correlations (i.e. links) in these networks. While access to a large number of measurements per gene would reduce this problem, often only a small number of measurements are available. The weighted Topological Overlap (wTO) measure, which incorporates information from the shared network-neighborhood of a given gene-pair into a single score, is a metric that is frequently used with the implicit expectation of producing higher-quality networks. However, the actual extent to which wTO improves on the accuracy of a co-expression analysis has not been quantified. Here, we used a large-sample biological data set containing 338 gene-expression measurements per gene as a reference system. From these data, we generated ensembles consisting of 10, 20 and 50 randomly selected measurements to emulate low-quality data sets, finding that the wTO measure consistently generates more robust scores than what results from simple correlation calculations. Furthermore, for the data sets consisting of only 10 and 20 samples per gene, we find that wTO serves as a better predictor of the correlation scores generated from the full data set. However, we find that using wTO as a score for network building substantially alters several topographical aspects of the resulting networks, with no conclusive evidence that the resulting structure is more accurate. Importantly, we find that the much used approach of applying a soft-threshold modifier to link weights prior to computing the wTO substantially decreases the robustness of the resulting wTO network, but increases the predictive power of wTO networks with regards to the reference correlation (soft threshold) network, particularly as the size of the data sets increases. Our analysis demonstrates that, in agreement with previous assumptions, the wTO approach is capable of significantly improving the fidelity of co-expression networks, and that this effect is especially evident for cases of low-sample number gene-expression data sets.
机译:十多年来,基因表达数据集已被用作系统生物研究中使用的共表达网络的基础,从而导致跨越人类疾病的广泛科目的许多重要发现以及生物的发展。在这种调查中常见的挑战首先是在这些网络中检测,然后在这些网络中除去杂散的相关性(即链路)。虽然每种基因的大量测量值会减少这个问题,但通常只有少量测量。加权拓扑重叠(WTO)测量包含来自给定基因对的共享网络邻域的信息,是一个分数,是经常用于产生更高质量网络的隐含期望的度量。然而,WTO改善了CO-表达分析的准确性的实际程度尚未被量化。这里,我们使用了每个基因包含338个基因表达测量的大样本生物数据集作为参考系统。根据这些数据,我们生成由10,20和50随机选择的测量组成的集合,以模拟低质量数据集,发现WTO测量始终产生比简单相关计算的结果更强的刻度。此外,对于每个基因仅由10和20个样本组成的数据集,我们发现WTO是从完整数据集生成的相关分数的更好预测因子。但是,我们发现,使用WTO作为网络建筑的分数,基本上改变了所得网络的几个地形方面,没有确凿的证据表明所得结构更准确。重要的是,我们发现,在计算WTO之前将软阈值修饰符应用软阈值修饰符的使用方法大大降低了由此产生的WTO网络的鲁棒性,而是增加了关于参考相关的WTO网络的预测力量(软阈值)网络,特别是随着数据集的大小增加。我们的分析表明,与以前的假设一致,WTO方法能够显着提高共表达网络的保真度,并且对于低样本数基因表达数据集的情况尤其明显。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号