首页> 美国卫生研究院文献>Genes >K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks
【2h】

K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks

机译:k模块算法:提高WGCNA共表达网络的聚类结果的额外步骤

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Among biological networks, co-expression networks have been widely studied. One of the most commonly used pipelines for the construction of co-expression networks is weighted gene co-expression network analysis (WGCNA), which can identify highly co-expressed clusters of genes (modules). WGCNA identifies gene modules using hierarchical clustering. The major drawback of hierarchical clustering is that once two objects are clustered together, it cannot be reversed; thus, re-adjustment of the unbefitting decision is impossible. In this paper, we calculate the similarity matrix with the distance correlation for WGCNA to construct a gene co-expression network, and present a new approach called the k-module algorithm to improve the WGCNA clustering results. This method can assign all genes to the module with the highest mean connectivity with these genes. This algorithm re-adjusts the results of hierarchical clustering while retaining the advantages of the dynamic tree cut method. The validity of the algorithm is verified using six datasets from microarray and RNA-seq data. The k-module algorithm has fewer iterations, which leads to lower complexity. We verify that the gene modules obtained by the k-module algorithm have high enrichment scores and strong stability. Our method improves upon hierarchical clustering, and can be applied to general clustering algorithms based on the similarity matrix, not limited to gene co-expression network analysis.
机译:在生物网络中,共表达网络已被广泛研究。用于构建共表达网络的最常用管道之一是加权基因共表达网络分析(WGCNA),其可以识别高度共同表达的基因簇(模块)。 WGCNA使用分层聚类识别基因模块。分层聚类的主要缺点是,一旦两个对象聚集在一起,它就无法颠倒;因此,不可能重新调整不融合的决定。在本文中,我们计算了WGCNA的距离相关性的相似性矩阵构建基因共表达网络,并提出了一种称为K模块算法的新方法,以提高WGCNA聚类结果。该方法可以将所有基因分配给模块,其与这些基因最高的平均连接。该算法重新调整分层聚类的结果,同时保留动态树木切割方法的优点。使用来自微阵列和RNA-SEQ数据的六个数据集来验证算法的有效性。 K模块算法较少的迭代次数导致复杂性较低。我们验证了通过K模块算法获得的基因模块具有高浓缩评分和强的稳定性。我们的方法改善了层次聚类,并且可以基于相似性矩阵应用于一般聚类算法,不限于基因共表达网络分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号