首页> 美国卫生研究院文献>Bioinformatics >SPICi: a fast clustering algorithm for large biological networks
【2h】

SPICi: a fast clustering algorithm for large biological networks

机译:SPICi:大型生物网络的快速聚类算法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: Clustering algorithms play an important role in the analysis of biological networks, and can be used to uncover functional modules and obtain hints about cellular organization. While most available clustering algorithms work well on biological networks of moderate size, such as the yeast protein physical interaction network, they either fail or are too slow in practice for larger networks, such as functional networks for higher eukaryotes. Since an increasing number of larger biological networks are being determined, the limitations of current clustering approaches curtail the types of biological network analyses that can be performed.>Results: We present a fast local network clustering algorithm SPICi. SPICi runs in time O(V log V+E) and space O(E), where V and E are the number of vertices and edges in the network, respectively. We evaluate SPICi's performance on several existing protein interaction networks of varying size, and compare SPICi to nine previous approaches for clustering biological networks. We show that SPICi is typically several orders of magnitude faster than previous approaches and is the only one that can successfully cluster all test networks within very short time. We demonstrate that SPICi has state-of-the-art performance with respect to the quality of the clusters it uncovers, as judged by its ability to recapitulate protein complexes and functional modules. Finally, we demonstrate the power of our fast network clustering algorithm by applying SPICi across hundreds of large context-specific human networks, and identifying modules specific for single conditions.>Availability: Source code is available under the GNU Public License at >Contact: >Supplementary information: are available at Bioinformatics online.
机译:>动机:聚类算法在生物网络分析中起着重要作用,可用于发现功能模块并获得有关细胞组织的提示。尽管大多数可用的聚类算法在中等大小的生物网络(例如酵母蛋白质物理相互作用网络)上都可以很好地工作,但是对于大型网络(例如用于高级真核生物的功能网络),它们在实践中要么失败要么太慢。由于正在确定越来越多的大型生物网络,因此当前聚类方法的局限性限制了可以执行的生物网络分析的类型。>结果:我们提出了一种快速的本地网络聚类算法SPICi。 SPICi在时间O(V log V + E)和空间O(E)中运行,其中V和E分别是网络中顶点和边的数量。我们评估了SPICi在几个现有的大小不同的蛋白质相互作用网络上的性能,并将SPICi与以前的九种生物网络聚类方法进行了比较。我们证明,SPICi通常比以前的方法快几个数量级,并且是唯一可以在很短的时间内成功集群所有测试网络的方法。我们证明了SPICi在发现的簇的质量方面具有最先进的性能,这可以通过其概括蛋白质复合物和功能模块的能力来判断。最后,我们通过将SPICi应用于数百个大型的特定于上下文的人为网络,并识别特定于单个条件的模块,来演示快速网络群集算法的功能。>可用性:源代码可在GNU Public下获得。 >联系方式: >补充信息:上的许可证可从在线生物信息学获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号