首页> 美国卫生研究院文献>Journal of Computational Biology >Detection of Complexes in Biological Networks Through Diversified Dense Subgraph Mining
【2h】

Detection of Complexes in Biological Networks Through Diversified Dense Subgraph Mining

机译:多样性密集子图挖掘在生物网络中的复合物检测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Protein–protein interaction (PPI) networks, providing a comprehensive landscape of protein interaction patterns, enable us to explore biological processes and cellular components at multiple resolutions. For a biological process, a number of proteins need to work together to perform a job. Proteins densely interact with each other, forming large molecular machines or cellular building blocks. Identification of such densely interconnected clusters or protein complexes from PPI networks enables us to obtain a better understanding of the hierarchy and organization of biological processes and cellular components. However, most existing graph clustering algorithms on PPI networks often cannot effectively detect densely connected subgraphs and overlapped subgraphs. In this article, we formulate the problem of complex detection as diversified dense subgraph mining and introduce a novel approximation algorithm to efficiently enumerate putative protein complexes from biological networks. The key insight of our algorithm is that instead of enumerating all dense subgraphs, we only need to find a small diverse subset of subgraphs that cover as many proteins as possible. The problem is modeled as finding a diverse set of maximal dense subgraphs where we develop highly effective pruning techniques to guarantee efficiency. To scale up to large networks, we devise a divide-and-conquer approach to speed up the algorithm in a distributed manner. By comparing with existing clustering and dense subgraph-based algorithms on several yeast and human PPI networks, we demonstrate that our method can detect more putative protein complexes and achieves better prediction accuracy.*
机译:>蛋白质-蛋白质相互作用(PPI)网络提供了蛋白质相互作用模式的全面概况,使我们能够以多种分辨率探索生物学过程和细胞成分。对于生物学过程,许多蛋白质需要协同工作才能完成工作。蛋白质彼此紧密地相互作用,形成大分子机器或细胞构件。从PPI网络中识别出这种紧密互连的簇或蛋白质复合物,使我们能够更好地了解生物过程和细胞成分的层次和组织。但是,PPI网络上大多数现有的图聚类算法通常无法有效检测密集连接的子图和重叠的子图。在本文中,我们将复杂检测问题表述为多样化的密集子图挖掘,并介绍了一种新颖的近似算法,可从生物网络中有效枚举假定的蛋白质复合物。我们算法的关键见解是,无需枚举所有密集的子图,我们只需要找到一个小的子图子集即可覆盖尽可能多的蛋白质。该问题被建模为找到一组多样化的最大密集子图,在此我们开发了高效的修剪技术以保证效率。为了扩展到大型网络,我们设计了分而治之的方法来以分布式方式加速算法。通过与几种酵母和人类PPI网络上现有的基于聚类和基于密集子图的算法进行比较,我们证明了我们的方法可以检测更多推定的蛋白质复合物,并获得更好的预测准确性。 *

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号