首页> 外文期刊>IEEE/ACM transactions on computational biology and bioinformatics >Extending the Applicability of Graphlets to Directed Networks
【24h】

Extending the Applicability of Graphlets to Directed Networks

机译:将Graphlet的适用性扩展到有向网络

获取原文
获取原文并翻译 | 示例
       

摘要

With recent advances in high-throughput cell biology, the amount of cellular biological data has grown drastically. Such data is often modeled as graphs (also called networks) and studying them can lead to new insights into molecule-level organization. A possible way to understand their structure is by analyzing the smaller components that constitute them, namely network motifs and graphlets. Graphlets are particularly well suited to compare networks and to assess their level of similarity due to the rich topological information that they offer but are almost always used as small undirected graphs of up to five nodes, thus limiting their applicability in directed networks. However, a large set of interesting biological networks such as metabolic, cell signaling, ortranscriptional regulatory networks are intrinsically directional, and using metrics that ignore edge direction may gravely hinder information extraction. Our main purpose in this work is to extend the applicability of graphlets to directed networks by considering their edge direction, thus providing a powerful basis for the analysis of directed biological networks. We tested our approach on two network sets, one composed of synthetic graphs and another of real directed biological networks, and verified that they were more accurately grouped using directed graphlets than undirected graphlets. It is also evident that directed graphlets offer substantially more topological information than simple graph metrics such as degree distribution or reciprocity. However, enumerating graphlets in large networks is a computationally demanding task. Our implementation addresses this concern by using a state-of-the-art data structure, the g-trie, which is able to greatly reduce the necessary computation. We compared our tool to other state-of-the art methods and verified that it is the fastest general tool for graphlet counting.
机译:随着高通量细胞生物学的最新进展,细胞生物学数据的数量急剧增加。此类数据通常被建模为图形(也称为网络),对其进行研究可以为分子级组织带来新的见解。理解其结构的一种可能方法是分析构成它们的较小组件,即网络主题和图形小组件。由于小图提供的丰富的拓扑信息,它们特别适合比较网络并评估它们的相似度,但几乎总是用作最多五个节点的小型无向图,因此限制了它们在有向网络中的适用性。但是,大量有趣的生物网络(例如代谢,细胞信号传导或转录调控网络)本质上是定向的,并且使用忽略边缘方向的度量可能严重阻碍信息提取。我们这项工作的主要目的是通过考虑图的边缘方向,将小图的适用性扩展到有向网络,从而为有向生物网络的分析提供有力的基础。我们在两个网络集上测试了我们的方法,其中一个由合成图组成,另一个由真实的有向生物网络组成,并验证了使用有向图小图比非有向图小图可以更准确地对它们进行分组。同样很明显,有向图小图提供的拓扑信息比简单的图指标(例如度分布或互易性)要多得多。但是,枚举大型网络中的图小图是一项计算量很大的任务。我们的实现通过使用最新的数据结构g-trie解决了这一问题,该结构可以大大减少必要的计算。我们将我们的工具与其他最先进的方法进行了比较,并验证了它是用于进行图形计数的最快的通用工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号