首页> 外文期刊>BMC Systems Biology >Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
【24h】

Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes

机译:基于中心性的途径富集:寻找关键基因主导的重要途径的系统方法

获取原文
           

摘要

Background Biological pathways are important for understanding biological mechanisms. Thus, finding important pathways that underlie biological problems helps researchers to focus on the most relevant sets of genes. Pathways resemble networks with complicated structures, but most of the existing pathway enrichment tools ignore topological information embedded within pathways, which limits their applicability. Results A systematic and extensible pathway enrichment method in which nodes are weighted by network centrality was proposed. We demonstrate how choice of pathway structure and centrality measurement, as well as the presence of key genes, affects pathway significance. We emphasize two improvements of our method over current methods. First, allowing for the diversity of genes’ characters and the difficulty of covering gene importance from all aspects, we set centrality as an optional parameter in the model. Second, nodes rather than genes form the basic unit of pathways, such that one node can be composed of several genes and one gene may reside in different nodes. By comparing our methodology to the original enrichment method using both simulation data and real-world data, we demonstrate the efficacy of our method in finding new pathways from biological perspective. Conclusions Our method can benefit the systematic analysis of biological pathways and help to extract more meaningful information from gene expression data. The algorithm has been implemented as an R package CePa, and also a web-based version of CePa is provided.
机译:背景技术生物学途径对于理解生物学机制很重要。因此,找到生物学问题根源的重要途径有助于研究人员专注于最相关的基因组。路径类似于具有复杂结构的网络,但是大多数现有的路径扩充工具都忽略了嵌入在路径中的拓扑信息,这限制了它们的适用性。结果提出了一种系统的,可扩展的路径富集方法,该方法通过网络中心性对节点进行加权。我们演示了如何选择途径结构和中心性测量以及关键基因的存在如何影响途径的意义。我们强调我们的方法相对于当前方法的两个改进。首先,考虑到基因特征的多样性以及从各个方面难以涵盖基因重要性的问题,我们将中心性设置为模型中的可选参数。其次,结点而不是基因形成途径的基本单位,因此一个结点可以由数个基因组成,而一个基因可以位于不同的结点。通过将我们的方法与使用模拟数据和真实世界数据的原始富集方法进行比较,我们证明了从生物学角度寻找新途径的方法的有效性。结论我们的方法可以有益于生物学途径的系统分析,并有助于从基因表达数据中提取更有意义的信息。该算法已实现为R包CePa,并且还提供了基于Web的CePa版本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号