首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Extended Graph-Based Models for Enhanced Similarity Search in Cavbase
【24h】

Extended Graph-Based Models for Enhanced Similarity Search in Cavbase

机译:在Cavbase中增强基于图的模型以增强相似性搜索

获取原文
获取原文并翻译 | 示例

摘要

To calculate similarities between molecular structures, measures based on the maximum common subgraph are frequently applied. For the comparison of protein binding sites, these measures are not fully appropriate since graphs representing binding sites on a detailed atomic level tend to get very large. In combination with an NP-hard problem, a large graph leads to a computationally demanding task. Therefore, for the comparison of binding sites, a less detailed coarse graph model is used building upon so-called pseudocenters. Consistently, a loss of structural data is caused since many atoms are discarded and no information about the shape of the binding site is considered. This is usually resolved by performing subsequent calculations based on additional information. These steps are usually quite expensive, making the whole approach very slow. The main drawback of a graph-based model solely based on pseudocenters, however, is the loss of information about the shape of the protein surface. In this study, we propose a novel and efficient modeling formalism that does not increase the size of the graph model compared to the original approach, but leads to graphs containing considerably more information assigned to the nodes. More specifically, additional descriptors considering surface characteristics are extracted from the local surface and attributed to the pseudocenters stored in Cavbase. These properties are evaluated as additional node labels, which lead to a gain of information and allow for much faster but still very accurate comparisons between different structures.
机译:为了计算分子结构之间的相似性,经常使用基于最大共有子图的度量。对于蛋白质结合位点的比较,由于在详细原子水平上表示结合位点的图可能会变得非常大,因此这些措施并不完全合适。结合NP困难问题,大图导致计算量大的任务。因此,为了比较结合位点,在所谓的伪中心的基础上使用了较不详细的粗图模型。一致地,由于许多原子被丢弃并且没有考虑关于结合位点的形状的信息,导致结构数据的损失。通常可以通过根据其他信息执行后续计算来解决此问题。这些步骤通常很昂贵,使整个方法非常缓慢。但是,仅基于伪中心的基于图的模型的主要缺点是丢失有关蛋白质表面形状的信息。在这项研究中,我们提出了一种新颖有效的建模形式主义,与原始方法相比不会增加图模型的大小,但是会导致图包含分配给节点的更多信息。更具体地说,从局部表面提取考虑表面特征的附加描述符,并将其归因于存储在Cavbase中的伪中心。这些属性被评估为附加的节点标签,从而获得了更多信息,并允许在不同结构之间进行更快但仍然非常准确的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号