首页> 外文会议>Algorithms in bioinformatics >A Tree Based Method for the Rapid Screening of Chemical Fingerprints
【24h】

A Tree Based Method for the Rapid Screening of Chemical Fingerprints

机译:一种基于树的化学指纹图谱快速筛选方法

获取原文
获取原文并翻译 | 示例

摘要

The fingerprint of a molecule is a bitstring based on its structure, constructed such that structurally similar molecules will have similar fingerprints. Molecular fingerprints can be used in an initial phase for identifying novel drug candidates by screening large databases for molecules with fingerprints similar to a query fingerprint. In this paper, we present a method which efficiently finds all fingerprints in a database with Tanimoto coefficient to the query fingerprint above a user defined threshold. The method is based on two novel data structures for rapid screening of large databases: the kD grid and the Multibit tree. The kD grid is based on splitting the fingerprints into k shorter bitstrings and utilising these to compute bounds on the similarity of the complete bitstrings. The Multibit tree uses hierarchical clustering and similarity within each cluster to compute similar bounds. We have implemented our method and tested it on a large data set from the industry. Our experiments show that our method yields a three-fold speed-up over previous methods.
机译:分子的指纹是基于其结构的位串,其构造应使结构相似的分子具有相似的指纹。通过在大型数据库中筛选具有类似于查询指纹的指纹的分子,可以在初始阶段使用分子指纹来识别新药候选物。在本文中,我们提出了一种方法,该方法可以有效地在数据库中找到具有Tanimoto系数的所有指纹,以高于用户定义阈值的查询指纹。该方法基于两个用于快速筛选大型数据库的新颖数据结构:kD网格和Multibit树。 kD网格基于将指纹分为k个较短的位串,并利用这些指纹来计算完整位串相似度的范围。多位树使用层次聚类和每个聚类内的相似性来计算相似范围。我们已经实施了我们的方法,并在业界的大型数据集上对其进行了测试。我们的实验表明,我们的方法比以前的方法提高了三倍的速度。

著录项

  • 来源
    《Algorithms in bioinformatics》|2009年|194-205|共12页
  • 会议地点 Philadelphia PA(US);Philadelphia PA(US)
  • 作者单位

    Bioinformatics Research Center (BiRC) Aarhus University, C.F. Mollers Alle 8, DK-8000 Arhus C, Denmark;

    Bioinformatics Research Center (BiRC) Aarhus University, C.F. Mollers Alle 8, DK-8000 Arhus C, Denmark;

    Bioinformatics Research Center (BiRC) Aarhus University, C.F. Mollers Alle 8, DK-8000 Arhus C, Denmark;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物工程学(生物技术);
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号