...
首页> 外文期刊>Bioinformatics >A maximum common substructure-based algorithm for searching and predicting drug-like compounds.
【24h】

A maximum common substructure-based algorithm for searching and predicting drug-like compounds.

机译:基于最大通用子结构的算法,用于搜索和预测类药物化合物。

获取原文
获取原文并翻译 | 示例

摘要

The prediction of biologically active compounds is of great importance for high-throughput screening (HTS) approaches in drug discovery and chemical genomics. Many computational methods in this area focus on measuring the structural similarities between chemical structures. However, traditional similarity measures are often too rigid or consider only global similarities between structures. The maximum common substructure (MCS) approach provides a more promising and flexible alternative for predicting bioactive compounds. RESULTS: In this article, a new backtracking algorithm for MCS is proposed and compared to global similarity measurements. Our algorithm provides high flexibility in the matching process, and it is very efficient in identifying local structural similarities. To predict and cluster biologically active compounds more efficiently, the concept of basis compounds is proposed that enables researchers to easily combine the MCS-based and traditional similarity measures with modern machine learning techniques. Support vector machines (SVMs) are used to test how the MCS-based similarity measure and the basis compound vectorization method perform on two empirically tested datasets. The test results show that MCS complements the well-known atom pair descriptor-based similarity measure. By combining these two measures, our SVM-based model predicts the biological activities of chemical compounds with higher specificity and sensitivity. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
机译:生物活性化合物的预测对于药物发现和化学基因组学中的高通量筛选(HTS)方法非常重要。该领域中的许多计算方法集中于测量化学结构之间的结构相似性。但是,传统的相似性度量通常过于僵化,或者仅考虑结构之间的全局相似性。最大通用子结构(MCS)方法为预测生物活性化合物提供了一种更有希望和灵活的替代方法。结果:在本文中,提出了一种新的MCS回溯算法,并将其与全局相似性度量进行了比较。我们的算法在匹配过程中提供了高度的灵活性,并且在识别局部结构相似性方面非常有效。为了更有效地预测和聚集生物活性化合物,提出了基础化合物的概念,使研究人员可以轻松地将基于MCS的和传统的相似性度量与现代机器学习技术相结合。支持向量机(SVM)用于测试基于MCS的相似性度量和基础复合矢量化方法在两个经过经验测试的数据集上的性能。测试结果表明,MCS补充了众所周知的基于原子对描述符的相似性度量。通过结合这两种措施,我们基于SVM的模型可以预测具有更高特异性和敏感性的化合物的生物活性。补充信息:补充数据可从Bioinformatics在线获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号