A maximum common substructure-based algorithm for searching and predicting drug-like compounds.

Cao Y; Jiang T; Girke T


Abstract

The prediction of biologically active compounds is of great importance for high-throughput screening (HTS) approaches in drug discovery and chemical genomics. Many computational methods in this area focus on measuring the structural similarities between chemical structures. However, traditional similarity measures are often too rigid or consider only global similarities between structures. The maximum common substructure (MCS) approach provides a more promising and flexible alternative for predicting bioactive compounds.

RESULTS: In this article, a new backtracking algorithm for MCS is proposed and compared to global similarity measurements. Our algorithm provides high flexibility in the matching process, and it is very efficient in identifying local structural similarities. To predict and cluster biologically active compounds more efficiently, the concept of basis compounds is proposed that enables researchers to easily combine the MCS-based and traditional similarity measures with modern machine learning techniques. Support vector machines (SVMs) are used to test how the MCS-based similarity measure and the basis compound vectorization method perform on two empirically tested datasets. The test results show that MCS complements the well-known atom pair descriptor-based similarity measure. By combining these two measures, our SVM-based model predicts the biological activities of chemical compounds with higher specificity and sensitivity.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
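
To make the two central ideas of the abstract concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes molecules are given as small labeled graphs of heavy atoms, uses a naive backtracking search for a maximum common induced subgraph (which is not required to be connected and allows no mismatches), and scores similarity with a Tanimoto-style ratio of atom counts. The helper names (mol, mcs_size, mcs_similarity, vectorize) and the example molecules are hypothetical and chosen only for illustration.

```python
def mol(elements, bonds):
    """Build a toy molecule: ({atom index: element}, {frozenset bond pairs})."""
    return dict(enumerate(elements)), {frozenset(b) for b in bonds}


def mcs_size(mol_a, mol_b):
    """Atom count of a maximum common induced subgraph, by naive backtracking.

    Simplifying assumptions: atoms match only on element symbol, bond types
    are ignored, and the common subgraph may be disconnected.
    """
    atoms_a, bonds_a = mol_a
    atoms_b, bonds_b = mol_b
    order = list(atoms_a)
    best = 0

    def bonded(bonds, u, v):
        return frozenset((u, v)) in bonds

    def extend(i, mapping):
        nonlocal best
        best = max(best, len(mapping))
        # Stop when no atoms remain or the branch cannot beat the best so far.
        if i == len(order) or len(mapping) + (len(order) - i) <= best:
            return
        a = order[i]
        used = set(mapping.values())
        for b in atoms_b:
            if b in used or atoms_a[a] != atoms_b[b]:
                continue
            # The new pair must agree on bonding with every pair mapped so far.
            if all(bonded(bonds_a, a, a2) == bonded(bonds_b, b, b2)
                   for a2, b2 in mapping.items()):
                mapping[a] = b
                extend(i + 1, mapping)
                del mapping[a]  # backtrack
        extend(i + 1, mapping)  # also try leaving atom a unmatched

    extend(0, {})
    return best


def mcs_similarity(mol_a, mol_b):
    """Tanimoto-style score: shared atoms / (atoms in a + atoms in b - shared)."""
    shared = mcs_size(mol_a, mol_b)
    return shared / (len(mol_a[0]) + len(mol_b[0]) - shared)


def vectorize(compound, basis):
    """Describe a compound by its similarity to each basis compound."""
    return [mcs_similarity(compound, b) for b in basis]


# Heavy-atom skeletons of three small molecules (illustrative data only).
ethanol = mol(["C", "C", "O"], [(0, 1), (1, 2)])
propane = mol(["C", "C", "C"], [(0, 1), (1, 2)])
dimethyl_ether = mol(["C", "O", "C"], [(0, 1), (1, 2)])

basis = [ethanol, propane]
features = vectorize(dimethyl_ether, basis)
```

The resulting fixed-length vectors, such as features above, could then be passed to any standard classifier, for example an SVM, which is the role the abstract assigns to the basis compound vectorization. The exhaustive search is exponential in the worst case and is only practical for very small graphs.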


Bibliographic record

Source: Bioinformatics, 2008, issue 13, 9 pages
Authors: Cao Y; Jiang T; Girke T
Original format: PDF
Language: English
Chinese Library Classification: Biological sciences; Bioengineering (biotechnology)
