首页> 外文期刊>Journal of molecular graphics & modelling >A support vector machines approach for virtual screening of active compounds of single and multiple mechanisms from large libraries at an improved hit-rate and enrichment factor
【24h】

A support vector machines approach for virtual screening of active compounds of single and multiple mechanisms from large libraries at an improved hit-rate and enrichment factor

机译:一种支持向量机方法,可从大型文库中以提高的命中率和富集因子虚拟筛选单一和多种机理的活性化合物

获取原文
获取原文并翻译 | 示例
           

摘要

Support vector machines (SVM) and other machine-learning (ML) methods have been explored as ligand-based virtual screening (VS) tools for facilitating lead discovery. While exhibiting good hit selection performance, in screening large compound libraries, these methods tend to produce lower hit-rate than those of the best performing VS tools, partly because their training-sets contain limited spectrum of inactive compounds. We tested whether the performance of SVM can be improved by using training-sets of diverse inactive compounds. In retrospective database screening of active compounds of single mechanism (HIV protease inhibitors, DHFR inhibitors, dopamine antagonists) and multiple mechanisms (CNS active agents) from large libraries of 2.986 million compounds, the yields, hit-rates, and enrichment factors of our SVM models are 52.4-78.0%, 4.7-73.8%, and 214-10,543, respectively, compared to those of 62-95%, 0.65-35%, and 20-1200 by structure-based VS and 55-81%, 0.2-0.7%, and 110-795 by other ligand-based VS tools in screening libraries of >= 1 million compounds. The hit-rates are comparable and the enrichment factors are substantially better than the best results of other VS tools. 24.3-87.6% of the predicted hits are outside the known hit families. SVM appears to be potentially useful for facilitating lead discovery in VS of large compound libraries. (C) 2007 Elsevier Inc. All rights reserved.
机译:支持向量机(SVM)和其他机器学习(ML)方法已经作为基于配体的虚拟筛选(VS)工具进行了探索,以促进潜在客户的发现。尽管表现出良好的命中选择性能,但在筛选大型化合物库时,这些方法的命中率往往低于性能最佳的VS工具,这部分是因为它们的训练集包含有限范围的非活性化合物。我们测试了通过使用各种非活性化合物的训练集是否可以改善SVM的性能。从298.6万个化合物的大型文库中回顾性数据库筛选单一机理的活性化合物(HIV蛋白酶抑制剂,DHFR抑制剂,多巴胺拮抗剂)和多种机理(CNS活性剂),SVM的收率,命中率和富集因子模型分别为52.4-78.0%,4.7-73.8%和214-10,543,而基于结构的VS分别为62-95%,0.65-35%和20-1200,而55-81%,0.2- 0.7%,其他基于配体的VS工具通过110-795筛选> = 100万种化合物。命中率是可比的,并且富集因子大大优于其他VS工具的最佳结果。 24.3-87.6%的预测命中率不在已知命中家族之内。 SVM对于促进大型化合物库的VS中的潜在顾客发现似乎很有用。 (C)2007 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号