首页> 外文期刊>Journal of chemical information and modeling >Ligand Prediction for Orphan Targets Using Support Vector Machines and Various Target-Ligand Kernels Is Dominated by Nearest Neighbor Effects
【24h】

Ligand Prediction for Orphan Targets Using Support Vector Machines and Various Target-Ligand Kernels Is Dominated by Nearest Neighbor Effects

机译:使用支持向量机和各种目标配体核对孤儿目标的配体预测受最近邻效应的支配

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Support vector machine (SVM) calculations combining protein and small molecule information have been applied to identify ligands for simulated orphan targets (i.e., targets for which no ligands were available). The combination of protein and ligand information was facilitated through the design of target-ligand kernel functions that account for pairwise ligand and target similarity. The design and biological information content of such kernel functions was expected to play a major role for target-directed ligand prediction. Therefore, a variety of target-ligand kernels were implemented to capture different types of target information including sequence, secondary structure, tertiary structure, biophysical properties, ontologies, or structural taxonomy. These kernels were tested in ligand predictions for simulated orphan targets in two target protein systems characterized by the presence of different inter-target relationships. Surprisingly, although there were target- and set-specific differences in prediction rates for alternative target-ligand kernels, the performance of these kernels was overall similar and also similar to SVM linear combinations. Test calculations designed to better understand possible reasons for these observations revealed that ligand information provided by nearest neighbors of orphan targets significantly influenced SVM performance, much more so than the inclusion of protein information. As long as ligands of closely related neighbors of orphan targets were available for SVM learning, orphan target ligands could be well predicted, regardless of the type and sophistication of the kernel function that was used. These findings suggest simplified strategies for SVM-based ligand prediction for orphan targets.
机译:已将结合蛋白质和小分子信息的支持向量机(SVM)计算用于识别模拟孤立目标(即没有可用配体的目标)的配体。通过设计成对配体和靶标相似性的靶标-配体核功能来促进蛋白质和配体信息的组合。预期此类核功能的设计和生物学信息内容将在靶标定向配体预测中发挥重要作用。因此,实施了多种目标配体内核以捕获不同类型的目标信息,包括序列,二级结构,三级结构,生物物理特性,本体论或结构分类学。这些核仁在两个目标蛋白系统中以模拟的孤立目标为目标,在配体预测中进行了测试,其特征是存在不同的目标间关系。出人意料的是,尽管替代目标配体核的预测速率存在目标和组特异性差异,但这些核的性能总体上相似,也与SVM线性组合相似。为了更好地理解这些发现的可能原因而进行的测试计算表明,孤儿靶标的最邻近邻居提供的配体信息显着影响了SVM的性能,远不如包含蛋白质信息。只要与孤儿目标密切相关的邻居的配体可用于SVM学习,就可以很好地预测出孤儿目标配体,而与所使用的内核功能的类型和复杂程度无关。这些发现表明了用于孤儿靶标的基于SVM的配体预测的简化策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号