首页> 美国卫生研究院文献>Journal of Cheminformatics >SimBoost: a read-across approach for predicting drug–target binding affinities using gradient boosting machines
【2h】

SimBoost: a read-across approach for predicting drug–target binding affinities using gradient boosting machines

机译:SimBoost:一种使用梯度增强机预测药物-靶标结合亲和力的跨方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Computational prediction of the interaction between drugs and targets is a standing challenge in the field of drug discovery. A number of rather accurate predictions were reported for various binary drug–target benchmark datasets. However, a notable drawback of a binary representation of interaction data is that missing endpoints for non-interacting drug–target pairs are not differentiated from inactive cases, and that predicted levels of activity depend on pre-defined binarization thresholds. In this paper, we present a method called SimBoost that predicts continuous (non-binary) values of binding affinities of compounds and proteins and thus incorporates the whole interaction spectrum from true negative to true positive interactions. Additionally, we propose a version of the method called SimBoostQuant which computes a prediction interval in order to assess the confidence of the predicted affinity, thus defining the Applicability Domain metrics explicitly. We evaluate SimBoost and SimBoostQuant on two established drug–target interaction benchmark datasets and one new dataset that we propose to use as a benchmark for read-across cheminformatics applications. We demonstrate that our methods outperform the previously reported models across the studied datasets.
机译:在药物发现领域,药物与靶标之间相互作用的计算预测是一个长期的挑战。对于各种二元药物靶标基准数据集,报告了许多相当准确的预测。但是,交互数据的二进制表示形式的显着缺点是,非交互药物-靶对的缺失终点无法与非活性病例区分开,并且预测的活性水平取决于预定义的二值化阈值。在本文中,我们提出了一种称为SimBoost的方法,该方法可预测化合物和蛋白质的结合亲和力的连续(非二进制)值,从而将整个相互作用范围从真正的负向相互作用转变为真正的正向相互作用。另外,我们提出了一种称为SimBoostQuant的方法版本,该方法计算预测间隔以评估预测的亲和力的置信度,从而明确定义“适用性域”度量。我们在两个已建立的药物-靶标相互作用基准数据集和一个我们建议用作跨化学化学信息学应用基准的新数据集中评估SimBoost和SimBoostQuant。我们证明了我们的方法在研究的数据集上优于先前报道的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号