...
首页> 外文期刊>Journal of Cheminformatics >QSAR-derived affinity fingerprints (part 1): fingerprint construction and modeling performance for similarity searching, bioactivity classification and scaffold hopping
【24h】

QSAR-derived affinity fingerprints (part 1): fingerprint construction and modeling performance for similarity searching, bioactivity classification and scaffold hopping

机译:QSAR衍生的亲和指纹(第1部分):相似性搜索,生物活性分类和脚手架跳跃的指纹施工和建模性能

获取原文
           

摘要

An affinity fingerprint is the vector consisting of compound’s affinity or potency against the reference panel of protein targets. Here, we present the QAFFP fingerprint, 440 elements long in silico QSAR-based affinity fingerprint, components of which are predicted by Random Forest regression models trained on bioactivity data from the ChEMBL database. Both real-valued (rv-QAFFP) and binary (b-QAFFP) versions of the QAFFP fingerprint were implemented and their performance in similarity searching, biological activity classification and scaffold hopping was assessed and compared to that of the 1024 bits long Morgan2 fingerprint (the RDKit implementation of the ECFP4 fingerprint). In both similarity searching and biological activity classification, the QAFFP fingerprint yields retrieval rates, measured by AUC (~?0.65 and ~?0.70 for similarity searching depending on data sets, and ~?0.85 for classification) and EF5 (~?4.67 and ~?5.82 for similarity searching depending on data sets, and ~?2.10 for classification), comparable to that of the Morgan2 fingerprint (similarity searching AUC of ~?0.57 and ~?0.66, and EF5 of ~?4.09 and ~?6.41, depending on data sets, classification AUC of ~?0.87, and EF5 of ~?2.16). However, the QAFFP fingerprint outperforms the Morgan2 fingerprint in scaffold hopping as it is able to retrieve 1146 out of existing 1749 scaffolds, while the Morgan2 fingerprint reveals only 864 scaffolds.
机译:亲和指纹是由化合物的亲和力或效力与蛋白质靶标的参考板组成的载体。在这里,我们介绍了基于Silico QSAR的亲和指纹的QAFFP指纹,440元元素,其组成部分被从ChemBL数据库从生物活性数据训练的随机森林回归模型预测。 QAFFPP指纹的实值(RV-QAFFP)和二进制(B-QAFFP)版本得到实施,并评估其在相似性搜索中的性能,并评估生物活动分类和脚手架跳跃,并与1024位LONG MORGAN2指纹( ECFP4指纹的RDKIT实现)。在相似性搜索和生物活动分类中,QAFFP指纹会产生由AUC测量的检索率(〜?0.65和〜0.70,根据数据集进行相似性搜索,以及分类的0.85)和EF5(〜?4.67和〜 ?5.82根据数据集的相似性搜索,〜?2.10进行分类),与Morgan2指纹的〜〜2.10相当(相似性搜索AUC的AUC〜0.57和〜0.66,以及〜2.09和〜?6.41,取决于在数据集中,分类AUC的〜?0.87和EF5的〜?2.16)。然而,QAFFP指纹占据了脚手架跳跃中的Morgan2指纹,因为它能够从现有的1749个支架中取出1146个,而Morgan2指纹仅显示864个脚手架。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号