首页> 外文期刊>Journal of Cheminformatics >ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning
【24h】

ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning

机译:吸毒发现的呼叫评估。 20.通过机器学习预测乳腺癌抗性蛋白质的抑制

获取原文
获取外文期刊封面目录资料

摘要

Breast cancer resistance protein (BCRP/ABCG2), an ATP-binding cassette (ABC) efflux transporter, plays a critical role in multi-drug resistance (MDR) to anti-cancer drugs and drug–drug interactions. The prediction of BCRP inhibition can facilitate evaluating potential drug resistance and drug–drug interactions in early stage of drug discovery. Here we reported a structurally diverse dataset consisting of 1098 BCRP inhibitors and 1701 non-inhibitors. Analysis of various physicochemical properties illustrates that BCRP inhibitors are more hydrophobic and aromatic than non-inhibitors. We then developed a series of quantitative structure–activity relationship (QSAR) models to discriminate between?BCRP inhibitors and non-inhibitors. The optimal feature subset was determined by a wrapper feature selection method named rfSA (simulated annealing algorithm coupled with random forest), and the classification models were established by using seven machine learning approaches based on the optimal feature subset, including a deep learning method, two ensemble learning methods, and four classical machine learning methods. The statistical results demonstrated that three methods, including support vector machine (SVM), deep neural networks (DNN) and extreme gradient boosting (XGBoost), outperformed the others, and the SVM classifier yielded the best predictions (MCC?=?0.812 and AUC?=?0.958 for the test set). Then, a perturbation-based model-agnostic method was used to interpret our models and analyze the representative features for different models. The application domain analysis demonstrated the prediction reliability of our models. Moreover, the important structural fragments related to BCRP inhibition were identified by the information gain (IG) method along with the frequency analysis. In conclusion, we believe that the classification models developed in this study can be regarded as simple and accurate tools to distinguish BCRP inhibitors from non-inhibitors in drug design and discovery pipelines.
机译:乳腺癌抗性蛋白(BCRP / ABCG2),ATP结合盒(ABC)流出转运蛋白,在多药物抵抗(MDR)中起重要作用,以抗癌药物和药物 - 药物相互作用。预测BCRP抑制可以促进药物发现早期评估潜在的耐药性和药物 - 药物相互作用。在这里,我们报告了由1098bcrP抑制剂和1701个非抑制剂组成的结构多样化的数据集。各种物理化学性质的分析说明BCRP抑制剂比非抑制剂更疏水和芳族。然后,我们开发了一系列定量结构 - 活动关系(QSAR)模型以区分?BCRP抑制剂和非抑制剂。最佳特征子集由命名RFSA的包装特征选择方法确定(耦合随机林的模拟退火算法),并且通过基于最佳特征子集使用七种机器学习方法来建立分类模型,包括深度学习方法,两个合奏学习方法和四种古典机器学习方法。统计结果表明,三种方法,包括支持向量机(SVM),深神经网络(DNN)和极端梯度升压(XGBoost),表现优于其他方法,并且SVM分类器产生了最佳预测(MCC?=?0.812和AUC ?=?0.958用于测试集)。然后,使用了一种基于扰动的模型 - 不可知方法来解释我们的模型并分析不同模型的代表特征。应用域分析显示了我们模型的预测可靠性。此外,通过信息增益(IG)方法鉴定了与BCRP抑制相关的重要结构片段以及频率分析。总之,我们认为本研究开发的分类模型可被视为简单准确的工具,以区分BCRP抑制剂在药物设计和发现管道中的非抑制剂中。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号