...
首页> 外文期刊>Journal of Computational Chemistry: Organic, Inorganic, Physical, Biological >In Silico Prediction and Screening of c-Secretase Inhibitors by Molecular Descriptors and Machine Learning Methods
【24h】

In Silico Prediction and Screening of c-Secretase Inhibitors by Molecular Descriptors and Machine Learning Methods

机译:在计算机上通过分子描述符和机器学习方法预测和筛选c-分泌酶抑制剂

获取原文
获取原文并翻译 | 示例
           

摘要

c-Secretase inhibitors have been explored for the prevention and treatment of Alzheimer’s disease (AD). Methods for prediction and screening of c-secretase inhibitors are highly desired for facilitating the design of novel therapeutic agents against AD, especially when incomplete knowledge about the mechanism and three-dimensional structure of c-secretase. We explored two machine learning methods, support vector machine (SVM) and random forest (RF), to develop models for predicting c-secretase inhibitors of diverse structures. Quantitative analysis of the receiver operating characteristic (ROC) curve was performed to further examine and optimize the models. Especially, the Youden index (YI) was initially introduced into the ROC curve of RF so as to obtain an optimal threshold of probability for prediction. The developed models were validated by an external testing set with the prediction accuracies of SVM and RF 96.48 and 98.83% for c-secretase inhibitors and 98.18 and 99.27% for noninhibitors, respectively. The different feature selection methods were used to extract the physicochemical features most relevant to c-secretase inhibition. To the best of our knowledge, the RF model developed in this work is the first model with a broad applicability domain, based on which the virtual screening of c-secretase inhibitors against the ZINC database was performed, resulting in 368 potential hit candidates.
机译:已经研究了c-分泌酶抑制剂用于预防和治疗阿尔茨海默氏病(AD)。非常需要用于预测和筛选c-分泌酶抑制剂的方法,以利于设计针对AD的新型治疗剂,尤其是在对c-分泌酶的机理和三维结构不完全了解的情况下。我们探索了两种机器学习方法,即支持向量机(SVM)和随机森林(RF),以开发用于预测各种结构的c-分泌酶抑制剂的模型。对接收器工作特性(ROC)曲线进行了定量分析,以进一步检查和优化模型。特别地,最初将Youden指数(YI)引入RF的ROC曲线中,以获得最佳的预测概率阈值。通过外部测试集对开发的模型进行了验证,其中c-分泌酶抑制剂的预测准确性分别为SVM和RF 96.48和98.83%,非抑制剂的预测准确性分别为98.18和99.27%。使用了不同的特征选择方法来提取与c-分泌酶抑制最相关的理化特征。据我们所知,这项工作中开发的RF模型是第一个具有广泛适用性的模型,在此模型的基础上,针对ZINC数据库进行了c-分泌酶抑制剂的虚拟筛选,产生了368个潜在的候选基因。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号