...
首页> 外文期刊>ChemMedChem >In Silico Prediction of Blood-Brain Barrier Permeability of Compounds by Machine Learning and Resampling Methods
【24h】

In Silico Prediction of Blood-Brain Barrier Permeability of Compounds by Machine Learning and Resampling Methods

机译:通过机器学习和重采样方法在血脑屏障渗透性的硅屏障渗透性中

获取原文
获取原文并翻译 | 示例
           

摘要

The blood-brain barrier (BBB) as apart of absorption protects the central nervoussystem by separating the brain tissue from the bloodstream. In recent years,BBB permeability has become acritical issue in chemical ADMET prediction, but almostall modelswere built using imbalanced data sets, which caused ahighfalse-positiverate. Therefore, we tried to solve the problem of biased data sets and built areliable clas- sification model with 2358 compounds. Machine learning and resampling methods were used simultaneously for the refine- ment of models with both 2D molecular descriptors and mo- lecular fingerprints to represent the chemicals. Through a series of evaluation,werealized that resampling methods such as SyntheticMinority Oversampling Technique(SMOTE) and SMOTE+edited nearest neighbor could effectively solve the problemofimbalanced data sets and that MACCS fingerprint combined with support vector machine performed the best. After the final construction of aconsensus model,the overall accuracy rate was increased to 0.966 for the final external data set. Also, the accuracy rate of the model for the test set was 0.919, with an excellent balancedcapacity of 0.925 (sensitivity) to predict BBB-positive compounds and of 0.899 (specificity) to predictBBB-negative compounds. Compared with other BBB classification models, our modelsreduced the rate of false pos- itives and were more robust in prediction of BBB-positive as well as BBB-negative compounds, which would be quite help- ful in early drug discovery.
机译:血脑屏障(BBB)作为吸收除了,通过将脑组织与血液中的脑组织分离来保护中枢神经系统。近年来,BBB渗透率已成为化学备用拍摄预测中的协助问题,但差异模型采用不平衡数据集建造,这导致了Ahighfalse-positiveate。因此,我们试图解决偏置数据集的问题,并用2358个化合物建立了可造成的可接受条款模型。同时使用机器学习和重采样方法,用于用2D分子描述符和Mo-untular指纹的型号进行精炼,以代表化学品。通过一系列评估,防止重采样方法,例如合成材料过采样技术(Smote)和Smote +编辑的最近邻权可以有效地解决问题的数据集,并且MACCS指纹与支持向量机器相结合。在ACONSENS模型的最终构建后,最终外部数据集的总精度率增加到0.966。而且,测试套装的模型的精度率为0.919,具有0.925(敏感性)的优异平衡程度,以预测BBB阳性化合物和0.899(特异性)至预测BBB阴性化合物。与其他BBB分类模型相比,我们模特造型的虚假位置的速度,并且在预测BBB阳性以及BBB阴性化合物方面更加稳健,这在早期药物发现中会非常有帮助。

著录项

  • 来源
    《ChemMedChem》 |2018年第20期|共13页
  • 作者单位

    Shanghai Key Laboratory of New Drug Design School of Pharmacy East China University of Science and Technology Shanghai 200237 (China);

    Shanghai Key Laboratory of New Drug Design School of Pharmacy East China University of Science and Technology Shanghai 200237 (China);

    Shanghai Key Laboratory of New Drug Design School of Pharmacy East China University of Science and Technology Shanghai 200237 (China);

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 药学;
  • 关键词

    blood-brain barrier; imbalanced data; machine learning; QSAR models; resampling methods;

    机译:血脑屏障;数据不平衡;机器学习;QSAR模型;重采样方法;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号