...
首页> 外文期刊>Applied Soft Computing >Neural-based approaches to overcome feature selection and applicability domain in drug-related property prediction
【24h】

Neural-based approaches to overcome feature selection and applicability domain in drug-related property prediction

机译:克服药物相关性能预测中的神经基方法克服特征选择和适用域

获取原文
获取原文并翻译 | 示例

摘要

In the fields of pharmaceutical research and biomedical sciences, QSAR modeling is an established approach during drug discovery for prediction of biological activity of drug candidates. Yet, QSAR modeling poses a series of open challenges. First, chemical compounds are represented on a high-dimensional space and thus feature selection is typically applied, although this task entails a challenging combinatorial problem with potential loss of information. Second, the definition of the applicability domain of a QSAR model is a desirable aspect to determine the reliability of predictions on unseen chemicals, which is often difficult to assess due to the extent of the chemical space. Finally, interpretability of these models is also a critical issue for drug designers. The purpose of this work is to thoroughly assess the application of neural-based methods and recent advances deep learning for QSAR modeling. We hypothesize that neural-based methods can overcome the need to perform a descriptor selection phase. We developed three QSAR models based on neural networks for prediction of relevant chemical and biomedical properties that, in the absence of any feature selection step, can outperform the state-of-the-art models for such properties. We also implemented an embedded applicability domain technique based on network output probabilities that proved to be effective; its application improved the predictive performance of the model. Finally, we proposed the use of a post hoc feature analysis technique based on an aggregation of network weights, which enabled effective detection of relevant features in the model. (C) 2019 Elsevier B.V. All rights reserved.
机译:在制药研究和生物医学科学领域,QSAR建模是在药物发现中的既定方法,以预测毒品候选人的生物活性。然而,QSAR建模构成了一系列开放的挑战。首先,化学化合物在高维空间上表示,因此通常施加特征选择,尽管该任务需要具有潜在信息损失的挑战组合问题。其次,QSAR模型的适用性域的定义是一种理想的方面,以确定看不见的化学物质的预测可靠性,这通常难以评估,由于化学空间的程度。最后,这些模型的可解释性也是药物设计师的重要问题。这项工作的目的是彻底评估神经基方法的应用,最近的深入学习对QSAR建模的影响。我们假设神经基方法可以克服执行描述符选择阶段的需要。我们开发了基于神经网络的三个QSAR模型,以预测相关化学和生物医学特性,即在没有任何特征选择步骤的情况下,可以优于这种性质的最先进的模型。我们还基于被证明是有效的网络输出概率的嵌入式适用性域技术;其应用提高了模型的预测性能。最后,我们建议使用基于网络权重的聚合的HOC特征分析技术,这使得能够有效地检测模型中的相关功能。 (c)2019年Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号