首页> 美国卫生研究院文献>Scientific Reports >Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
【2h】

Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery

机译:QSAR建模中用于药物发现的混合特征选择和特征学习方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Quantitative structure–activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can be used for this modeling procedure: feature selection and feature learning. In this paper, a performance comparative study of two state-of-art methods related to these two approaches is carried out. In particular, regression and classification models for three different issues are inferred using both methods under different experimental scenarios: two drug-like properties, such as blood-brain-barrier and human intestinal absorption, and enantiomeric excess, as a measurement of purity used for chiral substances. Beyond the contrastive analysis of feature selection and feature learning methods as competitive approaches, the hybridization of these strategies is also evaluated based on previous results obtained in material sciences. From the experimental results, it can be concluded that there is not a clear winner between both approaches because the performance depends on the characteristics of the compound databases used for modeling. Nevertheless, in several cases, it was observed that the accuracy of the models can be improved by combining both approaches when the molecular descriptor sets provided by feature selection and feature learning contain complementary information.
机译:使用机器学习技术的定量结构-活性关系建模构成了一个复杂的计算问题,在此过程中,识别最有用的分子描述符以预测特定的目标特性起着至关重要的作用。两种主要的通用方法可用于此建模过程:特征选择和特征学习。本文对与这两种方法相关的两种最先进的方法进行了性能比较研究。特别是,使用两种方法在不同的实验场景下推断出三个不同问题的回归和分类模型:两种药物样特性,例如血脑屏障和人体肠道吸收,以及对映体过量,作为用于测量纯度的方法手性物质。除了将特征选择和特征学习方法作为竞争性方法进行对比分析之外,还基于材料科学领域的先前结果对这些策略的混合性进行了评估。从实验结果可以得出结论,两种方法之间没有明显的赢家,因为性能取决于用于建模的化合物数据库的特性。然而,在某些情况下,观察到当特征选择和特征学习提供的分子描述符集包含互补信息时,通过结合两种方法可以提高模型的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号