首页> 美国卫生研究院文献>Journal of Cheminformatics >Random forests for feature selection in QSPR Models - an application for predicting standard enthalpy of formation of hydrocarbons
【2h】

Random forests for feature selection in QSPR Models - an application for predicting standard enthalpy of formation of hydrocarbons

机译:QSPR模型中用于特征选择的随机森林-预测碳氢化合物形成标准焓的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

BackgroundOne of the main topics in the development of quantitative structure-property relationship (QSPR) predictive models is the identification of the subset of variables that represent the structure of a molecule and which are predictors for a given property. There are several automated feature selection methods, ranging from backward, forward or stepwise procedures, to further elaborated methodologies such as evolutionary programming. The problem lies in selecting the minimum subset of descriptors that can predict a certain property with a good performance, computationally efficient and in a more robust way, since the presence of irrelevant or redundant features can cause poor generalization capacity. In this paper an alternative selection method, based on Random Forests to determine the variable importance is proposed in the context of QSPR regression problems, with an application to a manually curated dataset for predicting standard enthalpy of formation. The subsequent predictive models are trained with support vector machines introducing the variables sequentially from a ranked list based on the variable importance.
机译:背景技术定量结构-性质关系(QSPR)预测模型的开发中的主要主题之一是识别代表分子结构并且是给定特性的预测变量的子集。有几种自动特征选择方法,范围从向后,向前或逐步过程到进一步完善的方法,例如进化编程。问题在于选择描述符的最小子集,这些描述符的子集可以以良好的性能,高效的计算效率和更健壮的方式预测某个属性,因为不相关或多余的特征的存在会导致较差的泛化能力。本文在QSPR回归问题的背景下,提出了一种基于随机森林来确定变量重要性的替代选择方法,并将其应用于手动编排的数据集以预测标准形成焓。随后的预测模型使用支持向量机进行训练,支持向量机根据变量的重要性从排名列表中依次引入变量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号