首页> 外文期刊>Analytical methods >Application of a random forests (RF) method as a new approach for variable selection and modelling in a QSRR study to predict the relative retention time of some polybrominated diphenylethers (PBDEs)
【24h】

Application of a random forests (RF) method as a new approach for variable selection and modelling in a QSRR study to predict the relative retention time of some polybrominated diphenylethers (PBDEs)

机译:随机森林(RF)方法在QSRR研究中作为变量选择和建模的新方法的应用,以预测某些多溴二苯醚(PBDEs)的相对保留时间

获取原文
           

摘要

In this work, the method of random forests (RF) was applied for modeling and prediction of the relative retention time of some polybrominated diphenylethers (PBDEs) with descriptors calculated from the molecular structure alone. The effects of tuning parameters such as the number of trees (nt) and the number of randomly selected variables to split each node (m) were investigated. The obtained results showed that the pair (m = 38, nt = 500) can be considered as a plausible setting so that the generalization error was minimal. Also, the importance level of different descriptors was evaluated using RF to simplify the model. The performance of the RF model was compared with the artificial neural network (ANN). Both ANN and RF methods provided accurate predictions, although more accurate results were obtained by the RF model. The determination coefficients of the test set, obtained by the ANN and RF methods, are 0.9619 and 0.9707 respectively.
机译:在这项工作中,随机森林(RF)方法被用于建模和预测某些多溴代二苯醚(PBDEs)的相对保留时间,其描述符仅由分子结构计算得出。研究了调整参数(例如树数(nt)和随机选择的变量数,以分割每个节点(m))的效果。获得的结果表明,该对(m = 38,nt = 500)可以被视为一个合理的设置,从而使泛化误差最小。另外,使用RF评估了不同描述符的重要性级别以简化模型。将RF模型的性能与人工神经网络(ANN)进行了比较。 ANN和RF方法都提供了准确的预测,尽管RF模型获得了更准确的结果。通过ANN和RF方法获得的测试集的确定系数分别为0.9619和0.9707。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号