首页> 外文期刊>Monatshefte für Chemie / Chemical Monthly >Application of principal component-genetic algorithm-artificial neural network for prediction acidity constant of various nitrogen-containing compounds in water
【24h】

Application of principal component-genetic algorithm-artificial neural network for prediction acidity constant of various nitrogen-containing compounds in water

机译:主成分遗传算法-人工神经网络在预测水中各种含氮化合物酸度常数中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Abstract Principal component-genetic algorithm-multiparameter linear regression (PC-GA-MLR) and principal component-genetic algorithm-artificial neural network (PC-GA-ANN) models were applied for prediction acidity constant (pK a ) for various nitrogen-containing compounds. A data set that consisted of 282 various compounds, including 55 anilines, 77 amines, 82 pyridines, 14 pyrimidines, 26 imidazoles and benzimidazoles, and 28 quinolines, is used in this work. A large number of theoretical descriptors were calculated for each compound. The first 179 principal components (PCs) were found to explain more than 99.9% of variances in the original data matrix. From the pool of these PCs, the genetic algorithm was employed for selection of the best set of extracted PCs for PC-MLR and PC-ANN models. The models were generated using 15 PCs as variables. For evaluation of the predictive power of the models, pK a values of 56 compounds in the prediction set were calculated. Root mean square errors (RMSE) for PC-GA-MLR and PC-GA-ANN models are 1.4863 and 0.0750. Comparison of the results obtained by the models reveals superiority of the PC-GA-ANN model relative to the PC-GA-MLR model. Mean percent deviation for the PC-GA-ANN model in the prediction set is 2.123. The improvements are due to the fact that pK a of the compounds demonstrates non-linear correlations with the PCs.
机译:摘要应用主成分-遗传算法-多参数线性回归(PC-GA-MLR)模型和主成分-遗传算法-人工神经网络(PC-GA-ANN)模型预测酸度常数(pK a )用于各种含氮化合物。本工作使用了由282种不同化合物组成的数据集,其中包括55种苯胺,77种胺,82种吡啶,14种嘧啶,26种咪唑和苯并咪唑以及28种喹啉。为每种化合物计算了大量的理论描述符。发现前179个主成分(PC)可以解释原始数据矩阵中超过99.9%的方差。从这些PC池中,采用遗传算法为PC-MLR和PC-ANN模型选择最佳的提取PC集。使用15台PC作为变量生成模型。为了评估模型的预测能力,计算了预测集中56种化合物的pK a 值。 PC-GA-MLR和PC-GA-ANN模型的均方根误差(RMSE)为1.4863和0.0750。通过模型获得的结果的比较揭示了PC-GA-ANN模型相对于PC-GA-MLR模型的优越性。预测集中PC-GA-ANN模型的平均偏差百分比为2.123。改进归因于这些化合物的pK a 表现出与PC的非线性相关性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号