...
首页> 外文期刊>Molecular informatics >Machine Learning Estimation of Atom Condensed Fukui Functions
【24h】

Machine Learning Estimation of Atom Condensed Fukui Functions

机译:原子凝聚的Fukui函数的机器学习估计

获取原文
获取原文并翻译 | 示例
           

摘要

To enable the fast estimation of atom condensed Fukui functions, machine learning algorithms were trained with databases of DFT pre-calculated values for ca. 23,000 atoms in organic molecules. The problem was approached as the ranking of atom types with the Bradley-Terry (BT) model, and as the regression of the Fukui function. Random Forests (RF) were trained to predict the condensed Fukui function, to rank atoms in a molecule, and to classify atoms as high/low Fukui function. Atomic descriptors were based on counts of atom types in spheres around the kernel atom. The BT coefficients assigned to atom types enabled the identification (93-94% accuracy) of the atom with the highest Fukui function in pairs of atoms in the same molecule with differences 0.1. In whole molecules, the atom with the top Fukui function could be recognized in ca. 50% of the cases and, on the average, about 3 of the top 4 atoms could be recognized in a shortlist of 4. Regression RF yielded predictions for test sets with R-2=0.68-0.69, improving the ability of BT coefficients to rank atoms in a molecule. Atom classification (as high/low Fukui function) was obtained with RF with sensitivity of 55-61% and specificity of 94-95%.
机译:为了能够快速估计原子凝聚的Fukui函数,使用DFT预先计算的值数据库对机器学习算法进行了训练。有机分子中有23,000个原子。用Bradley-Terry(BT)模型对原子类型进行排名,以及对Fukui函数进行回归来解决这个问题。随机森林(RF)受过培训,可以预测凝聚的Fukui函数,对分子进行排名,并将原子分类为高/低Fukui函数。原子描述符基于核原子周围球体中原子类型的计数。分配给原子类型的BT系数使得能够在同一分子中以0.1为差的原子对中具有最高Fukui函数的原子进行识别(准确度为93-94%)。在整个分子中,具有最高Fukui功能的原子可以在大约10中识别出来。 50%的情况(平均而言,前4个原子中的大约3个)可以在4个候选列表中被识别。回归RF得出了R-2 = 0.68-0.69的测试集的预测,从而提高了BT系数达到在分子中排列原子。使用RF可获得原子分类(作为高/低Fukui函数),灵敏度为55-61%,特异性为94-95%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号