...
首页> 外文期刊>The Journal of Chemical Physics >Input vector optimization of feed-forward neural networks for fitting ab initio potential-energy databases
【24h】

Input vector optimization of feed-forward neural networks for fitting ab initio potential-energy databases

机译:前馈神经网络的输入向量优化,从头拟合势能数据库

获取原文
获取原文并翻译 | 示例
           

摘要

The variation in the fitting accuracy of neural networks (NNs) when used to fit databases comprising potential energies obtained from ab initio electronic structure calculations is investigated as a function of the number and nature of the elements employed in the input vector to the NN. Ab initio databases for _2O_2, HONO, Si_5, and H_2 CCHBr were employed in the investigations. These systems were chosen so as to include four-, five-, and six-body systems containing first, second, third, and fourth row elements with a wide variety of chemical bonding and whose conformations cover a wide range of structures that occur under high-energy machining conditions and in chemical reactions involving cis-trans isomerizations, six different types of two-center bond ruptures, and two different three-center dissociation reactions. The ab initio databases for these systems were obtained using density functional theory/B3LYP, MP2, and MP4 methods with extended basis sets. A total of 31 input vectors were investigated. In each case, the elements of the input vector were chosen from interatomic distances, inverse powers of the interatomic distance, three-body angles, and dihedral angles. Both redundant and nonredundant input vectors were investigated. The results show that among all the input vectors investigated, the set employed in the Z-matrix specification of the molecular configurations in the electronic structure calculations gave the lowest NN fitting accuracy for both Si_5 and vinyl bromide. The underlying reason for this result appears to be the discontinuity present in the dihedral angle for planar geometries. The use of trigometric functions of the angles as input elements produced significantly improved fitting accuracy as this choice eliminates the discontinuity. The most accurate fitting was obtained when the elements of the input vector were taken to have the form R_(ij)~(-n), where the Rij are the interatomic distances. When the Levenberg-Marquardt procedure was modified to permit error minimization with respect to n as well as the weights and biases of the NN, the optimum powers were all found to lie in the range of 1.625-2.38 for the four systems studied. No statistically significant increase in fitting accuracy was achieved for vinyl bromide when a different value of n was employed and optimized for each bond type. The rate of change in the fitting error with n is found to be very small when n is near its optimum value. Consequently, good fitting accuracy can be achieved by employing a value of n in the middle of the above range. The use of interparticle distances as elements of the input vector rather than the Z-matrix variables employed in the electronic structure calculations is found to reduce the rms fitting errors by factors of 8.86 and 1.67 for Si_5 and vinyl bromide, respectively. If the interparticle distances are replaced with input elements of the form R_(ij)~(-n) with n optimized, further reductions in the rms error by a factor of 1.31 to 2.83 for the four systems investigated are obtained. A major advantage of using this procedure to increase NN fitting accuracy rather than increasing the number of neurons or the size of the database is that the required increase in computational effort is very small.
机译:神经网络 (NN) 的拟合精度变化,当用于拟合包含从头开始电子结构计算中获得的势能的数据库时,作为 NN 输入向量中使用的元素的数量和性质的函数H_2 Si_5_2O_2进行研究。选择这些系统是为了包括四体、五体和六体系统,其中包含具有多种化学键合的第一排、第二排、第三排和第四排元素,其构象涵盖了在高能加工条件下和涉及顺反异构化的化学反应中发生的各种结构,六种不同类型的双中心键断裂, 和两种不同的三中心解离反应。使用密度泛函理论/B3LYP、MP2 和 MP4 方法获得这些系统的从头开始数据库,并具有扩展的基集。共研究了 31 个输入向量。在每种情况下,输入向量的元素都是从原子间距离、原子间距离的逆幂、三体角和二面角中选择的。研究了冗余和非冗余输入向量。结果表明,在所研究的所有输入向量中,电子结构计算中分子构型的Z矩阵规范中采用的集合对Si_5和乙烯基溴的NN拟合精度最低。造成这一结果的根本原因似乎是平面几何形状的二面角中存在的不连续性。使用角度的三角函数作为输入元素,可以显著提高拟合精度,因为这种选择消除了不连续性。当输入向量的元素具有 R_(ij)~(-n) 的形式时,获得了最精确的拟合,其中 Rij 是原子间距离。当修改 Levenberg-Marquardt 程序以允许对 n 以及 NN 的权重和偏差进行误差最小化时,发现所研究的四个系统的最佳功效都在 1.625-2.38 的范围内。当采用不同的 n 值并针对每种键类型进行优化时,溴乙烯的拟合精度没有统计学上的显着提高。当 n 接近其最佳值时,发现 n 的拟合误差变化率非常小。因此,通过在上述范围的中间使用n值可以实现良好的拟合精度。使用粒子间距离作为输入矢量的元素,而不是电子结构计算中使用的Z矩阵变量,可以将Si_5和溴乙烯的均方根拟合误差分别降低8.86倍和1.67倍。如果将粒子间距离替换为R_(ij)~(-n)形式的输入元素,并优化n,则所研究的四个系统的均方根误差进一步降低了1.31至2.83倍。使用此过程来提高 NN 拟合精度而不是增加神经元数量或数据库大小的一个主要优点是所需的计算工作量非常小。

著录项

  • 来源
    《The Journal of Chemical Physics》 |2010年第20期|204103-1-204103-8|共8页
  • 作者单位

    Mechanical and Aerospace Engineering, Oklahoma State University, 218 Engineering North, Stillwater, OK 74078, United States;

    Department of Chemistry, Oklahoma State University, 107 Physical Science I, Stillwater, OK 74078, United States;

    Electrical and Computer Engineering, Oklahoma State University, 202 Engineering South, Stillwater, OK 74078, United StatesIndustrial and Management Engineering, Oklahoma State University, 322 Engineering North, Stillwater, OK 74078, United States;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 英语
  • 中图分类 物理化学(理论化学)、化学物理学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号