首页> 外文期刊>Scientific reports. >Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method
【24h】

Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method

机译:数据驱动模型的诊断:PM7半经验量子化学方法的不确定度定量

获取原文
       

摘要

We report an evaluation of a semi-empirical quantum chemical method PM7 from the perspective of uncertainty quantification. Specifically, we apply Bound-to-Bound Data Collaboration, an uncertainty quantification framework, to characterize (a) variability of PM7 model parameter values consistent with the uncertainty in the training data and (b) uncertainty propagation from the training data to the model predictions. Experimental heats of formation of a homologous series of linear alkanes are used as the property of interest. The training data are chemically accurate , i.e., they have very low uncertainty by the standards of computational chemistry. The analysis does not find evidence of PM7 consistency with the entire data set considered as no single set of parameter values is found that captures the experimental uncertainties of all training data. A set of parameter values for PM7 was able to capture the training data within ±1?kcal/mol, but not to the smaller level of uncertainty in the reported data. Nevertheless, PM7 was found to be consistent for subsets of the training data. In such cases, uncertainty propagation from the chemically accurate training data to the predicted values preserves error within bounds of chemical accuracy if predictions are made for the molecules of comparable size. Otherwise, the error grows linearly with the relative size of the molecules.
机译:我们从不确定性量化的角度报告了对半经验量子化学方法PM7的评估。具体来说,我们应用不确定性量化框架Bound-to-Bound Data Collaboration,来表征(a)与训练数据中的不确定性一致的PM7模型参数值的可变性,以及(b)从训练数据到模型预测的不确定性传播。使用形成一系列直链烷烃的同源序列的实验热作为感兴趣的性质。训练数据在化学上是准确的,即,根据计算化学的标准,它们具有非常低的不确定性。该分析没有发现PM7与整个数据集一致的证据,因为未找到捕获所有训练数据的实验不确定性的单个参数值集。 PM7的一组参数值能够捕获±1?kcal / mol以内的训练数据,但在所报告的数据中不确定性较小。然而,发现PM7对于训练数据的子集是一致的。在这种情况下,如果对可比较大小的分子进行预测,则从化学准确的训练数据到预测值的不确定性传播会将误差保留在化学准确性的范围内。否则,误差会随着分子的相对大小线性增长。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号