首页> 外文学位 >Kriging model approach to modeling study on relationship between molecular quantitative structures and chemical properties.
【24h】

Kriging model approach to modeling study on relationship between molecular quantitative structures and chemical properties.

机译:用克氏模型法对分子定量结构与化学性质之间的关系进行建模研究。

获取原文
获取原文并翻译 | 示例

摘要

The molecular descriptors include various topological indices, quantum chemical descriptors, physicochemical parameters and so on. They all give structure descriptions of chemical compounds. Chemometrics, especially, quantitative structure activity relationship (QSAR) and quantitative structure-property relationship (QSPR) attempt to correlate physical, chemical and biological activities or properties with structural descriptors of compounds and find a suitable model, called metamodel, to establish relationships between molecule descriptors and activities or properties. The results are useful in theoretical and computational chemistry, biochemistry, pharmacology and environment research.; Techniques in multivariate analysis and data mining, such as ordinary least squares regression, principal components regression, partial least squares regression, multivariate adaptive regression splines and multivariate additive regression tree, are useful tools for modeling. Metamodels generated by these methods, basically are linear models with independently identical distributed (i.i.d.) random errors. However, the assumption of independent and identical distributed errors in general metamodel is not always true. For instance, many examples show that there can still be unacceptably large residuals compared to measurement errors in many models of QSAR/QSPR research. The reason for this may be diverse. The simplest and the most natural reflection on our mind is that the unaccepted residuals could be dependent. These dependent errors will present more information than independent situation. For instance, we might use a stationary Gaussian process {lcub}z(x i), i = 1. 2, ···, n{rcub} instead of independent random variables epsiloni's. In fact, the general Kriging approach just consists of parametric item and a stochastic process. In this thesis, we compared the Kriging models with other metamodels. Experiments showed that the proposed Kriging approach could improve the regression models used widely in Chemometrics.; It is known that Kriging is an interpolating predictor which would be very beneficial for the fitting of the training data, but is not very so good for the predictions of the testing data, when the data were collected with random noise epsilon(x). So if we add a disturbing input epsilon(x) in the original Kriging, the new Kriging model called empirical Kriging in some literature will provide more accurate prediction for the noisy data than the Kriging model. Many authors have paid attention to the merit of noninterpolating Kriging model. One of purposes of this thesis is to apply the empirical Kriging model to quantitative structure-activity relationship (QSAR) research. We demonstrate in the case study that the empirical Kriging model can significantly improve the prediction accuracy of other metamodels, including the Kriging models.
机译:分子描述符包括各种拓扑指数,量子化学描述符,理化参数等。它们都给出了化学化合物的结构描述。化学计量学,尤其是定量结构活性关系(QSAR)和定量结构性质关系(QSPR)试图将物理,化学和生物学活性或特性与化合物的结构描述符相关联,并找到合适的模型(称为元模型)来建立分子之间的关系描述符和活动或属性。该结果可用于理论和计算化学,生物化学,药理学和环境研究。多元分析和数据挖掘中的技术,例如普通最小二乘回归,主成分回归,偏最小二乘回归,多元自适应回归样条和多元加性回归树,是建模的有用工具。通过这些方法生成的元模型基本上是具有独立相同的分布(i.i.d.)随机误差的线性模型。但是,一般元模型中独立且相同的分布错误的假设并不总是正确的。例如,许多示例表明,在许多QSAR / QSPR研究模型中,与测量误差相比,仍然会有不可接受的大残差。其原因可能是多种多样的。对我们而言,最简单,最自然的反映是,不可接受的残差可能是依赖的。与独立情况相比,这些相关错误将提供更多信息。例如,我们可以使用平稳的高斯过程{lcub} z(x i),i =1。2,···,n {rcub},而不是独立的随机变量epsiloni。实际上,一般的克里金法只是由参数项和随机过程组成。在本文中,我们将Kriging模型与其他元模型进行了比较。实验表明,提出的Kriging方法可以改善在化学计量学中广泛使用的回归模型。众所周知,克里格插值法是一种插值预测器,当以随机噪声epsilon(x)收集数据时,这对训练数据的拟合将非常有益,但对测试数据的预测却不是那么好。因此,如果我们在原始Kriging中添加干扰输入epsilon(x),则某些文献中称为经验Kriging的新Kriging模型将比Kriging模型提供更准确的噪声数据预测。许多作者已经关注了非插值Kriging模型的优点。本文的目的之一是将经验克里格模型应用于定量构效关系(QSAR)研究。我们在案例研究中证明,经验克里格模型可以显着提高其他元模型(包括克里格模型)的预测准确性。

著录项

  • 作者

    Yin, Hong.;

  • 作者单位

    Hong Kong Baptist University (People's Republic of China).;

  • 授予单位 Hong Kong Baptist University (People's Republic of China).;
  • 学科 Statistics.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 105 p.
  • 总页数 105
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 统计学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号