首页> 外文学位 >Computer-assisted development of quantitative structure-property relationships and design of feature selection routines.
【24h】

Computer-assisted development of quantitative structure-property relationships and design of feature selection routines.

机译:计算机辅助的定量结构-属性关系的开发和特征选择例程的设计。

获取原文
获取原文并翻译 | 示例

摘要

Quantitative structure-property relationships (QSPR) can be used to develop models that accurately predict physical and chemical properties for organic compounds. If the assumption is made that structure is the primary influence on certain observed phenomenon, then numerically encoding the structure with descriptors can provide the means for a mathematical link between structure and some measured property. Linear regression and neural networks are the primary methods employed to build the models. The use of these computational tools is presented in this thesis.;Two descriptor (feature) selection routines using the genetic algorithm are developed and presented. The theory of the genetic algorithm and how it is used for both linear and non-linear feature selection is presented. A linear routine couples the genetic algorithm with linear regression. A non-linear routine couples neural networks and the genetic algorithm.;Three models are developed using linear regression and neural networks to predict reduced ion mobility constants K;Models that accurately predict normal boiling points for organic compounds containing have also been developed with regression and neural network methods. The accuracy of prediction of the models developed for three separate sets of organic compounds is similar to experimental errors in all cases. Each method is also compared to a widely used group contribution method for boiling point estimation.;The genetic algorithm non-linear feature selection routine has been applied to a specific set of data to test its usefulness. A high quality non-linear model is developed and presented. The behavior of computational neural networks has also been investigated, and is presented in some detail. It is hoped that a better understanding of computational neural networks can be gained by extracting some of the hidden neuron behavioral patterns.
机译:定量结构-性质关系(QSPR)可用于开发可准确预测有机化合物的物理和化学性质的模型。如果假设结构是对某些观察到的现象的主要影响,那么使用描述符对结构进行数字编码可以为结构与某些已测量属性之间的数学联系提供手段。线性回归和神经网络是建立模型的主要方法。本文介绍了这些计算工具的用法。;开发并提出了两种使用遗传算法的描述符(特征)选择例程。介绍了遗传算法的原理以及如何将其用于线性和非线性特征选择。线性例程将遗传算法与线性回归结合在一起。一个非线性程序将神经网络和遗传算法相结合。;使用线性回归和神经网络开发了三个模型,以预测降低的离子迁移常数K;通过回归分析和回归,还开发了精确预测含有机化合物的正常沸点的模型。神经网络方法。在所有情况下,为三套单独的有机化合物开发的模型的预测准确性与实验误差相似。还将每种方法与用于沸点估计的广泛使用的组贡献方法进行比较。;遗传算法的非线性特征选择例程已应用于特定的数据集,以测试其有效性。开发并提出了高质量的非线性模型。还已经研究了计算神经网络的行为,并对其进行了详细介绍。希望可以通过提取一些隐藏的神经元行为模式来更好地理解计算神经网络。

著录项

  • 作者

    Wessel, Matthew David.;

  • 作者单位

    The Pennsylvania State University.;

  • 授予单位 The Pennsylvania State University.;
  • 学科 Analytical chemistry.;Computer science.
  • 学位 Ph.D.
  • 年度 1997
  • 页码 288 p.
  • 总页数 288
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号