首页> 外文期刊>Computers & Chemistry >AUTOMATIC LEARNING OF CHEMICAL CONCEPTS: RESEARCH OCTANE NUMBER AND MOLECULAR SUBSTRUCTURES
【24h】

AUTOMATIC LEARNING OF CHEMICAL CONCEPTS: RESEARCH OCTANE NUMBER AND MOLECULAR SUBSTRUCTURES

机译:化学概念的自动学习:研究辛烷值和分子亚结构

获取原文
获取原文并翻译 | 示例
       

摘要

A set of 230 hydrocarbons is analyzed with respect to the research octane number (RON) using an extended version of the ID3 machine learning method of Quinlan. The basic ID3 method produces a decision tree. The questions within the decision tree ask whether given substructures are present or absent within a molecule (for example, is the hexane carbon skeleton present as a substructure?). The decision to be made is whether the RON value is within a given range (for example, is the RON greater than or equal to 90). In addition to the normal usage of decision trees to predict the RON value, qualitative information is extracted: for example, which substructures are significant in which RON ranges. The best results were found for those molecules having RON values between 90 and 105. This was due to the high density of samples in this region. The ID3 method is used to prune down an overabundance of parameters to a reasonable size. From the original set of 230 substructures, it was found that only 31 were needed for the RON description. The presence or absence of the substructures n -hexane, methylbenzene and benzene were particularly significant in determining the RON value. Considering that ID3 is not a traditional method of analysis, especially within chemistry, an effort is made to explain its principles using the problem at hand as an example. In addition, its use as the first step in analysis to gain intuitive information is explained.
机译:使用Quinlan的ID3机器学习方法的扩展版本,针对研究辛烷值(RON)分析了230种碳氢化合物。基本的ID3方法生成决策树。决策树中的问题询问分子中是否存在给定的亚结构(例如,是否存在作为子结构的己烷碳骨架?)。要做的决定是RON值是否在给定范围内(例如RON大于或等于90)。除了正常使用决策树预测RON值外,还提取定性信息:例如,哪些子结构在RON范围内很重要。对于RON值在90到105之间的分子,发现了最佳结果。这是由于该区域中样品的密度很高。 ID3方法用于将过多的参数缩减到合理的大小。从最初的230个子结构集中,发现RON描述仅需要31个。在确定RON值时,是否存在亚结构正己烷,甲基苯和苯尤为重要。考虑到ID3并不是传统的分析方法,尤其是在化学方法中,因此我们以当前的问题为例来说明其原理。此外,还解释了将其用作分析以获得直观信息的第一步。

著录项

  • 来源
    《Computers & Chemistry》 |1995年第2期|p.91-99|共9页
  • 作者

    EDWARD S. BLUROCK;

  • 作者单位

    Research Institute for Symbolic Computation, Johannes Kepler University, A-4020 Linz, Austria;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算机的应用;
  • 关键词

  • 入库时间 2022-08-18 01:07:41

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号