首页> 外文学位 >Classification methods and applications to mass spectral data.
【24h】

Classification methods and applications to mass spectral data.

机译:质谱数据的分类方法和应用。

获取原文
获取原文并翻译 | 示例

摘要

An important goal of data mining in chemistry is to try to extract useful information from databases, and then classify and recognize the compounds or medicines by their related molecular structure, topological index or chemical fingerprints. With the growth of chemical measurement and modern information technology, more and more huge databases containing a large amount of chemical compounds information are established, such as spectral databases, chromatographic databases, or databases on molecular structures and their substance properties. How to discover knowledge hidden in huge collections is a big challenge. Our work is mainly on the research of methodology and application of classification methods in huge data sets. In general, the classification methods which are introduced and proposed in this thesis can be applied to various classification problems. Here, we focus on the classification methods and applications in analysis of mass spectra. Mass spectrometry, an instrumental technique which is used to character and identify chemical compounds, produces large amounts of valuable data for chemical structure elucidation. Identification of compounds or automatic recognition of structural properties from mass spectra (MS) data is an important work in chemometrics. In this thesis, we first introduce different of classification methods based on classical multivariate data analysis, artificial intelligence or modern data mining techniques. These methods have been applied successfully to some extent in the automatic recognition of substructures or other structural properties form MS data. However, there are still many substructures which can not be recognized efficiently by existing classifiers. So seeking better techniques for mass spectral pattern recognition has being a mission in chemometrics.; In this thesis, I propose a new approach combining classification tree (CT) with sliced inverse regression (SIR) and apply it to the classification of mass spectra.
机译:化学数据挖掘的一个重要目标是尝试从数据库中提取有用的信息,然后根据其相关的分子结构,拓扑指数或化学指纹对化合物或药物进行分类和识别。随着化学测量和现代信息技术的发展,建立了越来越多的包含大量化合物信息的巨大数据库,例如光谱数据库,色谱数据库或分子结构及其物质特性数据库。如何发现藏在巨大藏书中的知识是一个巨大的挑战。我们的工作主要是在海量数据集中进行方法学研究和分类方法的应用。总体而言,本文介绍和提出的分类方法可以应用于各种分类问题。在这里,我们重点介绍质谱分析中的分类方法和应用。质谱法是一种用于表征和鉴定化合物的仪器技术,可产生大量有价值的数据以阐明化学结构。从质谱(MS)数据识别化合物或自动识别结构特性是化学计量学中的重要工作。本文首先介绍基于经典多元数据分析,人工智能或现代数据挖掘技术的不同分类方法。这些方法已成功地应用于自动识别MS数据中的子结构或其他结构特性。但是,仍有许多子结构无法被现有分类器有效地识别。因此,寻求更好的质谱模式识别技术已成为化学计量学的任务。本文提出了一种将分类树(CT)和切片逆回归(SIR)相结合的新方法,并将其应用于质谱的分类。

著录项

  • 作者

    He, Ping.;

  • 作者单位

    Hong Kong Baptist University (People's Republic of China).;

  • 授予单位 Hong Kong Baptist University (People's Republic of China).;
  • 学科 Mathematics.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 127 p.
  • 总页数 127
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 数学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号