...
首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >Self Organising Maps for variable selection: Application to human saliva analysed by nuclear magnetic resonance spectroscopy to investigate the effect of an oral healthcare product
【24h】

Self Organising Maps for variable selection: Application to human saliva analysed by nuclear magnetic resonance spectroscopy to investigate the effect of an oral healthcare product

机译:用于变量选择的自组织图:通过核磁共振波谱分析在人体唾液中的应用,以研究口腔保健产品的效果

获取原文
获取原文并翻译 | 示例
           

摘要

SOMs (Self Organising Maps) are derived from the machine learning literature and serve as a valuable method for representing data. In this paper, the use of SOMs as a technique for determining the most significant variables (or markers) in a dataset is described. The method is applied to the NMR spectra of 96 human saliva samples, half of which have been treated with an oral rinse formulation and half of which are controls, and 49 variables consisting of bucketed intensities. In addition, three simulations, two of which consist of the same number of samples and variables as the experimental dataset and a third that contains a much larger number of variables, are described. Two of the simulations contain known discriminatory variables, and the remaining is treated as a null dataset without any specific discriminatory variables added. The described SOM method is contrasted to Partial Least Squares Discriminant Analysis, and a list of the markers determined to be most significant using both approaches was obtained and the differences arising are discussed. A SOM Discrimination Index (SOMDI) is defined, whose magnitude relates to how strongly a variable is considered to be a discriminator. In order to ensure that the model is stable and not dependent on the random starting point of the SOM, one hundred iterations were performed and variables that were consistently of high rank were selected. A variety of approaches for data representation are illustrated, and the main theoretical principles of employing SOMs for determining which variables are most significant are outlined. Software used in this paper was written in-house, allowing greater flexibility over existing packages, and tailored for the specific application in hand.
机译:SOM(自组织映射)是从机器学习文献中获得的,并且是表示数据的一种有价值的方法。在本文中,描述了使用SOM作为确定数据集中最重要变量(或标记)的技术。该方法适用于96个人类唾液样品的NMR光谱,其中一半已用漱口水配方处理,另一半是对照,以及49个由桶装强度组成的变量。此外,还描述了三种模拟,其中两种模拟包含与实验数据集相同数量的样本和变量,而第三种包含大量变量。其中两个模拟包含已知的区分变量,其余模拟被视为未添加任何特定区分变量的空数据集。所描述的SOM方法与偏最小二乘判别分析进行了对比,并获得了使用两种方法确定为最重要的标记的列表,并讨论了所产生的差异。定义了SOM歧视指数(SOMDI),其大小与变量被视为歧视者的程度有关。为了确保模型稳定且不依赖于SOM的随机起始点,执行了一百次迭代,并选择了始终具有较高等级的变量。说明了各种数据表示方法,并概述了使用SOM确定哪个变量最重要的主要理论原理。本文使用的软件是内部编写的,可以为现有软件包提供更大的灵活性,并针对手头的特定应用进行了量身定制。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号