首页> 美国卫生研究院文献>other >Bootstrap Classification and Point-Based Feature Selection from Age-Staged Mouse Cerebellum Tissues of Matrix Assisted Laser Desorption/Ionization Mass Spectra using a Fuzzy Rule-Building Expert System
【2h】

Bootstrap Classification and Point-Based Feature Selection from Age-Staged Mouse Cerebellum Tissues of Matrix Assisted Laser Desorption/Ionization Mass Spectra using a Fuzzy Rule-Building Expert System

机译:使用模糊规则建立专家系统从基质辅助激光解吸/电离质谱的年龄分级小鼠小脑组织进行引导分类和基于点的特征选择

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A bootstrap method for point-based detection of candidate biomarker peaks has been developed from pattern classifiers. Point-based detection methods are advantageous in comparison to peak-based methods. Peak determination and selection is problematic when spectral peaks are not baseline resolved or on a varying baseline. The benefit of point-based detection is that peaks can be globally determined from the characteristic features of the entire data set (i.e., subsets of candidate points) as opposed to the traditional method of selecting peaks from individual spectra and then combining the peak list into a data set. The point-based method is demonstrated to be more effective and efficient using a synthetic data set when compared to using Mahalanobis distance for feature selection. In addition, probabilities that characterize the uniqueness of the peaks are determined.This method was applied for detecting peaks that characterize age-specific patterns of protein expression of developing and adult mouse cerebella from matrix assisted laser desorption/ionization (MALDI) mass spectrometry (MS) data. The mice comprised three age groups; 42 adults, 19 14-day old pups, and 16 7-day old pups. Three sequential spectra were obtained from each tissue section to yield 126, 57 and 48 spectra for adult, 14-day old pup, and 7-day old pup spectra, respectively. Each spectrum comprised 71,879 mass measurements in a range of 3.5-50 kDa. A previous study revealed that 846 unique peaks were detected that were consistent for 50% of the mice in each age group.A fuzzy rule-building expert system (FuRES) was applied to investigate the correlation of age with features in the MS data. FuRES detected two outlier pup-14 spectra. Prediction was evaluated using 100 bootstrap samples of 2 Latin-partitions (i.e., 50:50 split between training and prediction set) of the mice. The spectra without the outliers yielded classification rates of 99.1±0.1%, 90.1±0.8%, and 97.0±0.6% for adults, 14-day old pups, and 7-day old pups, respectively. At a 95% level of significance, 100 bootstrap samples disclosed 35 adult and 21 pup distinguishing peaks for separating adults from pups; and 8 14-day old and 15 7-day old predictive peaks for separating 14-day old pup from 7-day old pup spectra. A compressed matrix comprising 40,393 points that were outside the 95% confidence intervals of one of the two FuRES discriminants was evaluated and the classification improved significantly for all classes. When peaks that satisfied a quality criterion were integrated, the 55 integrated peak areas furnished significantly improved classification for all classes: the selected peak areas furnished classification rates of 100%, 97.3±0.6%, and 97.4±0.3% for adult, 14-day old pups, and 7-day old pups using 100 bootstrap Latin partitions evaluations with the predictions averaged. When the bootstrap size was increased to 1000 samples, the results were not significantly affected. The FuRES predictions were consistent with those obtained by discriminant partial least squares (DPLS) classifications.
机译:已经从模式分类器中开发了用于基于点的候选生物标志物峰检测的自举方法。与基于峰的方法相比,基于点的检测方法具有优势。当光谱峰无法基线解析或在变化的基线上时,峰的确定和选择会出现问题。基于点的检测的好处是,可以从整个数据集(即候选点的子集)的特征中全局确定峰,这与从单个光谱中选择峰然后将峰列表合并为传统方法不同。数据集。与使用马哈拉诺比斯距离进行特征选择相比,使用合成数据集证明了基于点的方法更加有效。此外,还确定了表征峰唯一性的概率。该方法用于通过基质辅助激光解吸/电离(MALDI)质谱(MS)检测表征发育中和成年小鼠小脑蛋白质表达的特定年龄模式的峰)数据。小鼠分为三个年龄段。 42名成人,19只14天大的幼崽和16只7天大的幼崽。从每个组织切片获得三个顺序的光谱,分别产生成年,14天大的幼犬和7天大幼犬的光谱分别为126、57和48。每个光谱在3.5-50 kDa范围内包含71,879个质量测量值。先前的研究表明,在每个年龄组的50%的小鼠中检测到846个唯一峰, 是一致的。采用模糊规则建立专家系统(FuRES)来研究年龄与MS数据中的功能。 FuRES检测到两个异常的pup-14光谱。使用小鼠的2个拉丁分区(即训练和预测集之间的比例为50:50的比例)的100个引导程序样本对预测进行了评估。没有异常值的光谱对成人,14日龄幼犬和7天龄幼犬的分类率分别为99.1±0.1%,90.1±0.8%和97.0±0.6%。在95%的显着性水平下,有100个自举样本显示了35个成年幼崽和21个成年幼崽的区分峰,用于将成年幼崽与幼崽分开;和8个14天大的预测峰和15个7天大的预测峰,用于将7天大的幼犬光谱从14天大的幼仔中分离出来。评估了包含40393个点的压缩矩阵,该点在两个FuRES判别式之一的95%置信区间之外,并且所有类别的分类都得到了显着改善。当对符合质量标准的峰进行积分时,对55个积分峰区域的所有分类的分类效果都得到了显着改善:所选峰区域对14天成人的分类率分别为100%,97.3±0.6%和97.4±0.3%幼崽和7天大的幼崽使用100个引导拉丁分区评估,并对预测结果进行平均。当引导程序大小增加到1000个样本时,结果不会受到明显影响。 FuRES预测与通过判别偏最小二乘(DPLS)分类获得的预测一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号