...
首页> 外文期刊>Proteomics >Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method
【24h】

Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method

机译:潜在蛋白质投影法对质谱蛋白质组学数据进行大数据分析

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

There are many data mining techniques for processing and general learning of multivariate data. However, we believe the wavelet transformation and latent variable projection method are particularly useful for spectroscopic and chromatographic data. Projection based methods are designed to handle hugely multivariate nature of such data effectively. For the actual analysis of the data we have used latent variable projection methods such as principal component analysis (PCA) and partial least squares projection to latent structures based discriminant analysis (PLS-DA) to analyze the raw data presented to the participants of the First Duke Proteomics Data Mining Conference. PCA was used to solve problem #1 (clustering problem) and the PLS-DA was used to solve problem #2 (classification problem). The idea of internal and external cross-validation was used to validate the model obtained from the classification analysis. The simple two-component PLS-DA model obtained from the analysis performed well. The model has completely separated the two groups from all the data. The same model applied on two-thirds of the data showed good performance by external validation with independent test set of remaining 13 specimens obtained by setting aside the spectra of every third specimen (accuracy of 85%).
机译:有许多数据挖掘技术可用于处理和学习多元数据。但是,我们认为小波变换和潜变量投影方法对于光谱和色谱数据特别有用。设计基于投影的方法以有效处理此类数据的巨大多变量性质。对于数据的实际分析,我们使用了潜在变量投影方法(例如主成分分析(PCA)和偏最小二乘法投影到基于判别分析的潜在结构(PLS-DA))来分析提供给第一届参与者的原始数据杜克蛋白质组学数据挖掘会议。 PCA用于解决问题1(聚类问题),而PLS-DA用于解决问题2(分类问题)。内部和外部交叉验证的思想被用来验证从分类分析中获得的模型。通过分析获得的简单的两组分PLS-DA模型表现良好。该模型已将两组数据与所有数据完全分开。应用于三分之二数据的相同模型通过外部验证通过独立测试集显示了良好的性能,剩下的13个样本通过将每三个样本的光谱放在一边而获得了独立的测试集(准确性为85%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号