首页> 外文期刊>Bioinformatics >Intersession reproducibility of mass spectrometry profiles and its effect on accuracy of multivariate classification models
【24h】

Intersession reproducibility of mass spectrometry profiles and its effect on accuracy of multivariate classification models

机译:质谱图之间的重现性及其对多元分类模型准确性的影响

获取原文
获取原文并翻译 | 示例
       

摘要

Motivation: The reproducibility of mass spectrometry proteomic profiling has become an intensely controversial topic. The mere mention of concern over the reproducibility of data generated from any particular platform can lead to the anxiety over the generalizability of its results and its role in the future of discovery proteomics. In this study, we examine the reproducibility of proteomic profiles generated by surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-MS) across multiple data-generation sessions. We analyze the problem in terms of the reproducibility of signals, reproducibility of discriminative features and reproducibility of multivariate classification models on profiles for serum samples from early lung cancer and healthy control subjects. Results: Proteomic profiles in individual data-generation sessions experience within-session variability. We show that combining data from multiple sessions introduces additional (inter-session) noise. While additional noise can affect the discriminative analysis, we show that its average effect on profiles in our study is relatively small. Moreover, for the purposes of prediction on future (previously unseen) data, classifiers trained on multi-session data are able to adapt to inter-session noise and improve their classification accuracy. Contact: pelikan@cs.pitt.edu.
机译:动机:质谱蛋白质组学分析的可重复性已成为一个备受争议的话题。仅提及对从任何特定平台生成的数据的可重复性的关注,可能会导致对其结果的可概括性及其在发现蛋白质组学的未来中的作用感到焦虑。在这项研究中,我们研究了跨多个数据生成会话的表面增强的激光解吸/电离飞行时间质谱(SELDI-TOF-MS)生成的蛋白质组图谱的可重复性。我们从早期肺癌和健康对照组的血清样本中分析了信号的可再现性,判别特征的可再现性和多元分类模型的可再现性。结果:个体数据生成会话中的蛋白质组学概况会经历会话内的变异性。我们显示,合并来自多个会话的数据会引入额外的(会话间)噪声。尽管额外的噪声会影响判别分析,但我们表明,在我们的研究中,噪声对轮廓的平均影响相对较小。此外,出于对未来(以前看不见的)数据进行预测的目的,在多会话数据上训练的分类器能够适应会话间的噪声并提高分类精度。联系方式:pelikan@cs.pitt.edu。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号