首页> 美国卫生研究院文献>Bioinformatics >Improved quality control processing of peptide-centric LC-MS proteomics data
【2h】

Improved quality control processing of peptide-centric LC-MS proteomics data

机译:改进的以肽为中心的LC-MS蛋白质组学数据的质量控制处理

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Motivation: In the analysis of differential peptide peak intensities (i.e. abundance measures), LC-MS analyses with poor quality peptide abundance data can bias downstream statistical analyses and hence the biological interpretation for an otherwise high-quality dataset. Although considerable effort has been placed on assuring the quality of the peptide identification with respect to spectral processing, to date quality assessment of the subsequent peptide abundance data matrix has been limited to a subjective visual inspection of run-by-run correlation or individual peptide components. Identifying statistical outliers is a critical step in the processing of proteomics data as many of the downstream statistical analyses [e.g. analysis of variance (ANOVA)] rely upon accurate estimates of sample variance, and their results are influenced by extreme values.>Results: We describe a novel multivariate statistical strategy for the identification of LC-MS runs with extreme peptide abundance distributions. Comparison with current method (run-by-run correlation) demonstrates a significantly better rate of identification of outlier runs by the multivariate strategy. Simulation studies also suggest that this strategy significantly outperforms correlation alone in the identification of statistically extreme liquid chromatography-mass spectrometry (LC-MS) runs.>Availability: >Contact: >Supplementary information: is available at Bioinformatics online.
机译:>动机:在分析差异化肽峰强度(即丰度测量)时,具有较差质量的肽丰度数据的LC-MS分析可能会偏向下游的统计分析,因此可能会对高质量的数据集进行生物学解释。尽管已投入大量精力来确保肽段鉴定在光谱处理方面的质量,但迄今为止,后续肽丰度数据矩阵的质量评估仅限于主观目视检查逐次运行相关性或单个肽组分。与许多下游统计分析一样,识别蛋白质组统计数据是蛋白质组学数据处理中的关键步骤。方差分析(ANOVA)]依赖于样本方差的准确估计,其结果受极值的影响。>结果:我们描述了一种新颖的多元统计策略,用于识别极值LC-MS运行肽丰度分布。与当前方法(逐次运行相关性)的比较表明,通过多元策略可以更好地识别异常运行。仿真研究还表明,在确定统计极限液相色谱-质谱联用(LC-MS)运行时,该策略的性能明显优于单独的相关性。>可用性: >联系方式: >补充信息:可从在线生物信息学获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号