...
首页> 外文期刊>Journal of proteomics >A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics
【24h】

A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics

机译:shot弹枪蛋白质组学中肽和蛋白质鉴定的计算方法和错误率估计程序的概述

获取原文
获取原文并翻译 | 示例
           

摘要

This manuscript provides a comprehensive review of the peptide and protein identification process using tandem mass spectrometry (MS/MS) data generated in shotgun proteomic experiments. The commonly used methods for assigning peptide sequences to MS/MS spectra are critically discussed and compared, from basic strategies to advanced multi-stage approaches. A particular attention is paid to the problem of false-positive identifications. Existing statistical approaches for assessing the significance of peptide to spectrum matches are surveyed, ranging from single-spectrum approaches such as expectation values to global error rate estimation procedures such as false discovery rates and posterior probabilities. The importance of using auxiliary discriminant information (mass accuracy, peptide separation coordinates, digestion properties, and etc.) is discussed, and advanced computational approaches for joint modeling of multiple sources of information are presented. This review also includes a detailed analysis of the issues affecting the interpretation of data at the protein level, including the amplification of error rates when going from peptide to protein level, and the ambiguities in inferring the identifies of sample proteins in the presence of shared peptides. Commonly used methods for computing protein-level confidence scores are discussed in detail. The review concludes with a discussion of several outstanding computational issues.
机译:该手稿使用在shot弹枪蛋白质组实验中生成的串联质谱(MS / MS)数据对多肽和蛋白质的鉴定过程进行了全面回顾。从基本策略到高级的多阶段方法,都对批判性地讨论和比较了将肽序列分配给MS / MS谱的常用方法。特别注意假阳性识别问题。调查了评估肽对光谱匹配的重要性的现有统计方法,范围从单光谱方法(例如期望值)到全局错误率估计程序(例如错误发现率和后验概率)。讨论了使用辅助判别信息(质量准确度,肽分离坐标,消化特性等)的重要性,并提出了用于多种信息源联合建模的高级计算方法。这篇综述还对影响蛋白质水平数据解释的问题进行了详细分析,包括从肽水平转换为蛋白质水平时错误率的放大,以及在存在共享肽时推断样品蛋白质鉴定的含糊之处。详细讨论了计算蛋白质水平置信度得分的常用方法。审查结束时讨论了几个突出的计算问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号