首页> 外文期刊>Journal of proteomics >RT-SVR+q: a strategy for post-Mascot analysis using retention time and q value metric to improve peptide and protein identifications.
【24h】

RT-SVR+q: a strategy for post-Mascot analysis using retention time and q value metric to improve peptide and protein identifications.

机译:RT-SVR + q:Mascot分析的一种策略,使用保留时间和q值度量来改善肽和蛋白质的鉴定。

获取原文
获取原文并翻译 | 示例
           

摘要

Shotgun proteomics commonly utilizes database search like Mascot to identify proteins from tandem MS/MS spectra. False discovery rate (FDR) is often used to assess the confidence of peptide identifications. However, a widely accepted FDR of 1% sacrifices the sensitivity of peptide identification while improving the accuracy. This article details a machine learning approach combining retention time based support vector regressor (RT-SVR) with q value based statistical analysis to improve peptide and protein identifications with high sensitivity and accuracy. The use of confident peptide identifications as training examples and careful feature selection ensures high R values (>0.900) for all models. The application of RT-SVR model on Mascot results (p=0.10) increases the sensitivity of peptide identifications. q Value, as a function of deviation between predicted and experimental RTs (ΔRT), is used to assess the significance of peptide identifications. We demonstrate that the peptide and protein identifications increase by up to 89.4% and 83.5%, respectively, for a specified q value of 0.01 when applying the method to proteomic analysis of the natural killer leukemia cell line (NKL). This study establishes an effective methodology and provides a platform for profiling confident proteomes in more relevant species as well as a future investigation of accurate protein quantification.
机译:gun弹枪蛋白质组学通常利用诸如Mascot之类的数据库搜索来从串联MS / MS光谱中识别蛋白质。错误发现率(FDR)通常用于评估肽鉴定的可信度。但是,广泛接受的1%FDR牺牲了肽鉴定的敏感性,同时提高了准确性。本文详细介绍了一种将基于保留时间的支持向量回归(RT-SVR)与基于q值的统计分析相结合的机器学习方法,从而以高灵敏度和准确性改善了肽和蛋白质的鉴定。使用可靠的肽段鉴定作为训练示例和仔细的特征选择可确保所有模型的R值均高(> 0.900)。 RT-SVR模型在吉祥物结果上的应用(p = 0.10)提高了肽段鉴定的灵敏度。 q值是预测的和实验的RT之间的偏差的函数(ΔRT),用于评估肽鉴定的重要性。我们证明,当将该方法应用于天然杀伤性白血病细胞系(NKL)的蛋白质组学分析时,对于指定的q值为0.01,肽和蛋白质的鉴定分别增加了89.4%和83.5%。这项研究建立了一种有效的方法,并提供了一个平台,用于分析更多相关物种中的可靠蛋白质组,以及对准确蛋白质定量的进一步研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号