首页> 外文会议>Intelligent Systems for Molecular Biology >Proteome coverage prediction-with-infinite Markov models
【24h】

Proteome coverage prediction-with-infinite Markov models

机译:蛋白质组覆盖预测 - 带无限的马尔可夫模型

获取原文

摘要

Motivation: Liquid chromatography tandem mass spectrometry (LC-MS/MS) is the 'Predominant method to comprehensively characterize complex protein mixtures such as samples from prefractionated or complete proteomes. In order to maximize proteome coverage for, the studied sample, i.e. identify as many traceable proteins as possible, LC-MS/MS experiments are typically repeated extensively and the results combined. 'Proteome coverage prediction is the task of estimating the number of peptide discoveries of future LC-MS/MS experiments. Proteome coverage prediction is important to enhance the design of efficient proteomics studies. To date, there does not exist any method to reliably estimate the increase of proteomecoverage at an early stage. Results: We propose an extended infinite Markov model DiriSim to extrapolate the progression of proteome coverage based on a small number of already performed LC-MS/MS experiments. The method explicitly accounts for the uncertainty of peptide identifications. We tested DiriSim on a set of 37 LC-MS/MS experiments of a complete proteome sample and demonstrated that DiriSim correctly predicts the coverage progression already from a small subset of experiments. The predicted progression enabled us to specify maximal coverage for the lest sample. We demonstrated that quality requirements on the final proteome map impose an upper bound on the number' of useful experiment repetitions and limit the achievable proteome coverage.
机译:动机:液相色谱串联质谱(LC-MS / MS)是综合表征复杂蛋白质混合物的主要方法,例如从预制物或完全蛋白质组的样品。为了最大化蛋白质组覆盖,所研究的样品,即鉴定尽可能多的可追踪蛋白质,通常会广泛地重复LC-MS / MS实验,并将结果组合在一起。 '蛋白质组覆盖预测是估算未来LC-MS / MS实验的肽发现数的任务。蛋白质组覆盖预测对于提高有效蛋白质组学研究的设计是重要的。迄今为止,不存在任何方法可以在早期估计早期癌症的增加。结果:我们提出了一个扩展的无限马尔可夫模型Dirisim,以基于少量进行的LC-MS / MS实验外推外推出蛋白质组覆盖的进展。该方法明确地占肽鉴定的不确定性。我们在完整蛋白质组样品的一组37 LC-MS / MS实验上测试了DirIsim,并证明DirIsim正确预测已经来自小实验的小型的覆盖进展。预测的进展使我们能够为Lest样本指定最大覆盖范围。我们证明,最终蛋白质组地图上的质量要求施加了有用的实验重复的数量的上限,并限制了可实现的蛋白质组覆盖率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号