...
首页> 外文期刊>Proteomics >Maximizing the sensitivity and reliability of peptide identification in large-scale proteomic experiments by harnessing multiple search engines
【24h】

Maximizing the sensitivity and reliability of peptide identification in large-scale proteomic experiments by harnessing multiple search engines

机译:通过利用多个搜索引擎,在大规模蛋白质组实验中最大化肽鉴定的灵敏度和可靠性

获取原文
获取原文并翻译 | 示例
           

摘要

Despite recent advances in qualitative proteomics, the automatic identification of peptides with optimal sensitivity and accuracy remains a difficult goal. To address this deficiency, a novel algorithm, Multiple Search Engines, Normalization and Consensus is described. The method employs six search engines and a re-scoring engine to search MS/MS spectra against protein and decoy sequences. After the peptide hits from each engine are normalized to error rates estimated from the decoy hits, peptide assignments are then deduced using a minimum consensus model. These assignments are produced in a series of progressively relaxed false-discovery rates, thus enabling a comprehensive interpretation of the data set. Additionally, the estimated false-discovery rate was found to have good concordance with the observed falsepositive rate calculated from known identities. Benchmarking against standard proteins data sets (ISBv1, sPRG2006) and their published analysis, demonstrated that the Multiple Search Engines, Normalization and Consensus algorithm consistently achieved significantly higher sensitivity in peptide identifications, which led to increased or more robust protein identifications in all data sets compared with prior methods. The sensitivity and the false-positive rate of peptide identification exhibit an inverse-proportional and linear relationship with the number of participating search engines.
机译:尽管最近在定性蛋白质组学方面取得了进展,但是以最佳的灵敏度和准确度自动鉴定肽仍然是一个困难的目标。为了解决这个缺陷,描述了一种新颖的算法,多个搜索引擎,标准化和共识。该方法采用六个搜索引擎和一个重新评分引擎来针对蛋白质和诱饵序列搜索MS / MS光谱。将每个引擎的肽命中率标准化为根据诱饵命中估计的错误率后,然后使用最小共有模型推导肽分配。这些分配是通过一系列逐步放松的错误发现率产生的,因此可以对数据集进行全面的解释。此外,发现估计的错误发现率与根据已知身份计算出的观察到的错误肯定率具有良好的一致性。根据标准蛋白质数据集(ISBv1,sPRG2006)进行的基准测试及其发表的分析表明,多重搜索引擎,标准化和共识算法始终在肽段鉴定中实现了显着更高的敏感性,从而导致与所有数据集中相比,蛋白质鉴定的数量增加或更加可靠与先前的方法。肽鉴定的敏感性和假阳性率与参与搜索引擎的数量呈反比例和线性关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号