Towards Centralized MS/MS Spectra Preprocessing: An Empirical Evaluation of Peptides Search Engines using Ground Truth Datasets

机译：迈向集中式MS / MS光谱预处理：使用地面真相数据集的肽搜索引擎的经验评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

several peptides search engines have been developed in the recent decades. Most of the time and for the same inputs, different search enginesâ€™ result in different peptides were identified, which can confuse the stakeholders in the field of proteomics. The massive amount of generated spectra by high throughput spectrometers adds another challenge which handicaps the current search engines. This motivates the researchers to evaluate the combination of several search engines. Several studies provided ensemble solutions over shared and distributed computing environments for reliable results. However, the massive amount of MS/MS spectra is a cumbersome traffic over the systemsâ€™ networks. This issue directly impacts the searching performance and also adds unnecessary extra costs (computing, storage, network traffic) if cloud cluster is being used. The main question of this paper is: Can we build a central MS/MS spectra preprocessing for semantically different protein search engines? We evaluate different statistical reduction techniques using four popular protein search engines. In order to fairly evaluate the results, we build ground truth unanimous-based datasets for two different species; yeast and human. Our techniques result in significant peak reduction, where only around 30% of the spectra peaks are enough to report reliable identifications from the used search engines in this study.

机译：在最近的几十年中，已经开发了几种肽搜索引擎。在大多数情况下，对于相同的输入，会识别出不同的搜索引擎导致产生不同的肽，这可能会使蛋白质组学领域的利益相关者感到困惑。高通量光谱仪产生的大量光谱增加了另一个挑战，这阻碍了当前的搜索引擎。这激励研究人员评估几种搜索引擎的组合。多项研究提供了在共享和分布式计算环境上的集成解决方案，以获得可靠的结果。但是，大量的MS / MS频谱是系统网络上繁琐的流量。如果正在使用云群集，此问题将直接影响搜索性能，并且还会增加不必要的额外成本（计算，存储，网络流量）。本文的主要问题是：我们可以为语义上不同的蛋白质搜索引擎构建中央MS / MS谱图预处理吗？我们使用四个流行的蛋白质搜索引擎评估不同的统计归约技术。为了公平地评估结果，我们为两个不同的物种建立了基于地面一致数据的数据集。酵母和人。我们的技术可显着减少峰，在此研究中，只有大约30％的光谱峰足以报告使用的搜索引擎提供的可靠标识。

著录项

来源
《IEEE International Conference on Bioinformatics and Bioengineering》|2017年|194-199|共6页
会议地点
作者
Majdi Maabreh; Ajay Gupta; Izzat Alsmadi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Search engines; Peptides; Proteins; Tools; Proteomics; Databases; Buildings;

机译：搜索引擎;肽;蛋白质;工具;蛋白质组学;数据库;建筑物;

相似文献

外文文献
中文文献
专利

1. Comparison of different search engines using validated MS/MS test datasets [J] . Boutilier K, Ross M, Podtelejnikov AV, Analytica chimica acta . 2005,第1期

机译：使用经过验证的MS / MS测试数据集比较不同的搜索引擎
2. Mass spectrum sequential subtraction speeds up searching large peptide MS/MS spectra datasets against large nucleotide databases for proteogenomics. [J] . Helmy M, Sugiyama N, Tomita M, Genes to cells : . 2012,第8期

机译：质谱序列减法可以加快针对大型核苷酸数据库的大型肽MS / MS质谱图数据集的蛋白质组学研究。
3. When less can yield more - Computational preprocessing of MS/MS spectra for peptide identification [J] . Proteomics . 2009,第21期

机译：当更少时可以得到更多-用于肽鉴定的MS / MS光谱的计算预处理
4. Towards Centralized MS/MS Spectra Preprocessing: An Empirical Evaluation of Peptides Search Engines using Ground Truth Datasets [C] . Majdi Maabreh, Ajay Gupta, Izzat Alsmadi IEEE International Conference on Bioinformatics and Bioengineering . 2017

机译：用于集中的MS / MS Spectra预处理：使用地面真理数据集的肽搜索引擎的实证评估
5. Data mining of peptide MS/MS spectra to elucidate gas phase peptide dissociation mechanisms and improve protein identification. [D] . Huang, Yingying. 2005

机译：肽MS / MS质谱图的数据挖掘可阐明气相肽的解离机理并改善蛋白质鉴定。
6. Identification and Characterization of Disulfide Bonds in Proteins and Peptides from Tandem MS Data by Use of the MassMatrix MS/MS Search Engine [O] . Hua Xu, Liwen Zhang, Michael A. Freitas -1

机译：通过使用MassMatrix MS / MS搜索引擎从串联MS数据中鉴定和表征蛋白质和多肽中的二硫键
7. Mass spectrum sequential subtraction speeds up searching large peptide MS/MS spectra datasets against large nucleotide databases for proteogenomics [O] . Mohamed Helmy, Naoyuki Sugiyama, Masaru Tomita, 2012

机译：质谱序列减法速度加速搜索大型肽MS / MS光谱数据集针对蛋白质组织的大核苷酸数据库

Towards Centralized MS/MS Spectra Preprocessing: An Empirical Evaluation of Peptides Search Engines using Ground Truth Datasets

摘要

著录项

相似文献

相关主题

期刊订阅