...
首页> 外文期刊>Journal of proteome research >pClean: An Algorithm To Preprocess High-Resolution Tandem Mass Spectra for Database Searching
【24h】

pClean: An Algorithm To Preprocess High-Resolution Tandem Mass Spectra for Database Searching

机译:PCLean:一种用于预处理高分辨率串联质谱的算法,用于数据库搜索

获取原文
获取原文并翻译 | 示例

摘要

Database searches of MS/MS spectra are the main approach to peptide/protein identification in proteomics. Since most database search engines only utilize a small portion of the original MS/MS signals for peptide detection, how to improve the quality of MS/MS signals is a primary concern for enhancement of the peptide/protein identification rate. A fundamental issue is that some noise MS signals, informative or uninformative, have to be filtered out prior to database searching. Herein, an integrative preprocessing algorithm was designed, termed pClean, which incorporates three modules to preprocess MS/MS spectra, such as the removal of isobaric-labeling related ions, the reduction in isotopic peaks, the deconvolution of ions with higher charges, and the clearance of uninformative MS/MS signals. In contrast to the currently available approaches to MS/MS data preprocessing, pClean enables treatment of MS/MS spectra with high mass accuracy and favors filtering for the labeling or nonlabeling of peptides. Data sets at various scales gained from mass spectrometers with high resolution were used to assess the quality of peptides identified after pClean treatment and to compare the pClean improvement with those of other software programs. On the basis of the analysis of peptides identified and the Mascot ion score, pClean was proven to be effective in the removal of mass spectral noise and the reduction of random matching. Compared with other software programs, pClean appeared to be beneficial in terms of preprocessing performances for the enhancement of confidence scores and the increase in peptides identified. pClean is available at https://github.com/AimeeD90/pClean_release.
机译:MS / MS光谱的数据库搜索是蛋白质组学中肽/蛋白质鉴定的主要方法。由于大多数数据库搜索引擎仅利用原始MS / MS信号的一小部分进行肽检测,因此如何提高MS / MS信号的质量是提高肽/蛋白质识别率的主要问题。基本问题是,必须在数据库搜索之前过滤一些噪声MS信号,信息或不知情,但必须赘肉。在此,设计了一种综合预处理算法,其被称为PCLean,其将三个模块包含在预处理MS / MS光谱中,例如去除异瓣标记相关离子,同位素峰的还原,具有较高电荷的离子的去卷积,以及具有较高电荷的离子卷积,以及清除未表知MS / MS信号。与目前可用的MS / MS数据预处理的方法相比,PCLean能够以高质量准确度处理MS / MS光谱,并有利于过滤肽的标记或非标记。从具有高分辨率的质谱仪获得的各种刻度的数据集用于评估PCLean治疗后鉴定的肽的质量,并与其他软件程序的PCLean改进进行比较。在鉴定的肽和吉祥物离子评分的分析的基础上,被证明在去除质谱噪声和随机匹配的减少方面是有效的。与其他软件程序相比,PCLean在预处理性能方面似乎有益,以提高置信评分和鉴定的肽的增加。 pclean在https://github.com/aimeed90/pclean_release提供。

著录项

  • 来源
    《Journal of proteome research》 |2019年第9期|共10页
  • 作者单位

    Chinese Acad Sci Beijing Inst Genom CAS Key Lab Genome Sci &

    Informat Beijing 100101 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Chinese Acad Sci Beijing Inst Genom CAS Key Lab Genome Sci &

    Informat Beijing 100101 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Baylor Coll Med Dept Mol &

    Human Genet Houston TX 77030 USA;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    BGI Shenzhen Shenzhen 518083 Peoples R China;

    Chinese Acad Sci Beijing Inst Genom CAS Key Lab Genome Sci &

    Informat Beijing 100101 Peoples R China;

    Chinese Acad Sci Beijing Inst Genom CAS Key Lab Genome Sci &

    Informat Beijing 100101 Peoples R China;

    Chinese Acad Sci Beijing Inst Genom CAS Key Lab Genome Sci &

    Informat Beijing 100101 Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 分子生物学;蛋白质;
  • 关键词

    pClean; proteomics; bioinformatics; MS/MS; preprocessing; database search;

    机译:Pclean;蛋白质组学;生物信息学;MS / MS;预处理;数据库搜索;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号