首页> 外文期刊>Molecular & cellular proteomics: MCP >A bioinformatics workflow for variant peptide detection in shotgun proteomics.
【24h】

A bioinformatics workflow for variant peptide detection in shotgun proteomics.

机译:用于shot弹枪蛋白质组学中变体肽检测的生物信息学工作流程。

获取原文
获取原文并翻译 | 示例
           

摘要

Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from been identified. Including known coding variations into protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow on data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines including Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics.
机译:gun弹枪蛋白质组学数据分析通常依赖于数据库搜索。但是,常用的蛋白质序列数据库不包含有关蛋白质变体的信息,因此会阻止鉴定变体肽和蛋白质。在蛋白质序列数据库中包含已知的编码变异可以帮助缓解此问题。基于我们最近发布的人类癌症蛋白质组变异数据库,我们创建了一个蛋白质序列数据库,可全面注释从癌症蛋白质组变异数据库中收集的数千种与癌症相关的编码变异以及从单核苷酸多态性数据库(dbSNP)获得的非癌特异性编码变异。 )。然后,使用该数据库,我们开发了一种数据分析工作流程,用于在shot弹枪蛋白质组学中鉴定变体多肽。通过改进的错误发现率估计方法解决了错误肯定变异识别的高风险。对结肠直肠癌细胞系SW480,RKO和HCT-116的分析揭示了总共81种含有非癌特异性或癌相关变异的肽。通过基因组测序确定了从81个随机选择的26个变体中的23个。我们进一步将工作流程应用于来自三个大肠直肠肿瘤标本的数据集。总共检测到204种不同的变异肽,其中5种带有已知的癌症相关突变。每个人都显示出与癌症相关的突变的特定模式,这表明这种信息可用于个性化医学。工作流的兼容性已经通过四个流行的数据库搜索引擎(包括Sequest,Mascot,X!Tandem和MyriMatch)进行了测试。总而言之,我们开发了一种工作流程,可以有效地利用现有的基因组数据来实现蛋白质组学中变体多肽的检测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号