【24h】

An Automata Approach to Match Gapped Sequence Tags Against Protein Database

机译:一种针对蛋白质数据库匹配空缺序列标签的自动机方法

获取原文
获取原文并翻译 | 示例

摘要

Tandem mass spectrometry (MS/MS) is the most important method for the peptide and protein identification. One approach to interpret the MS/MS data is de novo sequencing, which is becoming more and more accurate and important. However De novo sequencing usually can only confidently determine partial sequences, while the undetermined parts are represented by "mass gaps". We call such a partially determined sequence a gapped sequence tag. When a gapped sequence tag is searched in a database for protein identification, the determined parts should match the database sequence exactly, while each mass gap should match a substring of amino acids whose masses total up to the value of the mass gap. In such a case, the standard string matching algorithm does not work any more. In this paper, we present a new efficient algorithm to find the matches of gapped sequence tags in a protein database.
机译:串联质谱(MS / MS)是鉴定肽和蛋白质的最重要方法。从头测序是解释MS / MS数据的一种方法,这种方法变得越来越准确和重要。但是,从头测序通常只能确定地确定部分序列,而未确定的部分则由“质量缺口”表示。我们称这种部分确定的序列为缺口序列标签。在数据库中搜索有空位的序列标签以进行蛋白质鉴定时,确定的部分应与数据库序列完全匹配,而每个质量缺口应匹配一个氨基酸子串,其氨基酸的总和等于质量缺口的值。在这种情况下,标准字符串匹配算法不再起作用。在本文中,我们提出了一种新的有效算法,可以在蛋白质数据库中找到缺口序列标签的匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号