An Automata Approach to Match Gapped Sequence Tags Against Protein Database

机译：一种针对蛋白质数据库匹配空缺序列标签的自动机方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Tandem mass spectrometry (MS/MS) is the most important method for the peptide and protein identification. One approach to interpret the MS/MS data is de novo sequencing, which is becoming more and more accurate and important. However De novo sequencing usually can only confidently determine partial sequences, while the undetermined parts are represented by "mass gaps". We call such a partially determined sequence a gapped sequence tag. When a gapped sequence tag is searched in a database for protein identification, the determined parts should match the database sequence exactly, while each mass gap should match a substring of amino acids whose masses total up to the value of the mass gap. In such a case, the standard string matching algorithm does not work any more. In this paper, we present a new efficient algorithm to find the matches of gapped sequence tags in a protein database.

机译：串联质谱（MS / MS）是鉴定肽和蛋白质的最重要方法。从头测序是解释MS / MS数据的一种方法，这种方法变得越来越准确和重要。但是，从头测序通常只能确定地确定部分序列，而未确定的部分则由“质量缺口”表示。我们称这种部分确定的序列为缺口序列标签。在数据库中搜索有空位的序列标签以进行蛋白质鉴定时，确定的部分应与数据库序列完全匹配，而每个质量缺口应匹配一个氨基酸子串，其氨基酸的总和等于质量缺口的值。在这种情况下，标准字符串匹配算法不再起作用。在本文中，我们提出了一种新的有效算法，可以在蛋白质数据库中找到缺口序列标签的匹配。

著录项

来源
《International Conference on Implementation and Application of Automata(CIAA 2004); 20040722-24; Kingston(CA)》|2004年|P.167-177|共11页
会议地点 Kingston(CA)
作者
Yonghua Han; Bin Ma; Kaizhong Zhang;
展开▼
作者单位

Department of Computer Science, University of Western Ontario, London Ontario, N6A 5B7 Canada;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动控制、自动控制系统;
关键词

相似文献

外文文献
中文文献
专利

1. AN AUTOMATA APPROACH TO MATCH GAPPED SEQUENCE TAGS AGAINST PROTEIN DATABASE [J] . YONGHUA HAN, BIN MA, KAIZHONG ZHANG International Journal of Foundations of Computer Science . 2005,第3期

机译：一种针对蛋白质数据库匹配缺口序列标签的自动方法
2. On-line capillary liquid chromatography tandem mass spectrometry on an ion trap/reflectron time-of-flight mass spectrometer using the sequence tag database search approach for peptide sequencing and protein identification [J] . Huang PQ., Parus S., Lubman DM., Journal of the American Society for Mass Spectrometry . 2000,第2期

机译：使用序列标签数据库搜索方法在离子阱/反射飞行时间质谱仪上进行在线毛细管液相色谱串联质谱法，用于肽段测序和蛋白质鉴定
3. A comprehensive approach for establishment of the platform to analyze functions of KIAA proteins II: public release of inaugural version of InGaP database containing gene/protein expression profiles for 127 mouse KIAA genes/proteins. [J] . Koga H, Yuasa S, Nagase T, DNA research: an international journal for rapid publication of reports on genes and genomes . 2004,第4期

机译：建立用于分析KIAA蛋白功能的平台的综合方法II：公开发布InGaP数据库的首个版本，其中包含127个小鼠KIAA基因/蛋白的基因/蛋白表达谱。
4. An Automata Approach to Match Gapped Sequence Tags Against Protein Database [C] . Yonghua Han, Bin Ma, Kaizhong Zhang International Conference on Implementation and Application of Automata . 2005

机译：一种匹配蛋白质数据库的喷涂序列标记的自动机方法
5. Isolation of cDNAS of genes involved in programmed cell death and construction of an expressed sequence tag database for Aponogeton madagascariensis. [D] . Rantong, Gaolathe. 2010

机译：涉及程序性细胞死亡的基因的cDNAS的分离和马达加蓬（Aponogeton madagascariensis）表达序列标签数据库的构建。
6. Helminth secretome database (HSD): a collection of helminth excretory/secretory proteins predicted from expressed sequence tags (ESTs) [O] . Gagan Garg, Shoba Ranganathan 2012

机译：蠕虫分泌组数据库（HSD）：从表达的序列标签（EST）预测的蠕虫分泌/分泌蛋白集合
7. Updated catalogue of homologues to human disease-related proteins in the yeast genome1Notation: nucleotide and protein sequences identifiers are given as DATABASE:IDENTIFIER where the codes for the corresponding data bases are SW (for SwissProt), EMBL, TREMBL, SPTREMBL, PIR, and OMIM. The corresponding sequences can be retrieved through the web using SRS (T. Etzold, http://www.ebi.ac.uk/srs/srsc), and the OMIM data base (B. Brylawski, http://www.ncbi.nlm.nih.gov/Omim/). All yeast sequences are tagged by their MIPS gene identifier (Yxynnnz, where x indicates chromosome, y stands for the arm, nnn is a numeral, and z indicates the direction of translation).1 [O] . Andrade Miguel A, Sander Chris, Valencia Alfonso 1998

机译：酵母基因组中与人类疾病相关的蛋白质的同源物更新目录1注释：核苷酸和蛋白质序列的标识符以DATABASE：IDENTIFIER给出，其中相应数据库的代码为SW（SwissProt），EMBL，TREMBL，SPTREMBL，PIR和OMIM。可以使用SRS（T。Etzold，http：//www.ebi.ac.uk/srs/srsc）和OMIM数据库（B. Brylawski，http：//www.ncbi）通过网络检索相应的序列。 .nlm.nih.gov / Omim /）。所有酵母序列均通过其MIPS基因标识符进行标记（Yxynnnz，其中x表示染色体，y表示臂，nnn是数字，z表示翻译方向）。1

An Automata Approach to Match Gapped Sequence Tags Against Protein Database

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅